The global issue of water quality has led to the use of machine learning models, like ANN and SVM, to predict water potability. However, these models can be complex and resource-intensive. This research aimed to find a simpler, more efficient model for water quality prediction.
The mountain chain of the Western Ghats on the Indian peninsula, a UNESCO World Heritage site, is home to about 200 frog species, 89 of which are endemic. Distinctive to each frog species, their vocalizations can be used for species recognition. Manually surveying frogs at night during the rain in elephant and big cat forests is difficult, so being able to autonomously record ambient soundscapes and identify species is essential. An effective machine learning (ML) species classifier requires substantial training data from this area. The goal of this study was to assess data augmentation techniques on a dataset of frog vocalizations from this region, which has a minimal number of audio recordings per species. Consequently, enhancing an ML model’s performance with limited data is necessary. We analyzed the effects of four data augmentation techniques (Time Shifting, Noise Injection, Spectral Augmentation, and Test-Time Augmentation) individually and their combined effect on the frog vocalization data and the public environmental sounds dataset (ESC-50). The effect of combined data augmentation techniques improved the model's relative accuracy as the size of the dataset decreased. The combination of all four techniques improved the ML model’s classification accuracy on the frog calls dataset by 94%. This study established a data augmentation approach to maximize the classification accuracy with sparse data of frog call recordings, thereby creating a possibility to build a real-world automated field frog species identifier system. Such a system can significantly help in the conservation of frog species in this vital biodiversity hotspot.
Additive manufacturing (AM) is transforming the production of complex metal parts, but challenges like internal cracking can arise, particularly in critical sectors such as aerospace and automotive. Traditional methods to assess cracking susceptibility are costly and time-consuming, prompting the use of machine learning (ML) for more efficient predictions. This study developed a multi-model ML pipeline that predicts solidification cracking susceptibility (SCS) more accurately by considering secondary alloy properties alongside composition, with Random Forest models showing the best performance, highlighting a promising direction for future research into SCS quantification.
Rechargeable batteries are playing an increasingly prominent role in our lives due to the ongoing transition from fossil energy sources to green energy. The purpose of this study was to investigate variables that impact the effectiveness of rechargeable batteries. Alkaline (non-rechargeable) and rechargeable batteries share common features that are critical for the operation of a battery. The positive and negative electrodes, also known as the cathode and anode, are where the energy of the battery is stored. The electrolyte is what facilitates the transfer of cations and anions in a battery to generate electricity. Due to the importance of these components, we felt that a systematic investigation examining the surface area of the cathode and anode as well the impact of electrolytes with different properties on battery performance was justified. Utilizing a copper cathode and aluminum anode coupled with a water in salt electrolyte, a model rechargeable battery system was developed to test two hypotheses: a) increasing the contact area between the electrodes and electrolyte would improve battery capacity, and b) more soluble salt-based electrolytes would improve battery capacity. After soaking in an electrolyte solution, the battery was charged and the capacity, starting voltage, and ending voltage of each battery were measured. The results of this study supported our hypothesis that larger anode/cathodes surface areas and more ionic electrolytes such as sodium chloride, potassium chloride and potassium sulfate resulted in superior battery capacity. Incorporating these findings can help maximize the efficiency of commercial rechargeable batteries.
Coronary artery disease, the leading cause of death worldwide, results from cholesterol build-up in coronary arteries, limiting blood and oxygen flow to the heart. This study investigated the use of gold and silver nanoparticles coated with aspirin and activated by near-infrared light to improve blood flow in a clogged artery model. The nanoparticles increased simulated blood flow rates, demonstrating potential as a less invasive and more targeted treatment for cardiovascular disease.
The purpose of our study was to examine the correlation of glycosylated hemoglobin (HbA1c), blood pressure (BP) readings, and lipid levels with retinopathy. Our main hypothesis was that poor glycemic control, as evident by high HbA1c levels, high blood pressure, and abnormal lipid levels, causes an increased risk of retinopathy. We identified the top two features that were most important to the model as age and HbA1c. This indicates that older patients with poor glycemic control are more likely to show presence of retinopathy.
Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.
Zoos offer educational and scientific advantages but face high maintenance costs and challenges in animal care due to diverse species' habits. Challenges include tracking animals, detecting illnesses, and creating suitable habitats. We developed a deep learning framework called SmartZoo to address these issues and enable efficient animal monitoring, condition alerts, and data aggregation. We discovered that the data generated by our model is closer to real data than random data, and we were able to demonstrate that the model excels at generating data that resembles real-world data.
Here seeking to develop a method to diagnose, hypertrophic cardiomyopathy which can cause sudden cardiac death, the authors investigated the use of a convolutional neural network (CNN) and long short-term memory (LSTM) models to classify cardiac magnetic resonance and heart electrocardiogram scans. They found that the CNN model had a higher accuracy and precision and better other qualities, suggesting that machine learning models could be valuable tools to assist physicians in the diagnosis of hypertrophic cardiomyopathy.
The global mental health crisis has led to increased substance abuse among youth. Prescription drug abuse causes approximately 115 American deaths daily. Understanding intergenerational transmission of substance abuse is complex due to lengthy human studies and socioeconomic variables. Recent FDA guidelines mandate abuse liability testing for neuro-active drugs but overlook intergenerational transfer. Brown planaria, due to their nervous system development similarities with mammals, offer a novel model.