Browse Articles

Machine learning for retinopathy prediction: Unveiling the importance of age and HbA1c with XGBoost

Ramachandran et al. | Sep 05, 2024

Machine learning for retinopathy prediction: Unveiling the importance of age and HbA1c with XGBoost

The purpose of our study was to examine the correlation of glycosylated hemoglobin (HbA1c), blood pressure (BP) readings, and lipid levels with retinopathy. Our main hypothesis was that poor glycemic control, as evident by high HbA1c levels, high blood pressure, and abnormal lipid levels, causes an increased risk of retinopathy. We identified the top two features that were most important to the model as age and HbA1c. This indicates that older patients with poor glycemic control are more likely to show presence of retinopathy.

Read More...

Predicting smoking status based on RNA sequencing data

Yang et al. | Aug 30, 2024

Predicting smoking status based on RNA sequencing data
Image credit: Yang and Stanley 2024

Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.

Read More...

SmartZoo: A Deep Learning Framework for an IoT Platform in Animal Care

Ji et al. | Aug 07, 2024

SmartZoo: A Deep Learning Framework for an IoT Platform in Animal Care

Zoos offer educational and scientific advantages but face high maintenance costs and challenges in animal care due to diverse species' habits. Challenges include tracking animals, detecting illnesses, and creating suitable habitats. We developed a deep learning framework called SmartZoo to address these issues and enable efficient animal monitoring, condition alerts, and data aggregation. We discovered that the data generated by our model is closer to real data than random data, and we were able to demonstrate that the model excels at generating data that resembles real-world data.

Read More...

Diagnosing hypertrophic cardiomyopathy using machine learning models on CMRs and EKGs of the heart

Kolluri et al. | Jul 29, 2024

Diagnosing hypertrophic cardiomyopathy using machine learning models on CMRs and EKGs of the heart
Image credit: Jesse Orrico

Here seeking to develop a method to diagnose, hypertrophic cardiomyopathy which can cause sudden cardiac death, the authors investigated the use of a convolutional neural network (CNN) and long short-term memory (LSTM) models to classify cardiac magnetic resonance and heart electrocardiogram scans. They found that the CNN model had a higher accuracy and precision and better other qualities, suggesting that machine learning models could be valuable tools to assist physicians in the diagnosis of hypertrophic cardiomyopathy.

Read More...

Enhancing marine debris identification with convolutional neural networks

Wahlig et al. | Apr 03, 2024

Enhancing marine debris identification with convolutional neural networks
Image credit: The authors

Plastic pollution in the ocean is a major global concern. Remotely Operated Vehicles (ROVs) have promise for removing debris from the ocean, but more research is needed to achieve full effectiveness of the ROV technology. Wahlig and Gonzales tackle this issue by developing a deep learning model to distinguish trash from the environment in ROV images.

Read More...

The non-nutritive sweeteners acesulfame potassium and neotame slow the regeneration rate of planaria

Russo et al. | Nov 29, 2023

The non-nutritive sweeteners acesulfame potassium and neotame slow the regeneration rate of planaria
Image credit: Russo et al. 2023

The consumption of sugar substitute non-nutritive sweeteners (NNS) has dramatically increased in recent years. Despite being advertised as a healthy alternative, NNS have been linked to adverse effects on the body, such as neurodegenerative diseases (NDs). In NDs, neural stem cell function is impaired, which inhibits neuron regeneration. The purpose of this study was to determine if the NNS acesulfame potassium (Ace-K) and neotame affect planaria neuron regeneration rates. Since human neurons may regenerate, planaria, organisms with extensive regenerative capabilities due to stem cells called neoblasts, were used as the model organism. The heads of planaria exposed to either a control or non-toxic concentrations of NNS were amputated. The posterior regions of the planaria were observed every 24 hours to see the following regeneration stages: (1) wound healing, (2) blastema development, (3) growth, and (4) differentiation. The authors hypothesized that exposure to the NNS would slow planaria regeneration rates. The time it took for the planaria in the Ace-K group and the neotame group to reach the second, third, and fourth regeneration stage was significantly greater than that of the control. The results of this study indicated that exposure to the NNS significantly slowed regeneration rates in planaria. This suggests that the NNS may adversely impact neoblast proliferation rates in planaria, implying that it could impair neural stem cell proliferation in humans, which plays a role in NDs. This study may provide insight into the connection between NNS, human neuron regeneration, and NDs.

Read More...

Fractal dimensions of crumpled paper

Zhou et al. | Aug 10, 2023

Fractal dimensions of crumpled paper
Image credit: Richard Dykes

Here, beginning from an interest in fractals, infinitely complex shapes. The authors investigated the fractal object that results from crumpling a sheet of paper. They determined its fractal dimension using continuous Chi-squared analysis, thereby testing and validating their model against the more conventional least squares analysis.

Read More...

Quantitative definition of chemical synthetic pathway complexity of organic compounds

Baranwal et al. | Jun 19, 2023

Quantitative definition of chemical synthetic pathway complexity of organic compounds

Irrespective of the final application of a molecule, synthetic accessibility is the rate-determining step in discovering and developing novel entities. However, synthetic complexity is challenging to quantify as a single metric, since it is a composite of several measurable metrics, some of which include cost, safety, and availability. Moreover, defining a single synthetic accessibility metric for both natural products and non-natural products poses yet another challenge given the structural distinctions between these two classes of compounds. Here, we propose a model for synthetic accessibility of all chemical compounds, inspired by the Central Limit Theorem, and devise a novel synthetic accessibility metric assessing the overall feasibility of making chemical compounds that has been fitted to a Gaussian distribution.

Read More...

Modeling and optimization of epidemiological control policies through reinforcement learning

Rao et al. | May 23, 2023

Modeling and optimization of epidemiological control policies through reinforcement learning

Pandemics involve the high transmission of a disease that impacts global and local health and economic patterns. Epidemiological models help propose pandemic control strategies based on non-pharmaceutical interventions such as social distancing, curfews, and lockdowns, reducing the economic impact of these restrictions. In this research, we utilized an epidemiological Susceptible, Exposed, Infected, Recovered, Deceased (SEIRD) model – a compartmental model for virtually simulating a pandemic day by day.

Read More...

Evaluating machine learning algorithms to classify forest tree species through satellite imagery

Gupta et al. | Mar 18, 2023

Evaluating machine learning algorithms to classify forest tree species through satellite imagery
Image credit: Sergei A

Here, seeking to identify an optimal method to classify tree species through remote sensing, the authors used a few machine learning algorithms to classify forest tree species through multispectral satellite imagery. They found the Random Forest algorithm to most accurately classify tree species, with the potential to improve model training and inference based on the inclusion of other tree properties.

Read More...