Articles | Journal of Emerging Investigators

Analyzing market dynamics and optimizing sales performance with machine learning

Kamat et al. | May 31, 2025

This study uses interpretable machine learning models, lasso and ridge regression with Shapley analysis, to identify key sales drivers for Corporación Favorita, Ecuador’s largest grocery chain. The results show that macroeconomic factors, especially labor force size, have the greatest impact on sales, though geographic and seasonal variables like city altitude and holiday proximity also play important roles. These insights can help businesses focus on the most influential market conditions to enhance competitiveness and profitability.

Forecasting air quality index: A statistical machine learning and deep learning approach

Pasula et al. | Feb 17, 2025

Here the authors investigated air quality forecasting in India, comparing traditional time series models like SARIMA with deep learning models like LSTM. The research found that SARIMA models, which capture seasonal variations, outperform LSTM models in predicting Air Quality Index (AQI) levels across multiple Indian cities, supporting the hypothesis that simpler models can be more effective for this specific task.

SOS-PVCase: A machine learning optimized lignin peroxidase with polyvinyl chloride (PVC) degrading properties

Ahuja et al. | Sep 30, 2024

The authors looked at the primary structure of lignin peroxidase in an attempt to identify mutations that would improve both the stability and solubility of the peroxidase protein. The goal is to engineer peroxidase enzymes that are stable to help break down polymers, such as PVC, into monomers that can be reused instead of going to landfills.

Diagnosing hypertrophic cardiomyopathy using machine learning models on CMRs and EKGs of the heart

Kolluri et al. | Jul 29, 2024

Here seeking to develop a method to diagnose, hypertrophic cardiomyopathy which can cause sudden cardiac death, the authors investigated the use of a convolutional neural network (CNN) and long short-term memory (LSTM) models to classify cardiac magnetic resonance and heart electrocardiogram scans. They found that the CNN model had a higher accuracy and precision and better other qualities, suggesting that machine learning models could be valuable tools to assist physicians in the diagnosis of hypertrophic cardiomyopathy.

The precision of machine learning models at classifying autism spectrum disorder in adults

Raj Kumar et al. | Jun 28, 2024

Autism spectrum disorder (ASD) is hard to correctly diagnose due to the very subjective nature of diagnosing it: behavior analysis. Due to this issue, we sought to find a machine learning-based method that diagnoses ASD without behavior analysis or helps reduce misdiagnosis.

Utilizing meteorological data and machine learning to predict and reduce the spread of California wildfires

Bilwar et al. | Jan 15, 2024

This study hypothesized that a machine learning model could accurately predict the severity of California wildfires and determine the most influential meteorological factors. It utilized a custom dataset with information from the World Weather Online API and a Kaggle dataset of wildfires in California from 2013-2020. The developed algorithms classified fires into seven categories with promising accuracy (around 55 percent). They found that higher temperatures, lower humidity, lower dew point, higher wind gusts, and higher wind speeds are the most significant contributors to the spread of a wildfire. This tool could vastly improve the efficiency and preparedness of firefighters as they deal with wildfires.

A novel approach for predicting Alzheimer’s disease using machine learning on DNA methylation in blood

Adami et al. | Sep 20, 2023

Here, recognizing the difficulty associated with tracking the progression of dementia, the authors used machine learning models to predict between the presence of cognitive normalcy, mild cognitive impairment, and Alzheimer's Disease, based on blood DNA methylation levels, sex, and age. With four machine learning models and two dataset dimensionality reduction methods they achieved an accuracy of 53.33%.

Differential privacy in machine learning for traffic forecasting

Vinay et al. | Dec 21, 2022

In this paper, we measured the privacy budgets and utilities of different differentially private mechanisms combined with different machine learning models that forecast traffic congestion at future timestamps. We expected the ANNs combined with the Staircase mechanism to perform the best with every value in the privacy budget range, especially with the medium high values of the privacy budget. In this study, we used the Autoregressive Integrated Moving Average (ARIMA) and neural network models to forecast and then added differentially private Laplacian, Gaussian, and Staircase noise to our datasets. We tested two real traffic congestion datasets, experimented with the different models, and examined their utility for different privacy budgets. We found that a favorable combination for this application was neural networks with the Staircase mechanism. Our findings identify the optimal models when dealing with tricky time series forecasting and can be used in non-traffic applications like disease tracking and population growth.

A novel CNN-based machine learning approach to identify skin cancers

Rao et al. | Nov 18, 2022

In this study, the authors developed and assessed the accuracy of a machine learning algorithm to identify skin cancers using images of biopsies.

Assessing and Improving Machine Learning Model Predictions of Polymer Glass Transition Temperatures

Ramprasad et al. | Mar 18, 2020

In this study, the authors test whether providing a larger dataset of glass transition temperatures (T_g) to train the machine-learning platform Polymer Genome would improve its accuracy. Polymer Genome is a machine learning based data-driven informatics platform for polymer property prediction and T_g is one property needed to design new polymers in silico. They found that training the model with their larger, curated dataset improved the algorithm's T_g, providing valuable improvements to this useful platform.

Browse Articles

Analyzing market dynamics and optimizing sales performance with machine learning

Forecasting air quality index: A statistical machine learning and deep learning approach

SOS-PVCase: A machine learning optimized lignin peroxidase with polyvinyl chloride (PVC) degrading properties

Diagnosing hypertrophic cardiomyopathy using machine learning models on CMRs and EKGs of the heart

The precision of machine learning models at classifying autism spectrum disorder in adults

Utilizing meteorological data and machine learning to predict and reduce the spread of California wildfires

A novel approach for predicting Alzheimer’s disease using machine learning on DNA methylation in blood

Differential privacy in machine learning for traffic forecasting

A novel CNN-based machine learning approach to identify skin cancers

Assessing and Improving Machine Learning Model Predictions of Polymer Glass Transition Temperatures

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags