Articles | Journal of Emerging Investigators

Correlating inlet gas composition to conversion efficiency in plasma-assisted landfill gas reforming

Kim et al. | Jun 28, 2025

The escalating crisis of climate change, driven by the accumulation of greenhouse gases from human activities, demands urgent and innovative solutions to curb rising global temperatures. Plasma-based methane (CH₄) and carbon dioxide (CO₂) reforming offers a promising pathway for carbon capture and the sustainable production of hydrogen fuel and syngas components. To advance this technology, particularly in terms of energy efficiency and selectivity, it is essential to enhance the conversion efficiencies of CO₂ and CH₄.

Exploring the Factors that Drive Coffee Ratings

Agarwal et al. | May 19, 2025

This study explores the factors that influence coffee quality ratings using data from the Coffee Quality Institute. Through a regression model based on gradient descent, the authors aimed to predict coffee ratings (total cup points) and hypothesized that sweetness and the coffee producer would be the most influential factors.

Identifying factors, such as low sleep quality, that predict suicidal thoughts using machine learning

Dong et al. | Apr 30, 2024

Sadly, around 800,000 people die by suicide worldwide each year. Dong and Pearce analyze health survey data to identify associations between suicidal ideation and relevant variables, such as sleep quality, hopelessness, and anxious behavior.

Creating a drought prediction model using convolutional neural networks

Bora et al. | Oct 08, 2024

Droughts kill over 45,000 people yearly and affect the livelihoods of 55 million others worldwide, with climate change likely to worsen these effects. However, unlike other natural disasters (hurricanes, etc.), there is no early detection system that can predict droughts far enough in advance to be useful. Bora, Caulkins, and Joycutty tackle this issue by creating a drought prediction model.

Predicting clogs in water pipelines using sound sensors and machine learning linear regression

Rajawat et al. | Oct 11, 2025

The authors looked the ability of sound sensors to predict clogged pipes when the sound intensity data is run through a machine learning algorithm.

Predicting the spread speed of red imported fire ants under different temperature conditions in China

Wang et al. | Sep 07, 2025

The authors looked at non-natural factors that influenced the spread rate of fire ants in multiple cities in China.

Predicting voting and union support in certification elections: Evidence from Starbucks workers, 2021-2024

Zhang et al. | Aug 28, 2025

The authors looked at unionization petitions from Starbucks workers between August 2021 and July 2024 to determine what factors influence votes for or against unionization.

Predicting and explaining illicit financial flows in developing countries: A machine learning approach

Putta et al. | Aug 24, 2025

The authors looked at the ability of different machine learning algorithms to predict the level of financial corruption in different countries.

Stock price prediction: Long short-term memory vs. Autoformer and time series foundation model

Lau et al. | Aug 12, 2025

The authors looked the ability to predict future stock prices using various machine learning models.

Predicting smoking status based on RNA sequencing data

Yang et al. | Aug 30, 2024

Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.

Browse Articles

Correlating inlet gas composition to conversion efficiency in plasma-assisted landfill gas reforming

Exploring the Factors that Drive Coffee Ratings

Identifying factors, such as low sleep quality, that predict suicidal thoughts using machine learning

Creating a drought prediction model using convolutional neural networks

Predicting clogs in water pipelines using sound sensors and machine learning linear regression

Predicting the spread speed of red imported fire ants under different temperature conditions in China

Predicting voting and union support in certification elections: Evidence from Starbucks workers, 2021-2024

Predicting and explaining illicit financial flows in developing countries: A machine learning approach

Stock price prediction: Long short-term memory vs. Autoformer and time series foundation model

Predicting smoking status based on RNA sequencing data

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags