Articles | Journal of Emerging Investigators

Correlating inlet gas composition to conversion efficiency in plasma-assisted landfill gas reforming

Kim et al. | Jun 28, 2025

The escalating crisis of climate change, driven by the accumulation of greenhouse gases from human activities, demands urgent and innovative solutions to curb rising global temperatures. Plasma-based methane (CH₄) and carbon dioxide (CO₂) reforming offers a promising pathway for carbon capture and the sustainable production of hydrogen fuel and syngas components. To advance this technology, particularly in terms of energy efficiency and selectivity, it is essential to enhance the conversion efficiencies of CO₂ and CH₄.

Exploring the Factors that Drive Coffee Ratings

Agarwal et al. | May 19, 2025

This study explores the factors that influence coffee quality ratings using data from the Coffee Quality Institute. Through a regression model based on gradient descent, the authors aimed to predict coffee ratings (total cup points) and hypothesized that sweetness and the coffee producer would be the most influential factors.

Identifying factors, such as low sleep quality, that predict suicidal thoughts using machine learning

Dong et al. | Apr 30, 2024

Sadly, around 800,000 people die by suicide worldwide each year. Dong and Pearce analyze health survey data to identify associations between suicidal ideation and relevant variables, such as sleep quality, hopelessness, and anxious behavior.

Suppress that algae: Mitigating the effects of harmful algal blooms through preemptive detection & suppression

Natarajan et al. | Jul 17, 2023

A bottleneck in deleting algal blooms is that current data section is manual and is reactionary to an existing algal bloom. These authors made a custom-designed Seek and Destroy Algal Mitigation System (SDAMS) that detects harmful algal blooms at earlier time points with astonishing accuracy, and can instantaneously suppress the pre-bloom algal population.

Creating a drought prediction model using convolutional neural networks

Bora et al. | Oct 08, 2024

Droughts kill over 45,000 people yearly and affect the livelihoods of 55 million others worldwide, with climate change likely to worsen these effects. However, unlike other natural disasters (hurricanes, etc.), there is no early detection system that can predict droughts far enough in advance to be useful. Bora, Caulkins, and Joycutty tackle this issue by creating a drought prediction model.

Predicting the spread speed of red imported fire ants under different temperature conditions in China

Wang et al. | Sep 07, 2025

The authors looked at non-natural factors that influenced the spread rate of fire ants in multiple cities in China.

Predicting voting and union support in certification elections: Evidence from Starbucks workers, 2021-2024

Zhang et al. | Aug 28, 2025

The authors looked at unionization petitions from Starbucks workers between August 2021 and July 2024 to determine what factors influence votes for or against unionization.

Stock price prediction: Long short-term memory vs. Autoformer and time series foundation model

Lau et al. | Aug 12, 2025

The authors looked the ability to predict future stock prices using various machine learning models.

Investigating AlphaFold’s handling of nanobody-antigen complex prediction

Swaminathan et al. | Apr 16, 2025

Predicting antibody structures and antibody-antigen complexes using AlphaFold

Predicting smoking status based on RNA sequencing data

Yang et al. | Aug 30, 2024

Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.

Browse Articles

Correlating inlet gas composition to conversion efficiency in plasma-assisted landfill gas reforming

Exploring the Factors that Drive Coffee Ratings

Identifying factors, such as low sleep quality, that predict suicidal thoughts using machine learning

Suppress that algae: Mitigating the effects of harmful algal blooms through preemptive detection & suppression

Creating a drought prediction model using convolutional neural networks

Predicting the spread speed of red imported fire ants under different temperature conditions in China

Predicting voting and union support in certification elections: Evidence from Starbucks workers, 2021-2024

Stock price prediction: Long short-term memory vs. Autoformer and time series foundation model

Investigating AlphaFold’s handling of nanobody-antigen complex prediction

Predicting smoking status based on RNA sequencing data

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags