Articles | Journal of Emerging Investigators

Depression detection in social media text: leveraging machine learning for effective screening

Shin et al. | Mar 25, 2025

Depression affects millions globally, yet identifying symptoms remains challenging. This study explored detecting depression-related patterns in social media texts using natural language processing and machine learning algorithms, including decision trees and random forests. Our findings suggest that analyzing online text activity can serve as a viable method for screening mental disorders, potentially improving diagnosis accuracy by incorporating both physical and psychological indicators.

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

Balaji et al. | Sep 11, 2021

The purpose of the study was to determine whether graph-based machine learning techniques, which have increased prevalence in the last few years, can accurately classify data into one of many clusters, while requiring less labeled training data and parameter tuning as opposed to traditional machine learning algorithms. The results determined that the accuracy of graph-based and traditional classification algorithms depends directly upon the number of features of each dataset, the number of classes in each dataset, and the amount of labeled training data used.

Survival analysis in cardiovascular epidemiology: nexus between heart disease and mortality

Lachwani et al. | Oct 23, 2024

In 2021, over 20 million people died from cardiovascular diseases, highlighting the need for a deeper understanding of factors influencing heart failure outcomes. This study examined multiple variables affecting mortality after heart failure, using random forest models to identify time, serum creatinine, and ejection fraction as key predictors. These findings could contribute to personalized medicine, improving survival rates by tailoring treatment strategies for heart failure patients.

Comparison of the ease of use and accuracy of two machine learning algorithms – forestry case study

Bhatia et al. | Mar 21, 2021

Machine learning algorithms are becoming increasingly popular for data crunching across a vast area of scientific disciplines. Here, the authors compare two machine learning algorithms with respect to accuracy and user-friendliness and find that random forest algorithms outperform logistic regression when applied to the same dataset.

Machine learning predictions of additively manufactured alloy crack susceptibilities

Gowda et al. | Nov 12, 2024

Additive manufacturing (AM) is transforming the production of complex metal parts, but challenges like internal cracking can arise, particularly in critical sectors such as aerospace and automotive. Traditional methods to assess cracking susceptibility are costly and time-consuming, prompting the use of machine learning (ML) for more efficient predictions. This study developed a multi-model ML pipeline that predicts solidification cracking susceptibility (SCS) more accurately by considering secondary alloy properties alongside composition, with Random Forest models showing the best performance, highlighting a promising direction for future research into SCS quantification.

Using machine learning to develop a global coral bleaching predictor

Madireddy et al. | Feb 21, 2023

Coral bleaching is a fatal process that reduces coral diversity, leads to habitat loss for marine organisms, and is a symptom of climate change. This process occurs when corals expel their symbiotic dinoflagellates, algae that photosynthesize within coral tissue providing corals with glucose. Restoration efforts have attempted to repair damaged reefs; however, there are over 360,000 square miles of coral reefs worldwide, making it challenging to target conservation efforts. Thus, predicting the likelihood of bleaching in a certain region would make it easier to allocate resources for conservation efforts. We developed a machine learning model to predict global locations at risk for coral bleaching. Data obtained from the Biological and Chemical Oceanography Data Management Office consisted of various coral bleaching events and the parameters under which the bleaching occurred. Sea surface temperature, sea surface temperature anomalies, longitude, latitude, and coral depth below the surface were the features found to be most correlated to coral bleaching. Thirty-nine machine learning models were tested to determine which one most accurately used the parameters of interest to predict the percentage of corals that would be bleached. A random forest regressor model with an R-squared value of 0.25 and a root mean squared error value of 7.91 was determined to be the best model for predicting coral bleaching. In the end, the random model had a 96% accuracy in predicting the percentage of corals that would be bleached. This prediction system can make it easier for researchers and conservationists to identify coral bleaching hotspots and properly allocate resources to prevent or mitigate bleaching events.

Predicting the factors involved in orthopedic patient hospital stay

D’Souza et al. | Dec 13, 2023

Long hospital stays can be stressful for the patient for many reasons. We hypothesized that age would be the greatest predictor of hospital stay among patients who underwent orthopedic surgery. Through our models, we found that severity of illness was indeed the highest factor that contributed to determining patient length of stay. The other two factors that followed were the facility that the patient was staying in and the type of procedure that they underwent.

The influence of economic factors on United States household energy consumption in 2020

Ramanathan et al. | Jun 08, 2026

This study used machine learning models to examine which factors most influenced U.S. household energy consumption in 2020 using data from 18,496 households.

Weather-based power outage prediction in New York City: An ensemble machine learning approach

Mohan et al. | Apr 29, 2026

This study contributes to our understanding of how urban energy systems respond to climate variability and inform strategies for enhancing power grid resilience. The findings can help inform urban planners and infrastructure developers by identifying the factors that make regions within a power grid more vulnerable.

Environmental contributors of asthma via explainable AI: Green spaces, climate, traffic & air quality

Chen et al. | Aug 12, 2025

This study explored how green spaces, climate, traffic, and air quality (GCTA) collectively influence asthma-related emergency department visits in the U.S using machine learning models and explainable AI.

Browse Articles

Depression detection in social media text: leveraging machine learning for effective screening

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

Survival analysis in cardiovascular epidemiology: nexus between heart disease and mortality

Comparison of the ease of use and accuracy of two machine learning algorithms – forestry case study

Machine learning predictions of additively manufactured alloy crack susceptibilities

Using machine learning to develop a global coral bleaching predictor

Predicting the factors involved in orthopedic patient hospital stay

The influence of economic factors on United States household energy consumption in 2020

Weather-based power outage prediction in New York City: An ensemble machine learning approach

Environmental contributors of asthma via explainable AI: Green spaces, climate, traffic & air quality

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags