Articles | Journal of Emerging Investigators

An explainable model for content moderation

Cao et al. | Aug 16, 2023

The authors looked at the ability of machine learning algorithms to interpret language given their increasing use in moderating content on social media. Using an explainable model they were able to achieve 81% accuracy in detecting fake vs. real news based on language of posts alone.

Uncovering the hidden trafficking trade with geographic data and natural language processing

Aqid et al. | Oct 14, 2024

The authors use machine learning to develop an evidence-based detection tool for identifying human trafficking.

Machine learning predictions of additively manufactured alloy crack susceptibilities

Gowda et al. | Nov 12, 2024

Additive manufacturing (AM) is transforming the production of complex metal parts, but challenges like internal cracking can arise, particularly in critical sectors such as aerospace and automotive. Traditional methods to assess cracking susceptibility are costly and time-consuming, prompting the use of machine learning (ML) for more efficient predictions. This study developed a multi-model ML pipeline that predicts solidification cracking susceptibility (SCS) more accurately by considering secondary alloy properties alongside composition, with Random Forest models showing the best performance, highlighting a promising direction for future research into SCS quantification.

Using explainable artificial intelligence to identify patient-specific breast cancer subtypes

Suresh et al. | Jan 12, 2024

Breast cancer is the most common cancer in women, with approximately 300,000 diagnosed with breast cancer in 2023. It ranks second in cancer-related deaths for women, after lung cancer with nearly 50,000 deaths. Scientists have identified important genetic mutations in genes like BRCA1 and BRCA2 that lead to the development of breast cancer, but previous studies were limited as they focused on specific populations. To overcome limitations, diverse populations and powerful statistical methods like genome-wide association studies and whole-genome sequencing are needed. Explainable artificial intelligence (XAI) can be used in oncology and breast cancer research to overcome these limitations of specificity as it can analyze datasets of diagnosed patients by providing interpretable explanations for identified patterns and predictions. This project aims to achieve technological and medicinal goals by using advanced algorithms to identify breast cancer subtypes for faster diagnoses. Multiple methods were utilized to develop an efficient algorithm. We hypothesized that an XAI approach would be best as it can assign scores to genes, specifically with a 90% success rate. To test that, we ran multiple trials utilizing XAI methods through the identification of class-specific and patient-specific key genes. We found that the study demonstrated a pipeline that combines multiple XAI techniques to identify potential biomarker genes for breast cancer with a 95% success rate.

Can the attributes of an app predict its rating?

Feng et al. | Jul 03, 2024

In this article the authors looked at different attributes of apps within the Google Play store to determine how those may impact the overall app rating out of five stars. They found that review count, amount of storage needed and when the app was last updated to be the most influential factors on an app's rating.

An analysis of the feasibility of SARIMAX-GARCH through load forecasting

Satheesh et al. | Nov 20, 2022

The authors found that SARIMAX-GARCH is more accurate than SARIMAX for load forecasting with respect to energy consumption.

Building deep neural networks to detect candy from photos and estimate nutrient portfolio

An et al. | Sep 13, 2023

The authors use pictures of candy wrappers and neural networks to improve nutritional accuracy of diet-tracking apps.

Evaluating the predicted eruption times of geysers in Yellowstone National Park

Rhee et al. | Jun 25, 2024

The authors compare the predicted versus actual geyser eruption times for the Old Faithful and Beehive Geysers at Yellowstone National Park.

The use of computer vision to differentiate valley fever from lung cancer via CT scans of nodules

El Kereamy et al. | Nov 12, 2024

Pulmonary diseases like lung cancer and valley fever pose serious health challenges, making accurate and rapid diagnostics essential. This study developed a MATLAB-based software tool that uses computer vision techniques to differentiate between these diseases by analyzing features of lung nodules in CT scans, achieving higher precision than traditional methods.

Risk assessment modeling for childhood stunting using automated machine learning and demographic analysis

Sirohi et al. | Sep 25, 2022

Over the last few decades, childhood stunting has persisted as a major global challenge. This study hypothesized that TPTO (Tree-based Pipeline Optimization Tool), an AutoML (automated machine learning) tool, would outperform all pre-existing machine learning models and reveal the positive impact of economic prosperity, strong familial traits, and resource attainability on reducing stunting risk. Feature correlation plots revealed that maternal height, wealth indicators, and parental education were universally important features for determining stunting outcomes approximately two years after birth. These results help inform future research by highlighting how demographic, familial, and socio-economic conditions influence stunting and providing medical professionals with a deployable risk assessment tool for predicting childhood stunting.

Browse Articles

An explainable model for content moderation

Uncovering the hidden trafficking trade with geographic data and natural language processing

Machine learning predictions of additively manufactured alloy crack susceptibilities

Using explainable artificial intelligence to identify patient-specific breast cancer subtypes

Can the attributes of an app predict its rating?

An analysis of the feasibility of SARIMAX-GARCH through load forecasting

Building deep neural networks to detect candy from photos and estimate nutrient portfolio

Evaluating the predicted eruption times of geysers in Yellowstone National Park

The use of computer vision to differentiate valley fever from lung cancer via CT scans of nodules

Risk assessment modeling for childhood stunting using automated machine learning and demographic analysis

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags