Articles | Journal of Emerging Investigators

Applying centrality analysis on a protein interaction network to predict colorectal cancer driver genes

Saha et al. | Nov 18, 2023

In this article the authors created an interaction map of proteins involved in colorectal cancer to look for driver vs. non-driver genes. That is they wanted to see if they could determine what genes are more likely to drive the development and progression in colorectal cancer and which are present in altered states but not necessarily driving disease progression.

Survival analysis in cardiovascular epidemiology: nexus between heart disease and mortality

Lachwani et al. | Oct 23, 2024

In 2021, over 20 million people died from cardiovascular diseases, highlighting the need for a deeper understanding of factors influencing heart failure outcomes. This study examined multiple variables affecting mortality after heart failure, using random forest models to identify time, serum creatinine, and ejection fraction as key predictors. These findings could contribute to personalized medicine, improving survival rates by tailoring treatment strategies for heart failure patients.

Development of a novel machine learning platform to identify structural trends among NNRTI HIV-1 reverse transcriptase inhibitors

Ashok et al. | Jun 24, 2022

With advancements in machine learning a large data scale, high throughput virtual screening has become a more attractive method for screening drug candidates. This study compared the accuracy of molecular descriptors from two cheminformatics Mordred and PaDEL, software libraries, in characterizing the chemo-structural composition of 53 compounds from the non-nucleoside reverse transcriptase inhibitors (NNRTI) class. The classification model built with the filtered set of descriptors from Mordred was superior to the model using PaDEL descriptors. This approach can accelerate the identification of hit compounds and improve the efficiency of the drug discovery pipeline.

Machine learning on crowd-sourced data to highlight coral disease

Narayan et al. | Jul 26, 2021

Triggered largely by the warming and pollution of oceans, corals are experiencing bleaching and a variety of diseases caused by the spread of bacteria, fungi, and viruses. Identification of bleached/diseased corals enables implementation of measures to halt or retard disease. Benthic cover analysis, a standard metric used in large databases to assess live coral cover, as a standalone measure of reef health is insufficient for identification of coral bleaching/disease. Proposed herein is a solution that couples machine learning with crowd-sourced data – images from government archives, citizen science projects, and personal images collected by tourists – to build a model capable of identifying healthy, bleached, and/or diseased coral.

A natural language processing approach to skill identification in the job market

Suram et al. | Oct 29, 2024

The authors looked at using machine learning to identify skills needed to apply for certain jobs, specifically looking at different techniques to parse apart the text. They found that Bidirectional Encoder Representation of Transforms (BERT) performed best.

Training neural networks on text data to model human emotional understanding

Sathish et al. | Mar 14, 2025

The authors train a neural network to detect text-based emotions including joy, sadness, anger, fear, love, and surprise.

Advancing pediatric cancer predictions through generative artificial intelligence and machine learning

Yadav et al. | Dec 21, 2024

Pediatric cancers pose unique challenges due to their rarity and distinct biological factors, emphasizing the need for accurate survival prediction to guide treatment. This study integrated generative AI and machine learning, including synthetic data, to analyze 9,184 pediatric cancer patients, identifying age at diagnosis, cancer types, and anatomical sites as significant survival predictors. The findings highlight the potential of AI-driven approaches to improve survival prediction and inform personalized treatment strategies, with broader implications for innovative healthcare applications.

Can the nucleotide content of a DNA sequence predict the sequence accessibility?

Balachandran et al. | Mar 10, 2023

Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.

Can the attributes of an app predict its rating?

Feng et al. | Jul 03, 2024

In this article the authors looked at different attributes of apps within the Google Play store to determine how those may impact the overall app rating out of five stars. They found that review count, amount of storage needed and when the app was last updated to be the most influential factors on an app's rating.

Using data science along with machine learning to determine the ARIMA model’s ability to adjust to irregularities in the dataset

Choudhary et al. | Jul 26, 2021

Auto-Regressive Integrated Moving Average (ARIMA) models are known for their influence and application on time series data. This statistical analysis model uses time series data to depict future trends or values: a key contributor to crime mapping algorithms. However, the models may not function to their true potential when analyzing data with many different patterns. In order to determine the potential of ARIMA models, our research will test the model on irregularities in the data. Our team hypothesizes that the ARIMA model will be able to adapt to the different irregularities in the data that do not correspond to a certain trend or pattern. Using crime theft data and an ARIMA model, we determined the results of the ARIMA model’s forecast and how the accuracy differed on different days with irregularities in crime.

Browse Articles

Applying centrality analysis on a protein interaction network to predict colorectal cancer driver genes

Survival analysis in cardiovascular epidemiology: nexus between heart disease and mortality

Development of a novel machine learning platform to identify structural trends among NNRTI HIV-1 reverse transcriptase inhibitors

Machine learning on crowd-sourced data to highlight coral disease

A natural language processing approach to skill identification in the job market

Training neural networks on text data to model human emotional understanding

Advancing pediatric cancer predictions through generative artificial intelligence and machine learning

Can the nucleotide content of a DNA sequence predict the sequence accessibility?

Can the attributes of an app predict its rating?

Using data science along with machine learning to determine the ARIMA model’s ability to adjust to irregularities in the dataset

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags