Articles | Journal of Emerging Investigators

Genetic algorithm based features selection for predicting the unemployment rate of India

Mohammed et al. | Mar 16, 2024

The authors looked at using genetic algorithms to look at the Indian labor market and what features might best explain any variation seen. They found that features such as economic growth and household consumption, among others, best explained variation.

Analyzing carbon dividends’ impact on financial security via ML & metaheuristic search

Babel et al. | Jun 14, 2025

Impact of carbon tax and dividend on financial security

Using explainable artificial intelligence to identify patient-specific breast cancer subtypes

Suresh et al. | Jan 12, 2024

Breast cancer is the most common cancer in women, with approximately 300,000 diagnosed with breast cancer in 2023. It ranks second in cancer-related deaths for women, after lung cancer with nearly 50,000 deaths. Scientists have identified important genetic mutations in genes like BRCA1 and BRCA2 that lead to the development of breast cancer, but previous studies were limited as they focused on specific populations. To overcome limitations, diverse populations and powerful statistical methods like genome-wide association studies and whole-genome sequencing are needed. Explainable artificial intelligence (XAI) can be used in oncology and breast cancer research to overcome these limitations of specificity as it can analyze datasets of diagnosed patients by providing interpretable explanations for identified patterns and predictions. This project aims to achieve technological and medicinal goals by using advanced algorithms to identify breast cancer subtypes for faster diagnoses. Multiple methods were utilized to develop an efficient algorithm. We hypothesized that an XAI approach would be best as it can assign scores to genes, specifically with a 90% success rate. To test that, we ran multiple trials utilizing XAI methods through the identification of class-specific and patient-specific key genes. We found that the study demonstrated a pipeline that combines multiple XAI techniques to identify potential biomarker genes for breast cancer with a 95% success rate.

Can the nucleotide content of a DNA sequence predict the sequence accessibility?

Balachandran et al. | Mar 10, 2023

Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.

Applying centrality analysis on a protein interaction network to predict colorectal cancer driver genes

Saha et al. | Nov 18, 2023

In this article the authors created an interaction map of proteins involved in colorectal cancer to look for driver vs. non-driver genes. That is they wanted to see if they could determine what genes are more likely to drive the development and progression in colorectal cancer and which are present in altered states but not necessarily driving disease progression.

Optimizing Interplanetary Travel Using a Genetic Algorithm

Murali et al. | Oct 28, 2018

In this work, the authors develop an algorithm that solves the problem of efficient space travel between planets. This is a problem that could soon be of relevance as mankind continues to expand its exploration of outer space, and potentially attempt to inhabit it.

Genomic Signature Analysis for the Strategic Bioremediation of Polycyclic Aromatic Hydrocarbons in Mangrove Ecosystems in the Gulf of Tonkin

Dao et al. | Jun 27, 2021

Engineered bacteria that degrade oil are currently being considered as a safe option for the treatment of oil spills. For this approach to be successful, the bacteria must effectively express oil-degrading genes they uptake as part of an external genoming vehicle called a "plasmid". Using a computational approach, the authors investigate plasmid-bacterium compatibility to find pairs that ensure high levels of gene expression.

Refinement of Single Nucleotide Polymorphisms of Atopic Dermatitis related Filaggrin through R packages

Naravane et al. | Oct 12, 2022

In the United States, there are currently 17.8 million affected by atopic dermatitis (AD), commonly known as eczema. It is characterized by itching and skin inflammation. AD patients are at higher risk for infections, depression, cancer, and suicide. Genetics, environment, and stress are some of the causes of the disease. With the rise of personalized medicine and the acceptance of gene-editing technologies, AD-related variations need to be identified for treatment. Genome-wide association studies (GWAS) have associated the Filaggrin (FLG) gene with AD but have not identified specific problematic single nucleotide polymorphisms (SNPs). This research aimed to refine known SNPs of FLG for gene editing technologies to establish a causal link between specific SNPs and the diseases and to target the polymorphisms. The research utilized R and its Bioconductor packages to refine data from the National Center for Biotechnology Information's (NCBI's) Variation Viewer. The algorithm filtered the dataset by coding regions and conserved domains. The algorithm also removed synonymous variations and treated non-synonymous, frameshift, and nonsense separately. The non-synonymous variations were refined and ordered by the BLOSUM62 substitution matrix. Overall, the analysis removed 96.65% of data, which was redundant or not the focus of the research and ordered the remaining relevant data by impact. The code for the project can also be repurposed as a tool for other diseases. The research can help solve GWAS's imprecise identification challenge. This research is the first step in providing the refined databases required for gene-editing treatment.

Genetic Bioaugmentation of Oryza sativa to Facilitate Self-Detoxification of Arsenic In-Situ

Bhat et al. | Dec 03, 2024

Arsenic contamination in rice, caused by the use of arsenic-laden groundwater for irrigation, is a growing global concern, affecting over 150 million people. To address this, researchers hypothesized that genetically modifying rice plants with arsenic-resistant genes could reduce arsenic uptake and allow the plants to detoxify arsenic, making them safer to consume.

Genetic underpinnings of the sex bias in autism spectrum disorder

Lee et al. | Mar 29, 2024

Here, seeking to identify a possible explanation for the more frequent diagnosis of autism spectrum disorder (ASD) in males than females, they sought to investigate a potential sex bias in the expression of ASD-associated genes. Based on their analysis, they identified 17 ASD-associated candidate genes that showed stronger collective sex-dependent expression.

Browse Articles

Genetic algorithm based features selection for predicting the unemployment rate of India

Analyzing carbon dividends’ impact on financial security via ML & metaheuristic search

Using explainable artificial intelligence to identify patient-specific breast cancer subtypes

Can the nucleotide content of a DNA sequence predict the sequence accessibility?

Applying centrality analysis on a protein interaction network to predict colorectal cancer driver genes

Optimizing Interplanetary Travel Using a Genetic Algorithm

Genomic Signature Analysis for the Strategic Bioremediation of Polycyclic Aromatic Hydrocarbons in Mangrove Ecosystems in the Gulf of Tonkin

Refinement of Single Nucleotide Polymorphisms of Atopic Dermatitis related Filaggrin through R packages

Genetic Bioaugmentation of Oryza sativa to Facilitate Self-Detoxification of Arsenic In-Situ

Genetic underpinnings of the sex bias in autism spectrum disorder

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags