Articles | Journal of Emerging Investigators

DyGS: A Dynamic Gene Searching Algorithm for Cancer Detection

Wang et al. | Jun 05, 2018

Wang and Gong developed a novel dynamic gene-searching algorithm called Dynamic Gene Search (DyGS) to create a gene panel for each of the 12 cancers with the highest annual incidence and death rate. The 12 gene panels the DyGS algorithm selected used only 3.5% of the original gene mutation pool, while covering every patient sample. About 40% of each gene panel is druggable, which indicates that the DyGS-generated gene panels can be used for early cancer detection as well as therapeutic targets in treatment methods.

A novel CNN-based machine learning approach to identify skin cancers

Rao et al. | Nov 18, 2022

In this study, the authors developed and assessed the accuracy of a machine learning algorithm to identify skin cancers using images of biopsies.

Estimation of Reproduction Number of Influenza in Greece using SIR Model

Skarpeti et al. | Nov 18, 2020

In this study, we developed an algorithm to estimate the contact rate and the average infectious period of influenza using a Susceptible, Infected, and Recovered (SIR) epidemic model. The parameters in this model were estimated using data on infected Greek individuals collected from the National Public Health Organization. Our model labeled influenza as an epidemic with a basic reproduction value greater than one.

Demographic indicators of voter shift between 2016 and 2020 presidential elections

Wang et al. | Jul 13, 2022

In this study, the authors investigate the demographic indicators for voter shift between the 2016 and 2020 presidential elections based on demographic data put through a K-nearest neighbors classification algorithm and Principal Component Analysis.

Virtual Screening of Cutibacterium acnes Antibacterial Agent Using Natural Compounds Database

Liu et al. | Mar 20, 2022

A common form of Acne is caused by a species of bacterium called Cutibacterium acnes. By using a predictive algorithm and structural analysis, the authors identified 5 small molecules with high affinity to growth factors in Catibacterium acnes. This has potential implications for supplemental skincare products.

Assessing and Improving Machine Learning Model Predictions of Polymer Glass Transition Temperatures

Ramprasad et al. | Mar 18, 2020

In this study, the authors test whether providing a larger dataset of glass transition temperatures (T_g) to train the machine-learning platform Polymer Genome would improve its accuracy. Polymer Genome is a machine learning based data-driven informatics platform for polymer property prediction and T_g is one property needed to design new polymers in silico. They found that training the model with their larger, curated dataset improved the algorithm's T_g, providing valuable improvements to this useful platform.

Recognition of animal body parts via supervised learning

Kreiman et al. | Oct 28, 2023

The application of machine learning techniques has facilitated the automatic annotation of behavior in video sequences, offering a promising approach for ethological studies by reducing the manual effort required for annotating each video frame. Nevertheless, before solely relying on machine-generated annotations, it is essential to evaluate the accuracy of these annotations to ensure their reliability and applicability. While it is conventionally accepted that there cannot be a perfect annotation, the degree of error associated with machine-generated annotations should be commensurate with the error between different human annotators. We hypothesized that machine learning supervised with adequate human annotations would be able to accurately predict body parts from video sequences. Here, we conducted a comparative analysis of the quality of annotations generated by humans and machines for the body parts of sheep during treadmill walking. For human annotation, two annotators manually labeled six body parts of sheep in 300 frames. To generate machine annotations, we employed the state-of-the-art pose-estimating library, DeepLabCut, which was trained using the frames annotated by human annotators. As expected, the human annotations demonstrated high consistency between annotators. Notably, the machine learning algorithm also generated accurate predictions, with errors comparable to those between humans. We also observed that abnormal annotations with a high error could be revised by introducing Kalman Filtering, which interpolates the trajectory of body parts over the time series, enhancing robustness. Our results suggest that conventional transfer learning methods can generate behavior annotations as accurate as those made by humans, presenting great potential for further research.

Refinement of Single Nucleotide Polymorphisms of Atopic Dermatitis related Filaggrin through R packages

Naravane et al. | Oct 12, 2022

In the United States, there are currently 17.8 million affected by atopic dermatitis (AD), commonly known as eczema. It is characterized by itching and skin inflammation. AD patients are at higher risk for infections, depression, cancer, and suicide. Genetics, environment, and stress are some of the causes of the disease. With the rise of personalized medicine and the acceptance of gene-editing technologies, AD-related variations need to be identified for treatment. Genome-wide association studies (GWAS) have associated the Filaggrin (FLG) gene with AD but have not identified specific problematic single nucleotide polymorphisms (SNPs). This research aimed to refine known SNPs of FLG for gene editing technologies to establish a causal link between specific SNPs and the diseases and to target the polymorphisms. The research utilized R and its Bioconductor packages to refine data from the National Center for Biotechnology Information's (NCBI's) Variation Viewer. The algorithm filtered the dataset by coding regions and conserved domains. The algorithm also removed synonymous variations and treated non-synonymous, frameshift, and nonsense separately. The non-synonymous variations were refined and ordered by the BLOSUM62 substitution matrix. Overall, the analysis removed 96.65% of data, which was redundant or not the focus of the research and ordered the remaining relevant data by impact. The code for the project can also be repurposed as a tool for other diseases. The research can help solve GWAS's imprecise identification challenge. This research is the first step in providing the refined databases required for gene-editing treatment.

Addressing and Resolving Biases in Artificial Intelligence

Choudhari et al. | May 29, 2024

The authors explore how diversity in data sets contributes to bias in artificial intelligence.

SpottingDiffusion: Using transfer learning to detect Latent Diffusion Model-synthesized images

Sahal Mulki et al. | Nov 15, 2024

The authors develop a method for detecting fake AI-generated images from real images.

Browse Articles

DyGS: A Dynamic Gene Searching Algorithm for Cancer Detection

A novel CNN-based machine learning approach to identify skin cancers

Estimation of Reproduction Number of Influenza in Greece using SIR Model

Demographic indicators of voter shift between 2016 and 2020 presidential elections

Virtual Screening of Cutibacterium acnes Antibacterial Agent Using Natural Compounds Database

Assessing and Improving Machine Learning Model Predictions of Polymer Glass Transition Temperatures

Recognition of animal body parts via supervised learning

Refinement of Single Nucleotide Polymorphisms of Atopic Dermatitis related Filaggrin through R packages

Addressing and Resolving Biases in Artificial Intelligence

SpottingDiffusion: Using transfer learning to detect Latent Diffusion Model-synthesized images

Search Articles

Popular Tags

Browse Articles

Search Articles

Category

School Level

Popular Tags