Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.
Read More...Browse Articles
Expression of Anti-Neurodegeneration Genes in Mutant Caenorhabditis elegans Using CRISPR-Cas9 Improves Behavior Associated With Alzheimer’s Disease
Alzheimer's disease is one of the leading causes of death in the United States and is characterized by neurodegeneration. Mishra et al. wanted to understand the role of two transport proteins, LRP1 and AQP4, in the neurodegeneration of Alzheimer's disease. They used a model organism for Alzheimer's disease, the nematode C. elegans, and genetic engineering to look at whether they would see a decrease in neurodegeneration if they increased the amount of these two transport proteins. They found that the best improvements were caused by increased expression of both transport proteins, with smaller improvements when just one of the proteins is overly expressed. Their work has important implications for how we understand neurodegeneration in Alzheimer's disease and what we can do to slow or prevent the progression of the disease.
Read More...Predicting smoking status based on RNA sequencing data
Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.
Read More...Applying centrality analysis on a protein interaction network to predict colorectal cancer driver genes
In this article the authors created an interaction map of proteins involved in colorectal cancer to look for driver vs. non-driver genes. That is they wanted to see if they could determine what genes are more likely to drive the development and progression in colorectal cancer and which are present in altered states but not necessarily driving disease progression.
Read More...DNA-SEnet: A convolutional neural network for classifying DNA-asthma associations
In this study, the authors developed a model named DNA Sequence Embedding Network (DNA-SEnet) to classify DNA-asthma associations using their genomic patterns.
Read More...Investigating the inhibition of catabolic enzymes for implications in cardiovascular diseases and diabetes
Enzymes that metabolize carbohydrates and lipids play a key role in our health, including global health challenges like cardiovascular diseases and diabetes. To learn more about these important enzymes, Gandhi and Gandhi test whether various natural substances (ginger, Aloe vera, lemon, and mint leaves) affect the activity of α-amylase and lipase enzymes.
Read More...Investigating the potential of zinc oxide nanoparticles and zinc ions as promising approaches to lung cancer
Here, the authors chose to investigate the efficacy of zinc oxide nanoparticles (ZnO NPs) and cisplatin or zinc ions in inducing cancer apoptosis. While both treatments were found to reduce the proliferation of lung cancer cells, the authors suggest that further studies to identify the mechanism are necessary.
Read More...Distribution of prophages in the Streptococcus bacteria genus and their role in increasing host pathogenicity
The authors investigated prophages present in Streptococcus bacteria that may increase their survival in different environments.
Read More...A novel approach to determine which organism best displays Gijswijt's Sequence in its genome
The sequence of nitrogenous bases that make up the DNA of organisms can contain hidden mathematical sequences. Here the authors used BioPython, a programming tool, to find an organism that displays Gijswijt’s Sequence in its genome. In this manner they found that the common carp best displays Gijswijt’s Sequence in its genome.
Read More...Design and in silico screening of analogs of rilpivirine as novel non-nucleoside reverse transcriptase inhibitors (NNRTIs) for antiretroviral therapy
In this study, the authors use high-throughput virtual screening to design and evaluate a set of non-nucleoside reverse transcriptase inhibitors for binding affinity to the protein reverse transcriptase. These studies have important applications toward HIV therapies.
Read More...