Browse Articles

Effects of urban traffic noise on the early growth and transcription of Arabidopsis thaliana

Kim et al. | Sep 18, 2024

Effects of urban traffic noise on the early growth and transcription of <i>Arabidopsis thaliana<i>

This article explores the largely unstudied impact of noise pollution on plant life. By exposing Arabidopsis thaliana seedlings to urban traffic noise, the study found a significant increase in seedling growth, alongside substantial changes in gene expression. This research reveals critical insights into how noise pollution affects plant physiology and contributes to a broader understanding of its ecological impacts, helping to guide future efforts in ecosystem conservation.

Read More...

Predicting smoking status based on RNA sequencing data

Yang et al. | Aug 30, 2024

Predicting smoking status based on RNA sequencing data
Image credit: Yang and Stanley 2024

Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.

Read More...

Can the nucleotide content of a DNA sequence predict the sequence accessibility?

Balachandran et al. | Mar 10, 2023

Can the nucleotide content of a DNA sequence predict the sequence accessibility?
Image credit: Warren Umoh

Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.

Read More...

Expressional correlations between SERPINA6 and pancreatic ductal adenocarcinoma-linked genes

Selver et al. | Oct 06, 2021

Expressional correlations between <em>SERPINA6</em> and pancreatic ductal adenocarcinoma-linked genes

Pancreatic ductal adenocarcinoma (PDAC) is the most common form of pancreatic cancer, with early diagnosis and treatment challenges. When any of the genes KRAS, SMAD4, TP53, and BRCA2 are heavily mutated, they correlate with PDAC progression. Cellular stress, partly regulated by the gene SERPINA6, also correlates with PDAC progression. When SERPINA6 is highly expressed, corticosteroid-binding globulin inhibits the effect of the stress hormone cortisol. In this study, the authors explored whether there is an inverse correlation between the expression of SERPINA6 and PDAC-linked genes.

Read More...

Evolution of Neuroplastin-65

Cremers et al. | Oct 26, 2016

Evolution of Neuroplastin-65

Human intelligence is correlated with variation in the protein neuroplastin-65, which is encoded by the NPTN gene. The authors examine the evolution of this gene across different animal species.

Read More...

Upregulation of the Ribosomal Pathway as a Potential Blood-Based Genetic Biomarker for Comorbid Major Depressive Disorder (MDD) and PTSD

Ravi et al. | Aug 22, 2018

Upregulation of the Ribosomal Pathway as a Potential  Blood-Based Genetic Biomarker for Comorbid Major Depressive Disorder (MDD) and PTSD

Major Depressive Disorder (MDD), and Post-Traumatic Stress Disorder (PTSD) are two of the fastest growing comorbid diseases in the world. Using publicly available datasets from the National Institute for Biotechnology Information (NCBI), Ravi and Lee conducted a differential gene expression analysis using 184 blood samples from either control individuals or individuals with comorbid MDD and PTSD. As a result, the authors identified 253 highly differentially-expressed genes, with enrichment for proteins in the gene ontology group 'Ribosomal Pathway'. These genes may be used as blood-based biomarkers for susceptibility to MDD or PTSD, and to tailor treatments within a personalized medicine regime.

Read More...

Cutibacterium acnes sequence space topology implicates recA and guaA as potential virulence factors

Bohdan et al. | May 01, 2025

<i>Cutibacterium acnes</i> sequence space topology implicates <i>recA</i> and <i>guaA</i> as potential virulence factors
Image credit: Bohdan and Platje 2025

Cutibacterium acnes is a bacterium believed to play an important role in the pathogenesis of common skin diseases such as acne vulgaris. Currently, acne is known to be associated with strains from the type IA1 and IC clades of C. acnes, while those from the type IA2, IB, II, and III phylogroups are associated with skin health. This is the first study to explore the sequence space of individual gene products of different C. acnes phylogroups. Our analysis compared the sequence space topology of virulence factors to proteins with unknown functions and housekeeping proteins. We hypothesized that sequence space features of virulence factors are different from housekeeping protein features, which potentially provides an avenue to deduce unknown proteins’ functions. This proposition should be confirmed based on further experimental outcomes. A notable similarity in the sequence spaces’ topological features of previously known as housekeeping proteins encoded by recA and guaA genes to ‘putative virulence’ genes camp2 and tly was observed. Our research suggests further investigation of recA and guaA’s potential virulence properties to better understand acne pathogenesis and develop more targeted acne treatments.

Read More...

Refinement of Single Nucleotide Polymorphisms of Atopic Dermatitis related Filaggrin through R packages

Naravane et al. | Oct 12, 2022

Refinement of Single Nucleotide Polymorphisms of Atopic Dermatitis related Filaggrin through R packages

In the United States, there are currently 17.8 million affected by atopic dermatitis (AD), commonly known as eczema. It is characterized by itching and skin inflammation. AD patients are at higher risk for infections, depression, cancer, and suicide. Genetics, environment, and stress are some of the causes of the disease. With the rise of personalized medicine and the acceptance of gene-editing technologies, AD-related variations need to be identified for treatment. Genome-wide association studies (GWAS) have associated the Filaggrin (FLG) gene with AD but have not identified specific problematic single nucleotide polymorphisms (SNPs). This research aimed to refine known SNPs of FLG for gene editing technologies to establish a causal link between specific SNPs and the diseases and to target the polymorphisms. The research utilized R and its Bioconductor packages to refine data from the National Center for Biotechnology Information's (NCBI's) Variation Viewer. The algorithm filtered the dataset by coding regions and conserved domains. The algorithm also removed synonymous variations and treated non-synonymous, frameshift, and nonsense separately. The non-synonymous variations were refined and ordered by the BLOSUM62 substitution matrix. Overall, the analysis removed 96.65% of data, which was redundant or not the focus of the research and ordered the remaining relevant data by impact. The code for the project can also be repurposed as a tool for other diseases. The research can help solve GWAS's imprecise identification challenge. This research is the first step in providing the refined databases required for gene-editing treatment.

Read More...