Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.
In the United States, there are currently 17.8 million affected by atopic dermatitis (AD), commonly known as eczema. It is characterized by itching and skin inflammation. AD patients are at higher risk for infections, depression, cancer, and suicide. Genetics, environment, and stress are some of the causes of the disease. With the rise of personalized medicine and the acceptance of gene-editing technologies, AD-related variations need to be identified for treatment. Genome-wide association studies (GWAS) have associated the Filaggrin (FLG) gene with AD but have not identified specific problematic single nucleotide polymorphisms (SNPs). This research aimed to refine known SNPs of FLG for gene editing technologies to establish a causal link between specific SNPs and the diseases and to target the polymorphisms. The research utilized R and its Bioconductor packages to refine data from the National Center for Biotechnology Information's (NCBI's) Variation Viewer. The algorithm filtered the dataset by coding regions and conserved domains. The algorithm also removed synonymous variations and treated non-synonymous, frameshift, and nonsense separately. The non-synonymous variations were refined and ordered by the BLOSUM62 substitution matrix. Overall, the analysis removed 96.65% of data, which was redundant or not the focus of the research and ordered the remaining relevant data by impact. The code for the project can also be repurposed as a tool for other diseases. The research can help solve GWAS's imprecise identification challenge. This research is the first step in providing the refined databases required for gene-editing treatment.
The authors looked at the ability of machine learning algorithms to interpret language given their increasing use in moderating content on social media. Using an explainable model they were able to achieve 81% accuracy in detecting fake vs. real news based on language of posts alone.
The sugar-rich modern diet underlies a suite of metabolic disorders, most common of which is diabetes. Accurately reporting the sugar content of pre-packaged food and drink items can help consumers track their sugar intake better, facilitating more cognisant and, eventually, moderate consumption of high-sugar items. In this article, the authors examine the effect of several variables on the accuracy of Fehling's reaction, a colorimetric reaction used to estimate sugar content.
Commercial Concentrated Animal Feeding Operations (CAFOs) produce large quantities of waste material from the animals being housed in them. These feedlots found across the United States contain livestock that produce waste that results in hazardous runoff. This study examines how CAFOs affect water sources by testing for Escherichia Coli (E. coli) content in bodies of water near CAFOs.
This study is centered around developing biofortification methods: the authors test whether the amount of calcium available to growing crops translates into more calcium present in the crops.
This study sought to determine if there is an association between the single nucleotide polymorphism rs7528684 of the Fc receptor-like-3 (FCRL3) gene and asthma or allergic rhinitis (AR). Based on previous studies in an Asian population, we hypothesized that participants with an AA genotype of FCRL3 would be more likely to have asthma and/or allergic rhinitis. To test the hypothesis, surveys were administered to participants, and genotyping was performed on spit samples via PCR, restriction digest, and gel electrophoresis.
Cystic fibrosis is a genetic disease caused by mutations in the CFTR gene. In this paper, the authors attempt to identify variations in stretches of up to 8 nucleotides in the protein-coding portions of the CFTR gene that are associated with disease development. This would allow screening of newborns or even fetuses in utero to determine the likelihood they develop cystic fibrosis.
The goal of this project was to assess the relationships among low myopia, behavioral and demographic factors, and a single-nucleotide polymorphism (SNP) in the TGFβ1 gene.
Although the 5-year survival rate for colorectal cancer is below 10%, it increases to greater than 90% if it is diagnosed early. We hypothesized from our research that analyzing non-synonymous single nucleotide variants (SNVs) in a patient's exome sequence would be an indicator for high genetic risk of developing colorectal cancer.