The authors looked at using genetic algorithms to look at the Indian labor market and what features might best explain any variation seen. They found that features such as economic growth and household consumption, among others, best explained variation.
Breast cancer is the most common cancer in women, with approximately 300,000 diagnosed with breast cancer in 2023. It ranks second in cancer-related deaths for women, after lung cancer with nearly 50,000 deaths. Scientists have identified important genetic mutations in genes like BRCA1 and BRCA2 that lead to the development of breast cancer, but previous studies were limited as they focused on specific populations. To overcome limitations, diverse populations and powerful statistical methods like genome-wide association studies and whole-genome sequencing are needed. Explainable artificial intelligence (XAI) can be used in oncology and breast cancer research to overcome these limitations of specificity as it can analyze datasets of diagnosed patients by providing interpretable explanations for identified patterns and predictions. This project aims to achieve technological and medicinal goals by using advanced algorithms to identify breast cancer subtypes for faster diagnoses. Multiple methods were utilized to develop an efficient algorithm. We hypothesized that an XAI approach would be best as it can assign scores to genes, specifically with a 90% success rate. To test that, we ran multiple trials utilizing XAI methods through the identification of class-specific and patient-specific key genes. We found that the study demonstrated a pipeline that combines multiple XAI techniques to identify potential biomarker genes for breast cancer with a 95% success rate.
Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.
In this article the authors created an interaction map of proteins involved in colorectal cancer to look for driver vs. non-driver genes. That is they wanted to see if they could determine what genes are more likely to drive the development and progression in colorectal cancer and which are present in altered states but not necessarily driving disease progression.
In this work, the authors develop an algorithm that solves the problem of efficient space travel between planets. This is a problem that could soon be of relevance as mankind continues to expand its exploration of outer space, and potentially attempt to inhabit it.
Engineered bacteria that degrade oil are currently being considered as a safe option for the treatment of oil spills. For this approach to be successful, the bacteria must effectively express oil-degrading genes they uptake as part of an external genoming vehicle called a "plasmid". Using a computational approach, the authors investigate plasmid-bacterium compatibility to find pairs that ensure high levels of gene expression.
In the United States, there are currently 17.8 million affected by atopic dermatitis (AD), commonly known as eczema. It is characterized by itching and skin inflammation. AD patients are at higher risk for infections, depression, cancer, and suicide. Genetics, environment, and stress are some of the causes of the disease. With the rise of personalized medicine and the acceptance of gene-editing technologies, AD-related variations need to be identified for treatment. Genome-wide association studies (GWAS) have associated the Filaggrin (FLG) gene with AD but have not identified specific problematic single nucleotide polymorphisms (SNPs). This research aimed to refine known SNPs of FLG for gene editing technologies to establish a causal link between specific SNPs and the diseases and to target the polymorphisms. The research utilized R and its Bioconductor packages to refine data from the National Center for Biotechnology Information's (NCBI's) Variation Viewer. The algorithm filtered the dataset by coding regions and conserved domains. The algorithm also removed synonymous variations and treated non-synonymous, frameshift, and nonsense separately. The non-synonymous variations were refined and ordered by the BLOSUM62 substitution matrix. Overall, the analysis removed 96.65% of data, which was redundant or not the focus of the research and ordered the remaining relevant data by impact. The code for the project can also be repurposed as a tool for other diseases. The research can help solve GWAS's imprecise identification challenge. This research is the first step in providing the refined databases required for gene-editing treatment.
Arsenic contamination in rice, caused by the use of arsenic-laden groundwater for irrigation, is a growing global concern, affecting over 150 million people. To address this, researchers hypothesized that genetically modifying rice plants with arsenic-resistant genes could reduce arsenic uptake and allow the plants to detoxify arsenic, making them safer to consume.
Here, seeking to identify a possible explanation for the more frequent diagnosis of autism spectrum disorder (ASD) in males than females, they sought to investigate a potential sex bias in the expression of ASD-associated genes. Based on their analysis, they identified 17 ASD-associated candidate genes that showed stronger collective sex-dependent expression.
Autism Spectrum Disorder (ASD) and Alzheimer's Disease (AD) are distinct conditions, but research suggests a link, as individuals with ASD are 2.5 times more likely to develop AD. A study employing genome-wide association studies and Mendelian randomization revealed shared genetic factors, particularly in synaptic regulation pathways, that may increase the risk of AD in those with ASD. These findings provide insights into the genetic underpinnings connecting the two disorders.