
The authors explore factors affecting graduation rates of students learning English as a second language across Florida counties.
Read More...English learner status in Florida public schools is correlated with significantly lower graduation rates
The authors explore factors affecting graduation rates of students learning English as a second language across Florida counties.
Read More...Impact of Kindles4Covid Virtual Reading Buddies Program on reading frequency and social connections
With the COVID-19 pandemic necessitating the transition to remote learning, disruption to daily school routine has impacted educational experiences on a global scale. As a result, it has potentially worsened reading achievement gaps typically exacerbated by long summer months. To address literacy skill retention and pandemic-induced social isolation, the non-profit organization ByKids4Kids has created a reading program, “Kindles4Covid Virtual Reading Buddies Program,” to instill a structure for youth to read together and connect with the convenience of Amazon Kindle devices. In this article, the authors determine the efficacy of their invaluable program by assessing changes in reading frequency and self-reported connectedness among program participants.
Read More...A novel approach for early detection of Alzheimer’s disease using deep neural networks with magnetic resonance imaging
In the battle against Alzheimer's disease, early detection is critical to mitigating symptoms in patients. Here, the authors use a collection of MRI scans, layering with deep learning computer modeling, to investigate early stages of AD which can be hard to catch by human eye. Their model is successful, able to outperform previous models, and detected regions of interest in the brain for further consideration.
Read More...Prediction of molecular energy using Coulomb matrix and Graph Neural Network
With molecular energy being an integral element to the study of molecules and molecular interactions, computational methods to determine molecular energy are used for the preservation of time and resources. However, these computational methods have high demand for computer resources, limiting their widespread feasibility. The authors of this study employed machine learning to address this disadvantage, utilizing neural networks trained on different representations of molecules to predict molecular properties without the requirement of computationally-intensive processing. In their findings, the authors determined the Feedforward Neural Network, trained by two separate models, as capable of predicting molecular energy with limited prediction error.
Read More...Can the nucleotide content of a DNA sequence predict the sequence accessibility?
Sequence accessibility is an important factor affecting gene expression. Sequence accessibility or openness impacts the likelihood that a gene is transcribed and translated into a protein and performs functions and manifests traits. There are many potential factors that affect the accessibility of a gene. In this study, our hypothesis was that the content of nucleotides in a genetic sequence predicts its accessibility. Using a machine learning linear regression model, we studied the relationship between nucleotide content and accessibility.
Read More...Exploring the effects of diverse historical stock price data on the accuracy of stock price prediction models
Algorithmic trading has been increasingly used by Americans. In this work, we tested whether including the opening, closing, and highest prices in three supervised learning models affected their performance. Indeed, we found that including all three prices decreased the error of the prediction significantly.
Read More...Gradient boosting with temporal feature extraction for modeling keystroke log data
Although there has been great progress in the field of Natural language processing (NLP) over the last few years, particularly with the development of attention-based models, less research has contributed towards modeling keystroke log data. State of the art methods handle textual data directly and while this has produced excellent results, the time complexity and resource usage are quite high for such methods. Additionally, these methods fail to incorporate the actual writing process when assessing text and instead solely focus on the content. Therefore, we proposed a framework for modeling textual data using keystroke-based features. Such methods pay attention to how a document or response was written, rather than the final text that was produced. These features are vastly different from the kind of features extracted from raw text but reveal information that is otherwise hidden. We hypothesized that pairing efficient machine learning techniques with keystroke log information should produce results comparable to transformer techniques, models which pay more or less attention to the different components of a text sequence in a far quicker time. Transformer-based methods dominate the field of NLP currently due to the strong understanding they display of natural language. We showed that models trained on keystroke log data are capable of effectively evaluating the quality of writing and do it in a significantly shorter amount of time compared to traditional methods. This is significant as it provides a necessary fast and cheap alternative to increasingly larger and slower LLMs.
Read More...Cleaning up the world’s oceans with underwater laser imaging
Here recognizing the growing amount of plastic waste in the oceans, the authors sought to develop and test laser imaging for the identification of waste in water. They found that while possible, limitations such as increasing depth and water turbidity result in increasing blurriness in laser images. While their image processing methods were somewhat insufficient they identified recent methods to use deep learning-based techniques as a potential avenue to viability for this method.
Read More...Nintendo Da Vinci: A Novel Control System to Improve Performance in Robotic-Assisted Surgery
Complications of robotic-assisted surgery are on the rise, partly due to surgeons not receiving proper training. Al-Akash and Al-Akash hypothesized Nintendo JoyCon controls would improve surgical performance compared to the FDA-approved Da Vinci Surgical System with two user groups (doctor and gamer). Their results show that implementing a Nintendo JoyCon control system is associated with improved surgical performance and learning rate compared to the Da Vinci Surgical System.
Read More...Predicting smoking status based on RNA sequencing data
Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.
Read More...