Although natural language processing (NLP) has progressed greatly over the last few years, particularly with the development of attention-based models, comparatively little research has addressed modeling keystroke log data. State-of-the-art methods handle textual data directly, and while this has produced excellent results, their time complexity and resource usage are high. Additionally, these methods focus solely on content and fail to incorporate the actual writing process when assessing text. Therefore, we proposed a framework for modeling textual data using keystroke-based features. Such methods attend to how a document or response was written rather than to the final text that was produced. These features differ greatly from those extracted from raw text, but they reveal information that is otherwise hidden. We hypothesized that pairing efficient machine learning techniques with keystroke log information would produce results comparable to those of transformer techniques, models that weight the different components of a text sequence by varying amounts of attention, in far less time. Transformer-based methods currently dominate the field of NLP because of the strong understanding of natural language they display. We showed that models trained on keystroke log data can effectively evaluate the quality of writing and do so in significantly less time than traditional methods. This is significant because it provides a fast and cheap alternative to increasingly large and slow LLMs.
Assessing CDK5 as a Nanomotor for Chemotactic Drug Delivery
Enzyme chemotaxis is a thermodynamic phenomenon in which enzymes move along a substrate concentration gradient towards regions of higher substrate concentration. It can be used to steer nanovehicles towards targets along naturally occurring substrate gradients. In patients with Alzheimer’s disease, a gradient of tau protein forms in the bloodstream. Tau protein is a substrate of the enzyme CDK5, which catalyzes the phosphorylation of tau protein and can travel by chemotaxis along tau protein gradients towards increasing concentrations of tau and amyloid-beta proteins. The authors hypothesized that CDK5 would be able to overcome the barrier of Brownian motion, and they developed a quantitative model using Michaelis-Menten kinetics to define the parameters needed to confirm and characterize CDK5’s chemotactic behavior and to establish its utility in drug delivery and other applications.
Prediction of diabetes using supervised classification
The authors develop and test a machine learning algorithm for predicting diabetes diagnoses.
Evaluating machine learning algorithms to classify forest tree species through satellite imagery
Here, seeking to identify an optimal method to classify tree species through remote sensing, the authors used a few machine learning algorithms to classify forest tree species through multispectral satellite imagery. They found the Random Forest algorithm to most accurately classify tree species, with the potential to improve model training and inference based on the inclusion of other tree properties.
Examining Heat Recovery from Electric Light Bulbs Using Thermoelectric Generators
Here the author investigates how much heat energy is output and recovered from a conventional electric light bulb.
A novel approach for predicting Alzheimer’s disease using machine learning on DNA methylation in blood
Here, recognizing the difficulty associated with tracking the progression of dementia, the authors used machine learning models to distinguish among cognitive normalcy, mild cognitive impairment, and Alzheimer's disease based on blood DNA methylation levels, sex, and age. Using four machine learning models and two dataset dimensionality reduction methods, they achieved an accuracy of 53.33%.
The effect of activation function choice on the performance of convolutional neural networks
With the advance of technology, artificial intelligence (AI) is now applied widely in society. In the study of AI, machine learning (ML) is a subfield in which a machine learns to be better at performing certain tasks through experience. This work focuses on the convolutional neural network (CNN), a framework of ML, applied to an image classification task. Specifically, we analyzed the performance of the CNN as the type of neural activation function changes.
Determining the best convolutional neural network for identifying tuberculosis and pneumonia in chest x-rays
To best identify tuberculosis and pneumonia diagnoses in chest x-rays, the authors compare different deep learning convolutional neural networks.
Predicting asthma-related emergency department visits and hospitalizations with machine learning techniques
Seeking to investigate the effects of ambient pollutants on human respiratory health, here the authors used machine learning to examine asthma in Los Angeles County, an area with substantial pollution. By using machine learning models and classification techniques, the authors identified that nitrogen dioxide and ozone levels were significantly correlated with asthma hospitalizations. Based on an identified seasonal surge in asthma hospitalizations, the authors suggest future directions to improve machine learning modeling to investigate these relationships.
Testing Different Polymers and Boron Nitride Nanotube Properties in Fabrication of Ion-selective Membranes
One largely untapped source of clean energy is the osmotic gradient that forms where freshwater and saltwater mix, for example at estuaries. To harness such energy, charge-selective membranes are needed to separate the anions and cations in saltwater, establishing an electric potential like that of a battery. The objective of this study was twofold: to investigate the creation of the polymer matrix and to test the properties of boron nitride nanotubes, as both are essential in the creation of an ion-selective membrane. Out of three polymer samples tested in this study, the mixture known as Soltech 704 showed the best resistance to etching, as well as the highest UV cure rate.