Browse Articles

Vineyard vigilance: Harnessing deep learning for grapevine disease detection

Mandal et al. | Aug 21, 2024

Vineyard vigilance: Harnessing deep learning for grapevine disease detection

Globally, the cultivation of 77.8 million tons of grapes each year underscores their significance in both diets and agriculture. However, grapevines face mounting threats from diseases such as black rot, Esca, and leaf blight. Traditional detection methods often lag, leading to reduced yields and poor fruit quality. To address this, authors used machine learning, specifically deep learning with Convolutional Neural Networks (CNNs), to enhance disease detection.

Read More...

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

Balaji et al. | Sep 11, 2021

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

The purpose of the study was to determine whether graph-based machine learning techniques, which have increased prevalence in the last few years, can accurately classify data into one of many clusters, while requiring less labeled training data and parameter tuning as opposed to traditional machine learning algorithms. The results determined that the accuracy of graph-based and traditional classification algorithms depends directly upon the number of features of each dataset, the number of classes in each dataset, and the amount of labeled training data used.

Read More...

Redesigning an Experiment to Determine the Coefficient of Friction

Hu et al. | Jun 27, 2016

Redesigning an Experiment to Determine the Coefficient of Friction

In a common high school experiment to measure friction coefficients, a weighted mass attached to a spring scale is dragged across a surface at a constant velocity. While the constant velocity is necessary for an accurate measurement, it can be difficult to maintain and this can lead to large errors. Here, the authors designed a new experiment to measure friction coefficients in the classroom using only static force and show that their method has a lower standard deviation than the traditional experiment.

Read More...

Gradient boosting with temporal feature extraction for modeling keystroke log data

Barretto et al. | Oct 04, 2024

Gradient boosting with temporal feature extraction for modeling keystroke log data
Image credit: Barretto and Barretto 2024.

Although there has been great progress in the field of Natural language processing (NLP) over the last few years, particularly with the development of attention-based models, less research has contributed towards modeling keystroke log data. State of the art methods handle textual data directly and while this has produced excellent results, the time complexity and resource usage are quite high for such methods. Additionally, these methods fail to incorporate the actual writing process when assessing text and instead solely focus on the content. Therefore, we proposed a framework for modeling textual data using keystroke-based features. Such methods pay attention to how a document or response was written, rather than the final text that was produced. These features are vastly different from the kind of features extracted from raw text but reveal information that is otherwise hidden. We hypothesized that pairing efficient machine learning techniques with keystroke log information should produce results comparable to transformer techniques, models which pay more or less attention to the different components of a text sequence in a far quicker time. Transformer-based methods dominate the field of NLP currently due to the strong understanding they display of natural language. We showed that models trained on keystroke log data are capable of effectively evaluating the quality of writing and do it in a significantly shorter amount of time compared to traditional methods. This is significant as it provides a necessary fast and cheap alternative to increasingly larger and slower LLMs.

Read More...

Predicting smoking status based on RNA sequencing data

Yang et al. | Aug 30, 2024

Predicting smoking status based on RNA sequencing data
Image credit: Yang and Stanley 2024

Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.

Read More...

Adults’ attitudes toward non-alcoholic beer purchases and consumption by children and adolescents

An et al. | Aug 23, 2024

Adults’ attitudes toward non-alcoholic beer purchases and consumption by children and adolescents
Image credit: The authors

Consumption of non-alcoholic beverages, like non-alcoholic beer, is growing in popularity in the United States. These beverages raise important societal questions, such as whether minors should be allowed to purchase or consume non-alcoholic beer. An and An investigate this issue by surveying adults to see if they support minors purchasing and consuming non-alcoholic beer.

Read More...

An explainable model for content moderation

Cao et al. | Aug 16, 2023

An explainable model for content moderation

The authors looked at the ability of machine learning algorithms to interpret language given their increasing use in moderating content on social media. Using an explainable model they were able to achieve 81% accuracy in detecting fake vs. real news based on language of posts alone.

Read More...

Comparing the Biodegradability of Petroleum-based Plastic with a Novel, Sustainable Bio-plastic Alternative

Van Note et al. | Dec 02, 2020

Comparing the Biodegradability of Petroleum-based Plastic with a Novel, Sustainable Bio-plastic Alternative

In this research, a novel bioplastic inclusive of bamboo tannins and chitosan is selected from more than 60 trial formula variations based on resulting strength, fatigue, and transparency attributes. The biodegradability of the finalized bioplastic is compared to that of conventional polyethylene, in addition to investigating its solubility and water absorbance. This research displays the potential of a legitimate, fully biodegradable plastic alternative to current marketplace bioplastics.

Read More...

DyGS: A Dynamic Gene Searching Algorithm for Cancer Detection

Wang et al. | Jun 05, 2018

DyGS: A Dynamic Gene Searching Algorithm for Cancer Detection

Wang and Gong developed a novel dynamic gene-searching algorithm called Dynamic Gene Search (DyGS) to create a gene panel for each of the 12 cancers with the highest annual incidence and death rate. The 12 gene panels the DyGS algorithm selected used only 3.5% of the original gene mutation pool, while covering every patient sample. About 40% of each gene panel is druggable, which indicates that the DyGS-generated gene panels can be used for early cancer detection as well as therapeutic targets in treatment methods.

Read More...