Browse Articles

A Taste of Sweetness in Bioplastics

Tsai et al. | Apr 05, 2019

A Taste of Sweetness in Bioplastics

Sweet potatoes are one of the most common starches in Taiwan, and sweet potato peels hold significant potential to make biodegradable plastics which can alleviate the environmental impact of conventional petroleum-based plastics. In this paper, Tsai et al created starch-based bioplastics derived from sweet potato peels and manipulated the amount of added glycerol to alter the plastic’s strength and flexibility properties. Their results indicated that higher concentrations of glycerol yield more malleable plastics, providing insights into how recycled agricultural waste material might be used to slow down the rate of pollution caused by widespread production of conventional plastics.

Read More...

Can the attributes of an app predict its rating?

Feng et al. | Jul 03, 2024

Can the attributes of an app predict its rating?
Image credit: Mika Baumeister

In this article the authors looked at different attributes of apps within the Google Play store to determine how those may impact the overall app rating out of five stars. They found that review count, amount of storage needed and when the app was last updated to be the most influential factors on an app's rating.

Read More...

An explainable model for content moderation

Cao et al. | Aug 16, 2023

An explainable model for content moderation

The authors looked at the ability of machine learning algorithms to interpret language given their increasing use in moderating content on social media. Using an explainable model they were able to achieve 81% accuracy in detecting fake vs. real news based on language of posts alone.

Read More...

An analysis of junior rower performance and how it is affected by rower's features

Biller et al. | Jan 07, 2022

An analysis of junior rower performance and how it is affected by rower's features

In this study, with consideration for the increasing participation of high school students in indoor rowing, the authors analyzed World Indoor Rowing Championship data. Statistical analysis revealed two key features that can determine the performance of a rower as well as increasing competitiveness in nearly all categories considered. They conclude by offering a 2000-meter ergometer time distribution that can help junior rowers assess their current performance relative to the world competition.

Read More...

Using economic indicators to create an empirical model of inflation

Kasera et al. | Dec 01, 2022

Using economic indicators to create an empirical model of inflation

Here, seeking to understand the correlation of 50 of the most important economic indicators with inflation, the authors used a rolling linear regression to identify indicators with the most significant correlation with the Month over Month Consumer Price Index Seasonally Adjusted (CPI). Ultimately the concluded that the average gasoline price, U.S. import price index, and 5-year market expected inflation had the most significant correlation with the CPI.

Read More...

Predicting smoking status based on RNA sequencing data

Yang et al. | Aug 30, 2024

Predicting smoking status based on RNA sequencing data
Image credit: Yang and Stanley 2024

Given an association between nicotine addiction and gene expression, we hypothesized that expression of genes commonly associated with smoking status would have variable expression between smokers and non-smokers. To test whether gene expression varies between smokers and non-smokers, we analyzed two publicly-available datasets that profiled RNA gene expression from brain (nucleus accumbens) and lung tissue taken from patients identified as smokers or non-smokers. We discovered statistically significant differences in expression of dozens of genes between smokers and non-smokers. To test whether gene expression can be used to predict whether a patient is a smoker or non-smoker, we used gene expression as the training data for a logistic regression or random forest classification model. The random forest classifier trained on lung tissue data showed the most robust results, with area under curve (AUC) values consistently between 0.82 and 0.93. Both models trained on nucleus accumbens data had poorer performance, with AUC values consistently between 0.65 and 0.7 when using random forest. These results suggest gene expression can be used to predict smoking status using traditional machine learning models. Additionally, based on our random forest model, we proposed KCNJ3 and TXLNGY as two candidate markers of smoking status. These findings, coupled with other genes identified in this study, present promising avenues for advancing applications related to the genetic foundation of smoking-related characteristics.

Read More...