Browse Articles

Optimizing data augmentation to improve machine learning accuracy on endemic frog calls

Anand et al. | Mar 09, 2025

Image credit: Anand and Sampath 2025

The Western Ghats mountain chain on the Indian peninsula, a UNESCO World Heritage site, is home to about 200 frog species, 89 of which are endemic. Because each frog species has a distinctive vocalization, calls can be used for species recognition. Manually surveying frogs at night, in the rain, in forests inhabited by elephants and big cats is difficult, so the ability to autonomously record ambient soundscapes and identify species is essential. An effective machine learning (ML) species classifier requires substantial training data from this area, but the available dataset of frog vocalizations from this region has very few audio recordings per species, so improving an ML model's performance with limited data is necessary. The goal of this study was to assess data augmentation techniques on this dataset. We analyzed the effects of four data augmentation techniques (Time Shifting, Noise Injection, Spectral Augmentation, and Test-Time Augmentation), individually and in combination, on the frog vocalization data and on the public environmental sounds dataset (ESC-50). The relative accuracy gain from combining the techniques grew as the size of the dataset decreased. The combination of all four techniques improved the ML model's classification accuracy on the frog calls dataset by 94%. This study established a data augmentation approach that maximizes classification accuracy with sparse frog call recordings, making it possible to build a real-world automated system for identifying frog species in the field. Such a system can significantly help in the conservation of frog species in this vital biodiversity hotspot.
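The four techniques named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, parameters, and the choice of circular shifting, Gaussian noise, and frequency/time masking are all assumptions made for illustration, and `predict` stands in for any trained classifier that returns class probabilities.

```python
import random


def time_shift(signal, max_shift, rng):
    """Circularly shift a 1-D signal by a random offset in [-max_shift, max_shift]."""
    if not signal:
        return []
    k = rng.randint(-max_shift, max_shift) % len(signal)
    return signal[-k:] + signal[:-k] if k else list(signal)


def inject_noise(signal, noise_std, rng):
    """Add zero-mean Gaussian noise to every sample (assumed noise model)."""
    return [x + rng.gauss(0.0, noise_std) for x in signal]


def mask_spectrogram(spec, max_freq_width, max_time_width, rng):
    """Zero one random frequency band and one random time span.

    `spec` is a list of rows (frequency bins), each a list of time frames.
    Masking in this style is an assumed form of spectral augmentation.
    """
    n_freq, n_time = len(spec), len(spec[0])
    f_w = rng.randint(0, min(max_freq_width, n_freq))
    t_w = rng.randint(0, min(max_time_width, n_time))
    f0 = rng.randint(0, n_freq - f_w)
    t0 = rng.randint(0, n_time - t_w)
    return [
        [0.0 if (f0 <= f < f0 + f_w) or (t0 <= t < t0 + t_w) else v
         for t, v in enumerate(row)]
        for f, row in enumerate(spec)
    ]


def predict_with_tta(predict, signal, n_views, rng, max_shift=100, noise_std=0.005):
    """Test-time augmentation: average class probabilities over the original
    signal plus `n_views` randomly augmented copies of it."""
    views = [list(signal)]
    for _ in range(n_views):
        views.append(inject_noise(time_shift(signal, max_shift, rng), noise_std, rng))
    probs = [predict(v) for v in views]
    n_classes = len(probs[0])
    return [sum(p[c] for p in probs) / len(probs) for c in range(n_classes)]
```

The first three functions would be applied to training examples, while `predict_with_tta` is used only at inference time. Passing a seeded `random.Random` instance keeps the augmentations reproducible.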

Survival analysis in cardiovascular epidemiology: nexus between heart disease and mortality

Lachwani et al. | Oct 23, 2024

In 2021, over 20 million people died from cardiovascular diseases, highlighting the need for a deeper understanding of factors influencing heart failure outcomes. This study examined multiple variables affecting mortality after heart failure, using random forest models to identify time, serum creatinine, and ejection fraction as key predictors. These findings could contribute to personalized medicine, improving survival rates by tailoring treatment strategies for heart failure patients.

Investigating ecosystem resiliency in different flood zones of south Brooklyn, New York

Ng et al. | Mar 23, 2024

Image credit: Ng and Zheng et al 2024

With climate change and rising sea levels, south Brooklyn is exposed to massive flooding and intense precipitation. Previous research found that flooding shifts plant species distribution, decreases soil pH, and increases salt concentration and nitrogen, phosphorus, and potassium levels. The authors predicted a decreasing trend from Zone 1 to 6: high-pH, high-salt, and high-nutrient soil in more flood-prone areas, shifting to low-pH, low-salt, and low-nutrient soil in less flood-prone areas. They performed DNA barcoding to identify plant species inhabiting the flood zones, expecting plants' salt tolerance and moisture uptake to decrease from Zone 1 to 6. Furthermore, they predicted an increase in invasive species, ultimately resulting in a decrease in biodiversity. After barcoding, they researched existing information on each species' invasiveness, ideal soil, pH tolerance, and salt tolerance. They performed soil analyses to measure pH, nitrogen (N), phosphorus (P), and potassium (K) levels. For N and P levels, they found a general decreasing trend from Zone 1 to 6, with low and moderate statistical significance, respectively. Previous studies found that soil moisture can increase N and P uptake, helping plants adopt efficient resource-use strategies and reduce water stress from flooding. Although plant characteristics were distributed throughout all zones, demonstrating overall diversity, the soil analyses hinted at a possible trend of plants adapting to increased flooding. More extensive research is needed to map these trends comprehensively. Ultimately, investigating trends between flood zones and the prevalence of different species will help guide solutions for weathering climate change and protecting biodiversity in Brooklyn.

Predicting baseball pitcher efficacy using physical pitch characteristics

Oberoi et al. | Jan 11, 2024

Image credit: Antoine Schibler

Here, the authors sought to develop a new metric to evaluate the efficacy of baseball pitchers using machine learning models. They found that the frequency of balls was the most predictive feature for their walks/hits allowed per inning (WHIP) metric. While their machine learning models did not identify a defining trait, such as high velocity, spin rate, or types of pitches, they found that consistently pitching within the strike zone resulted in significantly lower WHIPs.
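For readers unfamiliar with the metric, WHIP is conventionally computed as walks plus hits divided by innings pitched. The sketch below shows that arithmetic alongside a simple ball-frequency feature. The function names and the definition of ball frequency as balls per pitch thrown are assumptions for illustration, not the authors' feature definitions.

```python
def whip(walks, hits, innings_pitched):
    """Walks plus hits allowed per inning pitched (lower is better)."""
    if innings_pitched <= 0:
        raise ValueError("innings_pitched must be positive")
    return (walks + hits) / innings_pitched


def ball_rate(balls, total_pitches):
    """Fraction of pitches called balls (hypothetical feature definition)."""
    if total_pitches <= 0:
        raise ValueError("total_pitches must be positive")
    return balls / total_pitches


# Example with made-up numbers: 50 walks and 150 hits over 200 innings.
example_whip = whip(walks=50, hits=150, innings_pitched=200.0)  # 1.0
```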

Time-Efficient and Low-Cost Neural Network to detect plant disease on leaves and reduce food loss and waste

Singh et al. | Apr 24, 2023

About 25% of the food grown never reaches consumers due to spoilage, and 11.5 billion pounds of produce from gardens are wasted every year. Current solutions involve farmers manually looking for and treating diseased crops. These methods of tending crops are neither time-efficient nor feasible. I used a convolutional neural network to identify signs of plant disease on leaves for garden owners and farmers.

Differential privacy in machine learning for traffic forecasting

Vinay et al. | Dec 21, 2022

In this paper, we measured the privacy budgets and utilities of different differentially private mechanisms combined with different machine learning models that forecast traffic congestion at future timestamps. We expected artificial neural networks (ANNs) combined with the Staircase mechanism to perform best at every value in the privacy budget range, especially at medium-high values of the privacy budget. We used Autoregressive Integrated Moving Average (ARIMA) and neural network models to forecast, and added differentially private Laplacian, Gaussian, and Staircase noise to our datasets. We tested two real traffic congestion datasets, experimented with the different models, and examined their utility for different privacy budgets. We found that a favorable combination for this application was neural networks with the Staircase mechanism. Our findings identify the optimal models for difficult time series forecasting tasks and can be applied to non-traffic applications such as disease tracking and population growth.
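As a rough illustration of how a privacy budget controls added noise, the sketch below applies the standard Laplace mechanism, where the noise scale is the sensitivity divided by the privacy budget ε. How the authors calibrated and applied noise is not stated in the abstract, so the function names, the per-value sensitivity, and the element-wise application are assumptions. The Gaussian and Staircase mechanisms are not shown.

```python
import random


def laplace_noise(scale, rng):
    """Draw from Laplace(0, scale) as the difference of two i.i.d.
    exponential variables with mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def privatize(values, sensitivity, epsilon, rng):
    """Add Laplace noise with scale = sensitivity / epsilon to each value.

    A smaller epsilon (tighter privacy budget) means a larger noise scale
    and therefore lower utility for any downstream forecasting model.
    """
    if epsilon <= 0 or sensitivity <= 0:
        raise ValueError("epsilon and sensitivity must be positive")
    scale = sensitivity / epsilon
    return [v + laplace_noise(scale, rng) for v in values]


def mean_abs_error(original, noisy):
    """Simple utility measure: average absolute distortion from the noise."""
    return sum(abs(a - b) for a, b in zip(original, noisy)) / len(original)
```

Comparing `mean_abs_error` across several values of ε on the same series gives a simple picture of the privacy-utility trade-off that the study examines with full forecasting models.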

The knowledge and perception of opioid abuse and its long-term effects among high schoolers

Shroff et al. | Nov 27, 2021

Because adolescents are susceptible to opioid misuse, the authors sought to determine whether 9th and 12th graders differ in their perception and knowledge of the opioid crisis. An educational intervention trial was conducted with the 9th graders, and surveys were used to identify its effects. Although the authors acknowledge a small sample size, their results suggest that there are gaps in adolescents' knowledge of opioid misuse and its long-term effects that could be addressed with further education.

Effects of an Informational Waste Management App on a User’s Waste Disposal Habits

Rao et al. | Apr 28, 2021

While 75% of waste in the United States is stated to be recyclable, only about 34% truly is. This project takes a stance to combat the pillars of mismanaged waste through a modern means of convenience: the TracedWaste app. The purpose of this study was to identify how individuals' waste disposal habits improved and knowledge increased (i.e., correctly disposing of waste and understanding the negative effects of incorrect waste disposal) due to their use of an informational waste management app, as measured by a survey using a 1-5 Likert scale. The results showed that the TracedWaste app helped conserve abundant resources such as energy and wood, decrease carbon emissions, and minimize financial toll, all through reducing individual impact.

Phytochemical Analysis of Amaranthus spinosus Linn.: An in vitro Analysis

Sharma et al. | Mar 20, 2021

Mainstream cancer treatments, which include radiotherapy and chemotherapeutic drugs, are known to induce oxidative damage to healthy somatic cells due to the liberation of harmful free radicals. In order to avert this, physiological antioxidants must be complemented with external antioxidants. Here the authors performed a preliminary phytochemical screen to identify alkaloids, saponins, flavonoids, polyphenols, and tannins in all parts of the Amaranthus spinosus Linn. plant. This paper describes the preparation of this crude extract and assesses its antioxidant properties for potential use in complementary cancer treatment.

High-throughput virtual screening of novel dihydropyrimidine monastrol analogs reveals robust structure-activity relationship to kinesin Eg5 binding thermodynamics

Shern et al. | Jan 20, 2021

As cancer continues to take millions of lives worldwide, the need to create effective therapeutics for the disease persists. The kinesin Eg5 assembly motor protein is a promising target for cancer therapeutics as inhibition of this protein leads to cell cycle arrest. Monastrol, a small dihydropyrimidine-based molecule capable of inhibiting the kinesin Eg5 function, has attracted the attention of medicinal chemists with its potency, affinity, and specificity to the highly targeted loop5/α2/α3 allosteric binding pocket. In this work, we employed high-throughput virtual screening (HTVS) to identify potential small molecule Eg5 inhibitors from a designed set of novel dihydropyrimidine analogs structurally similar to monastrol.
