Articles | Journal of Emerging Investigators

Mitigating skin color bias in dermatology AI using CycleGAN-based data augmentation

Kannan et al. | Jun 24, 2026

This study investigates skin tone bias in artificial intelligence models used for dermatological disease classification and evaluates a CycleGAN-based data augmentation approach to improve diagnostic performance on darker skin types. We generated synthetic dark-skinned images to enhance dataset diversity and compared model performance before and after augmentation. The results demonstrate that augmentation with synthetic dermatological images can help reduce disparities in diagnostic performance across skin tones, highlighting a practical strategy for improving fairness in dermatology AI systems.

Optimizing data augmentation to improve machine learning accuracy on endemic frog calls

Anand et al. | Mar 09, 2025

The mountain chain of the Western Ghats on the Indian peninsula, a UNESCO World Heritage site, is home to about 200 frog species, 89 of which are endemic. Distinctive to each frog species, their vocalizations can be used for species recognition. Manually surveying frogs at night during the rain in elephant and big cat forests is difficult, so being able to autonomously record ambient soundscapes and identify species is essential. An effective machine learning (ML) species classifier requires substantial training data from this area. The goal of this study was to assess data augmentation techniques on a dataset of frog vocalizations from this region, which has a minimal number of audio recordings per species. Consequently, enhancing an ML model’s performance with limited data is necessary. We analyzed the effects of four data augmentation techniques (Time Shifting, Noise Injection, Spectral Augmentation, and Test-Time Augmentation) individually and their combined effect on the frog vocalization data and the public environmental sounds dataset (ESC-50). The effect of combined data augmentation techniques improved the model's relative accuracy as the size of the dataset decreased. The combination of all four techniques improved the ML model’s classification accuracy on the frog calls dataset by 94%. This study established a data augmentation approach to maximize the classification accuracy with sparse data of frog call recordings, thereby creating a possibility to build a real-world automated field frog species identifier system. Such a system can significantly help in the conservation of frog species in this vital biodiversity hotspot.

Transfer learning and data augmentation in osteosarcoma cancer detection

Chu et al. | Jun 03, 2023

Osteosarcoma is a type of bone cancer that affects young adults and children. Early diagnosis of osteosarcoma is crucial to successful treatment. The current methods of diagnosis, which include imaging tests and biopsy, are time consuming and prone to human error. Hence, we used deep learning to extract patterns and detect osteosarcoma from histological images. We hypothesized that the combination of two different technologies (transfer learning and data augmentation) would improve the efficacy of osteosarcoma detection in histological images. The dataset used for the study consisted of histological images for osteosarcoma and was quite imbalanced as it contained very few images with tumors. Since transfer learning uses existing knowledge for the purpose of classification and detection, we hypothesized it would be proficient on such an imbalanced dataset. To further improve our learning, we used data augmentation to include variations in the dataset. We further evaluated the efficacy of different convolutional neural network models on this task. We obtained an accuracy of 91.18% using the transfer learning model MobileNetV2 as the base model with various geometric transformations, outperforming the state-of-the-art convolutional neural network based approach.

Computational Study of Erosion Effects on a Triangular Aerofoil's Aerodynamics at Reynolds number of 10,000

Liuhan et al. | Mar 28, 2025

This study examined the impact of erosion on the performance of a triangular aerofoil at a low Reynolds number (Re = 10,000), relevant for harsh conditions like those on Mars.

Comparing transformer and RNN models in BCIs for handwritten text decoding via neural signals

Hari et al. | Jan 24, 2025

Brain-Computer Interface (BCI) allows users, especially those with paralysis, to control devices through brain activity. This study explored using a custom transformer model to decode neural signals into handwritten text for individuals with limited motor skills, comparing its performance to a traditional RNN-based BCI.

Differences in postoperative satisfaction between orthopedic and cosmetic patients

Lanza et al. | Jul 07, 2023

In this study, the authors investigate differences in psychological outcomes from patients who undergo different surgical procedures.

Browse Articles

Mitigating skin color bias in dermatology AI using CycleGAN-based data augmentation

Optimizing data augmentation to improve machine learning accuracy on endemic frog calls

Transfer learning and data augmentation in osteosarcoma cancer detection

Computational Study of Erosion Effects on a Triangular Aerofoil's Aerodynamics at Reynolds number of 10,000

Comparing transformer and RNN models in BCIs for handwritten text decoding via neural signals

Differences in postoperative satisfaction between orthopedic and cosmetic patients

Search Articles

Popular Tags

Browse Articles

Mitigating skin color bias in dermatology AI using CycleGAN-based data augmentation

Optimizing data augmentation to improve machine learning accuracy on endemic frog calls

Transfer learning and data augmentation in osteosarcoma cancer detection

Computational Study of Erosion Effects on a Triangular Aerofoil's Aerodynamics at Reynolds number of 10,000

Comparing transformer and RNN models in BCIs for handwritten text decoding via neural signals

Differences in postoperative satisfaction between orthopedic and cosmetic patients

Search Articles

Category

School Level

Popular Tags