Articles | Journal of Emerging Investigators

Transfer Learning for Small and Different Datasets: Fine-Tuning A Pre-Trained Model Affects Performance

Gupta et al. | Oct 18, 2020

In this study, the authors seek to improve a machine learning algorithm used for image classification: labeling images as male or female. In addition to fine-tuning the classification model, they investigate how accuracy is affected by their changes (an important task when developing and updating algorithms). To determine accuracy, one set of images is used to train the model and a separate set of images is used for validation. They found that the validation accuracy was close to the training accuracy. This study contributes to the expanding areas of machine learning and its applications to image identification.
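The train-then-validate procedure described above can be illustrated with a minimal sketch. This is not the authors' model or data: the one-dimensional synthetic features, the threshold "classifier", and the helper names `split_holdout`, `fit_threshold`, and `accuracy` are all assumptions chosen only to show how training accuracy and validation accuracy are measured on separate sets.

```python
import random


def split_holdout(samples, validation_fraction=0.2, seed=0):
    """Shuffle a copy of the samples and split it into (training, validation)."""
    rng = random.Random(seed)
    shuffled = samples[:]
    rng.shuffle(shuffled)
    n_val = int(len(shuffled) * validation_fraction)
    return shuffled[n_val:], shuffled[:n_val]


def fit_threshold(training):
    """Toy stand-in for a model: the midpoint between the two class means."""
    class0 = [x for x, label in training if label == 0]
    class1 = [x for x, label in training if label == 1]
    mean0 = sum(class0) / len(class0)
    mean1 = sum(class1) / len(class1)
    return (mean0 + mean1) / 2


def accuracy(threshold, samples):
    """Fraction of samples whose predicted label (x > threshold) is correct."""
    correct = sum((x > threshold) == bool(label) for x, label in samples)
    return correct / len(samples)


# Synthetic, made-up data: class 0 centered at 0.0, class 1 centered at 3.0.
rng = random.Random(42)
data = [(rng.gauss(0.0, 1.0), 0) for _ in range(200)]
data += [(rng.gauss(3.0, 1.0), 1) for _ in range(200)]

train, val = split_holdout(data)
threshold = fit_threshold(train)      # the model only ever sees the training set
train_acc = accuracy(threshold, train)
val_acc = accuracy(threshold, val)    # measured on images the model never saw
print(f"training accuracy:   {train_acc:.3f}")
print(f"validation accuracy: {val_acc:.3f}")
```

When the two numbers are close, as the study reports for its own model, the model is likely generalizing rather than memorizing the training set.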

Rhythmic lyrics translation: Customizing a pre-trained language model using stacked fine-tuning

Chong et al. | May 01, 2023

Neural machine translation (NMT) is software that uses neural network techniques to translate text from one language to another. However, one of the most famous NMT models, Google Translate, failed to give an accurate English translation of a famous Korean nursery rhyme, "Airplane" (비행기). The authors fine-tuned a pre-trained model first with a dataset from the lyrics domain, and then with a smaller dataset containing the rhythmical properties, to teach the model to translate rhythmically accurate lyrics. This stacked fine-tuning method resulted in an NMT model that could maintain the rhythmical characteristics of lyrics during translation, while single fine-tuned models failed to do so.

Comparing model-centric and data-centric approaches to determine the efficiency of data-centric AI

La et al. | Apr 20, 2023

In this study, three models are used to test the hypothesis that data-centric artificial intelligence (AI) will improve the performance of machine learning.

Large Language Models are Good Translators

Zeng et al. | Oct 16, 2024

Machine translation remains a challenging area in artificial intelligence, with neural machine translation (NMT) making significant strides over the past decade but still facing hurdles, particularly in translation quality due to the reliance on expensive bilingual training data. This study explores whether large language models (LLMs), like GPT-4, can be effectively adapted for translation tasks and outperform traditional NMT systems.

Identifying shark species using an AlexNet CNN model

Sarwal et al. | Sep 23, 2024

The challenge of accurately identifying shark species is crucial for biodiversity monitoring but is often hindered by time-consuming and labor-intensive manual methods. To address this, SharkNet, a CNN model based on AlexNet, achieved 93% accuracy in classifying shark species using a limited dataset of 1,400 images across 14 species. SharkNet offers a more efficient and reliable solution for marine biologists and conservationists in species identification and environmental monitoring.

Evaluating key factors in emotion detection models for AI-driven personalized bibliotherapy

Dalal et al. | Apr 27, 2026

This study evaluates the potential of natural language processing (NLP) models in an emotion-driven bibliotherapy framework to address mental health challenges.

Applying machine learning to breast cancer diagnosis: A high school student’s exploration using R

Vikram et al. | Aug 20, 2025

The authors combine fine needle aspiration biopsy and machine learning algorithms to develop a breast cancer detection method suitable for resource-constrained regions that lack access to mammograms.

Correlation between particulate matter concentrations and COPD hospitalization rates in Massachusetts

Ganeshwaran et al. | Dec 30, 2024

Air pollution is thought to increase the prevalence of health conditions like chronic obstructive pulmonary disease (COPD). Ganeshwaran and Ropiak investigate this relationship by determining whether there is a correlation between one type of air pollution (fine particulate matter concentrations) and COPD hospitalization rates in Massachusetts.
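As a rough illustration of what testing for a correlation between two measured series involves, the sketch below computes a Pearson correlation coefficient. The abstract does not say which statistic the authors used, so Pearson's r is an assumption here, as are the function name `pearson_r` and the placeholder numbers, which are invented for the example and are not taken from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Placeholder values only (NOT data from the study): hypothetical regional
# fine particulate matter levels and COPD hospitalization rates.
pm25_levels = [6.0, 7.5, 8.1, 9.3, 10.2]
copd_rates = [18.0, 21.5, 20.9, 25.0, 26.4]

r = pearson_r(pm25_levels, copd_rates)
print(f"Pearson r = {r:.3f}")
```

A value of r near +1 or -1 indicates a strong linear association and a value near 0 indicates little linear association; on its own, a correlation does not establish that one quantity causes the other.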

Artificial Intelligence-Based Smart Solution to Reduce Respiratory Problems Caused by Air Pollution

Bhardwaj et al. | Dec 14, 2021

In this report, Bhardwaj and Sharma tested whether placing specific plants indoors can reduce levels of indoor air pollution that can lead to lung-related illnesses. Using machine learning, they show that plants improved overall indoor air quality and reduced levels of particulate matter. They suggest that plant-based interventions coupled with sensors may be a useful long-term solution for reducing indoor air pollution and keeping it low.

Innovative use of recycled textile fibers in building materials: A circular economy approach

Gupta et al. | Feb 19, 2026

Textile waste from the fashion industry is a major environmental pollutant, but recycling waste into novel building material is a strategy to reduce the negative effects. This manuscript characterized five different binders that can be used to repurpose textile waste into bricks for construction purposes. Water-based glue, cement, white cement, plaster of Paris, and epoxy resin were mixed with shredded textile waste, and the mechanical characteristics and thermal insulation of each brick type were measured. Bricks with increased mechanical strength had the poorest thermal resistance, and the contrasting properties would suit different building purposes. This work provides a first step in generating recycled textile bricks for construction in a circular economy framework.
