Browse Articles

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

Balaji et al. | Sep 11, 2021

Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification

The purpose of the study was to determine whether graph-based machine learning techniques, which have increased prevalence in the last few years, can accurately classify data into one of many clusters, while requiring less labeled training data and parameter tuning as opposed to traditional machine learning algorithms. The results determined that the accuracy of graph-based and traditional classification algorithms depends directly upon the number of features of each dataset, the number of classes in each dataset, and the amount of labeled training data used.

Read More...

Machine learning on crowd-sourced data to highlight coral disease

Narayan et al. | Jul 26, 2021

Machine learning on crowd-sourced data to highlight coral disease

Triggered largely by the warming and pollution of oceans, corals are experiencing bleaching and a variety of diseases caused by the spread of bacteria, fungi, and viruses. Identification of bleached/diseased corals enables implementation of measures to halt or retard disease. Benthic cover analysis, a standard metric used in large databases to assess live coral cover, as a standalone measure of reef health is insufficient for identification of coral bleaching/disease. Proposed herein is a solution that couples machine learning with crowd-sourced data – images from government archives, citizen science projects, and personal images collected by tourists – to build a model capable of identifying healthy, bleached, and/or diseased coral.

Read More...

An Investigative Analysis of Climate Change Using Historical and Modern Weather Data

Han et al. | Dec 02, 2013

An Investigative Analysis of Climate Change Using Historical and Modern Weather Data

Climate change is an important and contentious issue that has far-reaching implications for our future. The authors here compare primary temperature and precipitation data from almost 200 years ago against the present day. They find that the average annual temperature in Brooklyn, NY has risen significantly over this time, as has the frequency of precipitation, though not the amount of precipitation. These data stress the need for more ecologically-conscious choices in our daily lives.

Read More...

A Data-Centric Analysis of “Stop and Frisk” in New York City

Bhat et al. | Apr 18, 2021

A Data-Centric Analysis of “Stop and Frisk” in New York City

The death of George Floyd has shed light on the disproportionate level of policing affecting non-Whites in the United States of America. To explore whether non-Whites were disproportionately targetted by New York City's "Stop and Frisk" policy, the authors analyze publicly available data on the practice between 2003-2019. Their results suggest African Americans were indeed more likely to be stopped by the police until 2012, after which there was some improvement.

Read More...

Using data science along with machine learning to determine the ARIMA model’s ability to adjust to irregularities in the dataset

Choudhary et al. | Jul 26, 2021

Using data science along with machine learning to determine the ARIMA model’s ability to adjust to irregularities in the dataset

Auto-Regressive Integrated Moving Average (ARIMA) models are known for their influence and application on time series data. This statistical analysis model uses time series data to depict future trends or values: a key contributor to crime mapping algorithms. However, the models may not function to their true potential when analyzing data with many different patterns. In order to determine the potential of ARIMA models, our research will test the model on irregularities in the data. Our team hypothesizes that the ARIMA model will be able to adapt to the different irregularities in the data that do not correspond to a certain trend or pattern. Using crime theft data and an ARIMA model, we determined the results of the ARIMA model’s forecast and how the accuracy differed on different days with irregularities in crime.

Read More...

A Retrospective Study of Research Data on End Stage Renal Disease

Ponnaluri et al. | Mar 09, 2016

A Retrospective Study of Research Data on End Stage Renal Disease

End Stage Renal Disease (ESRD) is a growing health concern in the United States. The authors of this study present a study of ESRD incidence over a 32-year period, providing an in-depth look at the contributions of age, race, gender, and underlying medical factors to this disease.

Read More...

LawCrypt: Secret Sharing for Attorney-Client Data in a Multi-Provider Cloud Architecture

Zhang et al. | Jul 19, 2020

LawCrypt: Secret Sharing for Attorney-Client Data in a Multi-Provider Cloud Architecture

In this study, the authors develop an architecture to implement in a cloud-based database used by law firms to ensure confidentiality, availability, and integrity of attorney documents while maintaining greater efficiency than traditional encryption algorithms. They assessed whether the architecture satisfies necessary criteria and tested the overall file sizes the architecture could process. The authors found that their system was able to handle larger file sizes and fit engineering criteria. This study presents a valuable new tool that can be used to ensure law firms have adequate security as they shift to using cloud-based storage systems for their files.

Read More...

Who is at Risk for a Spinal Fracture? – A Comparative Study of National Health and Nutrition Examination Survey Data

He et al. | Mar 01, 2018

Who is at Risk for a Spinal Fracture? – A Comparative Study of National Health and Nutrition Examination Survey Data

One common age-related health problem is the loss of bone mineral density (BMD), which can lead to a variety of negative health outcomes, including increased risk of spinal fracture. In this study, the authors investigate risk factors that may be predictive of an individual's risk of spinal fracture. Their findings provide valuable information that clinicians can use in patient evaluations.

Read More...

The determinants and incentives of corporate greenhouse gas emission reduction

Liu et al. | Jun 04, 2021

The determinants and incentives of corporate greenhouse gas emission reduction

This study used hand-collected Greenhouse gas (GHG) emissions data from the Environmental Protection Agency (EPA) and aimed to understand the determinants and incentives of GHG emissions reduction. It explored how companies’ financials, Chief Executive Officer (CEO) compensation, and corporate governance affected GHG emissions. Results showed that companies reporting GHG emissions were wide-spread among the 48 industries represented by two-digit Standard Industrial Classification (SIC) codes.

Read More...

Transfer Learning for Small and Different Datasets: Fine-Tuning A Pre-Trained Model Affects Performance

Gupta et al. | Oct 18, 2020

Transfer Learning for Small and Different Datasets: Fine-Tuning A Pre-Trained Model Affects Performance

In this study, the authors seek to improve a machine learning algorithm used for image classification: identifying male and female images. In addition to fine-tuning the classification model, they investigate how accuracy is affected by their changes (an important task when developing and updating algorithms). To determine accuracy, a set of images is used to train the model and then a separate set of images is used for validation. They found that the validation accuracy was close to the training accuracy. This study contributes to the expanding areas of machine learning and its applications to image identification.

Read More...

Search Articles

Search articles by title, author name, or tags

Clear all filters

Popular Tags

Browse by school level