Every year, around 40% of undergraduate students in the United States discontinue their studies, resulting in a loss of valuable education for students and a loss of money for colleges. Even so, colleges across the nation struggle to discover the underlying causes of these high dropout rates. In this paper, the authors discuss the use of machine learning to find correlations between the built environment factors and the retention rates of colleges. They hypothesized that one way for colleges to improve their retention rates could be to improve the physical characteristics of their campus to be more pleasing. The authors used image classification techniques to look at images of colleges and correlate certain features like colors, cars, and people to higher or lower retention rates. With three possible options of high, medium, and low retention rates, the probability that their models reached the right conclusion if they simply chose randomly was 33%. After finding that this 33%, or 0.33 mark, always fell outside of the 99% confidence intervals built around their models’ accuracies, the authors concluded that their machine learning techniques can be used to find correlations between certain environmental factors and retention rates.
Read More...Browse Articles
Similarity Graph-Based Semi-supervised Methods for Multiclass Data Classification
The purpose of the study was to determine whether graph-based machine learning techniques, which have increased prevalence in the last few years, can accurately classify data into one of many clusters, while requiring less labeled training data and parameter tuning as opposed to traditional machine learning algorithms. The results determined that the accuracy of graph-based and traditional classification algorithms depends directly upon the number of features of each dataset, the number of classes in each dataset, and the amount of labeled training data used.
Read More...Using machine learning to develop a global coral bleaching predictor
Coral bleaching is a fatal process that reduces coral diversity, leads to habitat loss for marine organisms, and is a symptom of climate change. This process occurs when corals expel their symbiotic dinoflagellates, algae that photosynthesize within coral tissue providing corals with glucose. Restoration efforts have attempted to repair damaged reefs; however, there are over 360,000 square miles of coral reefs worldwide, making it challenging to target conservation efforts. Thus, predicting the likelihood of bleaching in a certain region would make it easier to allocate resources for conservation efforts. We developed a machine learning model to predict global locations at risk for coral bleaching. Data obtained from the Biological and Chemical Oceanography Data Management Office consisted of various coral bleaching events and the parameters under which the bleaching occurred. Sea surface temperature, sea surface temperature anomalies, longitude, latitude, and coral depth below the surface were the features found to be most correlated to coral bleaching. Thirty-nine machine learning models were tested to determine which one most accurately used the parameters of interest to predict the percentage of corals that would be bleached. A random forest regressor model with an R-squared value of 0.25 and a root mean squared error value of 7.91 was determined to be the best model for predicting coral bleaching. In the end, the random model had a 96% accuracy in predicting the percentage of corals that would be bleached. This prediction system can make it easier for researchers and conservationists to identify coral bleaching hotspots and properly allocate resources to prevent or mitigate bleaching events.
Read More...Evaluating the predicted eruption times of geysers in Yellowstone National Park
The authors compare the predicted versus actual geyser eruption times for the Old Faithful and Beehive Geysers at Yellowstone National Park.
Read More...Idotea balthica comparison: Anatomy, locomotion, and seaweed preference of Massachusetts isopods
Here the authors examined a population of Massachusetts marine isopods, seeking to classify them based on comparison of their morphology, movement, and seaweed preference compared to those of known species. In this process they found that they were most similar to Idotea balthica. The authors suggest that this knowledge combined with monitoring populations of marine biology such as these isopods in different physical and ecological areas can provide useful insight into the effects of climate change.
Read More...Development of Diet-Induced Insulin Resistance in Drosophila melanogaster and Characterization of the Anti-Diabetic Effects of Resveratrol and Pterostilbene
Dhar and colleagues established a Type II diabetes mellitus (T2DM) model in fruit flies, using this model to induce insulin resistance and characterize the effects Resveratrol and Pterostilbene on a number of growth and activity metrics. Resveratrol and Pterostilbene treatment notably overturned the weight gain and glucose levels. The results of this study suggest that Drosophila can be utilized as a model organism to study T2DM and novel pharmacological treatments.
Read More...Effect of Collagen Gel Structure on Fibroblast Phenotype
Environment affects the progression of life, especially at the cellular level. This study investigates multiple 3-dimensional growth environments, also known as scaffolds or hydrogels, and their effect on the growth of a type of cells called fibroblasts. These results suggest that a scaffold made of collagen and polyethylene glycol are favorable for cell growth. This research is useful for developing implantable devices to aid wound healing.
Read More...Evaluating the clinical applicability of neural networks for meningioma tumor segmentation on 3D MRI
Authors emphasize the challenges of manual tumor segmentation and the potential of deep learning models to enhance accuracy by automatically analyzing MRI scans.
Read More...Effects of different synthetic training data on real test data for semantic segmentation
Semantic segmentation - labelling each pixel in an image to a specific class- models require large amounts of manually labeled and collected data to train.
Read More...A novel encoding technique to improve non-weather-based models for solar photovoltaic forecasting
Several studies have applied different machine learning (ML) techniques to the area of forecasting solar photovoltaic power production. Most of these studies use weather data as inputs to predict power production; however, there are numerous practical issues with the procurement of this data. This study proposes models that do not use weather data as inputs, but rather use past power production data as a more practical substitute to weather-based models. Our proposed models demonstrate a better, cheaper, and more reliable alternatives to current weather models.
Read More...