Every year, around 40% of undergraduate students in the United States discontinue their studies, resulting in a loss of valuable education for students and a loss of money for colleges. Even so, colleges across the nation struggle to discover the underlying causes of these high dropout rates. In this paper, the authors discuss the use of machine learning to find correlations between the built environment factors and the retention rates of colleges. They hypothesized that one way for colleges to improve their retention rates could be to improve the physical characteristics of their campus to be more pleasing. The authors used image classification techniques to look at images of colleges and correlate certain features like colors, cars, and people to higher or lower retention rates. With three possible options of high, medium, and low retention rates, the probability that their models reached the right conclusion if they simply chose randomly was 33%. After finding that this 33%, or 0.33 mark, always fell outside of the 99% confidence intervals built around their models’ accuracies, the authors concluded that their machine learning techniques can be used to find correlations between certain environmental factors and retention rates.
Read More...Browse Articles
Predicting the factors involved in orthopedic patient hospital stay
Long hospital stays can be stressful for the patient for many reasons. We hypothesized that age would be the greatest predictor of hospital stay among patients who underwent orthopedic surgery. Through our models, we found that severity of illness was indeed the highest factor that contributed to determining patient length of stay. The other two factors that followed were the facility that the patient was staying in and the type of procedure that they underwent.
Read More...Predicting the Instance of Breast Cancer within Patients using a Convolutional Neural Network
Using a convolution neural network, these authors show machine learning can clinically diagnose breast cancer with high accuracy.
Read More...Predicting asthma-related emergency department visits and hospitalizations with machine learning techniques
Seeking to investigate the effects of ambient pollutants on human respiratory health, here the authors used machine learning to examine asthma in Lost Angeles County, an area with substantial pollution. By using machine learning models and classification techniques, the authors identified that nitrogen dioxide and ozone levels were significantly correlated with asthma hospitalizations. Based on an identified seasonal surge in asthma hospitalizations, the authors suggest future directions to improve machine learning modeling to investigate these relationships.
Read More...Predicting Orbital Resonance of 2867 Šteins Using the Yarkovsky Effect
In this study, the impact of thermal effects on the orbit of an asteroid is investigated. This included determining if the asteroid's orbit would push into a region devoid of asteroids due to the gravitational pull of Jupiter.
Read More...A novel approach for predicting Alzheimer’s disease using machine learning on DNA methylation in blood
Here, recognizing the difficulty associated with tracking the progression of dementia, the authors used machine learning models to predict between the presence of cognitive normalcy, mild cognitive impairment, and Alzheimer's Disease, based on blood DNA methylation levels, sex, and age. With four machine learning models and two dataset dimensionality reduction methods they achieved an accuracy of 53.33%.
Read More...Genetic algorithm based features selection for predicting the unemployment rate of India
The authors looked at using genetic algorithms to look at the Indian labor market and what features might best explain any variation seen. They found that features such as economic growth and household consumption, among others, best explained variation.
Read More...A comparative study of dynamic scoring formulas for capture-the-flag competitions
The use of gamification in cybersecurity education, particularly through capture-the-flag competitions, involves scoring challenges based on their difficulty and the number of teams that solve them. The study investigated how changing the scoring formulas affects competition outcomes, predicting that different formulas would alter score distributions.
Read More...Entropy-based subset selection principal component analysis for diabetes risk factor identification
In this article, the authors looked at developing a strategy that would allow for earlier diagnosis of Diabetes as that improves long-term outcomes. They were able to find that BMI, tricep skin fold thickness, and blood pressure are the risk factors with the highest accuracy in predicting diabetes risk.
Read More...Risk assessment modeling for childhood stunting using automated machine learning and demographic analysis
Over the last few decades, childhood stunting has persisted as a major global challenge. This study hypothesized that TPTO (Tree-based Pipeline Optimization Tool), an AutoML (automated machine learning) tool, would outperform all pre-existing machine learning models and reveal the positive impact of economic prosperity, strong familial traits, and resource attainability on reducing stunting risk. Feature correlation plots revealed that maternal height, wealth indicators, and parental education were universally important features for determining stunting outcomes approximately two years after birth. These results help inform future research by highlighting how demographic, familial, and socio-economic conditions influence stunting and providing medical professionals with a deployable risk assessment tool for predicting childhood stunting.
Read More...