The authors examined how well machine learning algorithms can interpret language, given their increasing use in moderating content on social media. Using an explainable model, they achieved 81% accuracy in distinguishing fake from real news based on the language of posts alone.
Browse Articles
Uncovering the hidden trafficking trade with geographic data and natural language processing
The authors use machine learning to develop an evidence-based detection tool for identifying human trafficking.
Machine learning predictions of additively manufactured alloy crack susceptibilities
Additive manufacturing (AM) is transforming the production of complex metal parts, but challenges like internal cracking can arise, particularly in critical sectors such as aerospace and automotive. Traditional methods to assess cracking susceptibility are costly and time-consuming, prompting the use of machine learning (ML) for more efficient predictions. This study developed a multi-model ML pipeline that predicts solidification cracking susceptibility (SCS) more accurately by considering secondary alloy properties alongside composition. Random Forest models showed the best performance, pointing to a promising direction for future research into SCS quantification.
Using explainable artificial intelligence to identify patient-specific breast cancer subtypes
Breast cancer is the most common cancer in women, with approximately 300,000 women diagnosed in 2023. It ranks second in cancer-related deaths for women, after lung cancer with nearly 50,000 deaths. Scientists have identified important mutations in genes like BRCA1 and BRCA2 that lead to the development of breast cancer, but previous studies were limited because they focused on specific populations. To overcome these limitations, diverse populations and powerful statistical methods like genome-wide association studies and whole-genome sequencing are needed. Explainable artificial intelligence (XAI) can help overcome these limitations of specificity in oncology and breast cancer research, as it can analyze datasets of diagnosed patients and provide interpretable explanations for the patterns and predictions it identifies. This project aims to achieve technological and medicinal goals by using advanced algorithms to identify breast cancer subtypes for faster diagnoses. Multiple methods were utilized to develop an efficient algorithm. We hypothesized that an XAI approach would perform best because it can assign scores to genes, and that it would do so with a 90% success rate. To test this, we ran multiple trials using XAI methods to identify class-specific and patient-specific key genes. The study demonstrated a pipeline that combines multiple XAI techniques to identify potential biomarker genes for breast cancer with a 95% success rate.
Can the attributes of an app predict its rating?
In this article, the authors examined different attributes of apps in the Google Play store to determine how they affect the overall app rating out of five stars. They found that review count, the amount of storage needed, and when the app was last updated were the most influential factors in an app's rating.
An analysis of the feasibility of SARIMAX-GARCH through load forecasting
The authors found that SARIMAX-GARCH is more accurate than SARIMAX for load forecasting with respect to energy consumption.
Building deep neural networks to detect candy from photos and estimate nutrient portfolio
The authors use pictures of candy wrappers and neural networks to improve nutritional accuracy of diet-tracking apps.
Evaluating the predicted eruption times of geysers in Yellowstone National Park
The authors compare the predicted versus actual geyser eruption times for the Old Faithful and Beehive Geysers at Yellowstone National Park.
The use of computer vision to differentiate valley fever from lung cancer via CT scans of nodules
Pulmonary diseases like lung cancer and valley fever pose serious health challenges, making accurate and rapid diagnostics essential. This study developed a MATLAB-based software tool that uses computer vision techniques to differentiate between these diseases by analyzing features of lung nodules in CT scans, achieving higher precision than traditional methods.
Risk assessment modeling for childhood stunting using automated machine learning and demographic analysis
Over the last few decades, childhood stunting has persisted as a major global challenge. This study hypothesized that TPOT (Tree-based Pipeline Optimization Tool), an AutoML (automated machine learning) tool, would outperform all pre-existing machine learning models and reveal the positive impact of economic prosperity, strong familial traits, and resource attainability on reducing stunting risk. Feature correlation plots revealed that maternal height, wealth indicators, and parental education were universally important features for determining stunting outcomes approximately two years after birth. These results help inform future research by highlighting how demographic, familial, and socio-economic conditions influence stunting and providing medical professionals with a deployable risk assessment tool for predicting childhood stunting.