Browse Articles

Gradient boosting with temporal feature extraction for modeling keystroke log data

Barretto et al. | Oct 04, 2024

Image credit: Barretto and Barretto 2024.

Although the field of natural language processing (NLP) has made great progress over the last few years, particularly with the development of attention-based models, little research has addressed modeling keystroke log data. State-of-the-art methods handle textual data directly, and while this has produced excellent results, their time complexity and resource usage are quite high. These methods also focus solely on content and fail to incorporate the actual writing process when assessing text. We therefore proposed a framework for modeling textual data using keystroke-based features, which capture how a document or response was written rather than the final text that was produced. These features differ greatly from those extracted from raw text, but they reveal information that is otherwise hidden. Transformer-based methods, which pay more or less attention to the different components of a text sequence, currently dominate NLP because of the strong understanding of natural language they display. We hypothesized that pairing efficient machine learning techniques with keystroke log information would produce results comparable to those of transformer techniques in far less time. We showed that models trained on keystroke log data can effectively evaluate the quality of writing, and do so in significantly less time than traditional text-based methods. This is significant because it provides a fast, inexpensive alternative to increasingly large and slow LLMs.
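As a rough illustration of what "temporal feature extraction" from a keystroke log might look like, the sketch below turns one writing session into a small set of timing features. The log format (`down_time`, `up_time`, `activity`), the function name `extract_temporal_features`, the 2-second pause threshold, and the chosen features are all assumptions made for this example, not the authors' actual pipeline. A feature dictionary like this, one per session, is the kind of tabular input a gradient boosting model could then be trained on.

```python
from statistics import mean


def extract_temporal_features(events, pause_threshold_ms=2000):
    """Summarize one writing session as a flat dict of timing features.

    `events` is a hypothetical log format: a time-ordered list of dicts with
    `down_time` and `up_time` in milliseconds and an `activity` label.
    """
    if not events:
        raise ValueError("events must be non-empty")

    # Gap between releasing one key and pressing the next one.
    gaps = [nxt["down_time"] - cur["up_time"]
            for cur, nxt in zip(events, events[1:])]
    # How long each key was held down.
    holds = [e["up_time"] - e["down_time"] for e in events]

    return {
        "n_events": len(events),
        "mean_gap_ms": mean(gaps) if gaps else 0.0,
        "max_gap_ms": max(gaps, default=0),
        # Long gaps are treated as pauses (threshold is an assumption).
        "n_long_pauses": sum(g >= pause_threshold_ms for g in gaps),
        "mean_hold_ms": mean(holds),
        "n_deletions": sum(e["activity"] == "delete" for e in events),
    }


# Toy session with one long pause before a deletion.
session = [
    {"down_time": 0, "up_time": 80, "activity": "input"},
    {"down_time": 200, "up_time": 290, "activity": "input"},
    {"down_time": 2790, "up_time": 2860, "activity": "delete"},
    {"down_time": 3000, "up_time": 3100, "activity": "input"},
]
features = extract_temporal_features(session)
```

Note that none of these features depend on the text that was typed, only on when and how keys were pressed, which is the distinction the abstract draws between keystroke-based and content-based modeling.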

A HOG feature extraction and CNN approach to Parkinson’s spiral drawing diagnosis

Tripathi et al. | Aug 09, 2024

Parkinson’s disease (PD) is a prevalent neurodegenerative disorder in the U.S., second only to Alzheimer’s disease. Current diagnostic methods are often inefficient and dependent on clinical exams. This study explored using machine and deep learning to enhance PD diagnosis by analyzing spiral drawings affected by hand tremors, a common PD symptom.

Identifying shark species using an AlexNet CNN model

Sarwal et al. | Sep 23, 2024

Accurately identifying shark species is crucial for biodiversity monitoring but is often hindered by time-consuming, labor-intensive manual methods. To address this, this study presents SharkNet, a CNN model based on AlexNet that achieved 93% accuracy in classifying shark species using a limited dataset of 1,400 images across 14 species. SharkNet offers a more efficient and reliable solution for marine biologists and conservationists in species identification and environmental monitoring.

A Quantitative Assessment of Time, Frequency, and Time-frequency Algorithms for Automated Seizure Detection and Monitoring

Vangal et al. | Sep 28, 2020

Each year, over 100,000 patients die from Sudden Unexpected Death in Epilepsy (SUDEP). A reliable seizure warning system can help patients stay safe. This work presents a comprehensive, comparative analysis of three different signal processing algorithms for automated seizure/ictal detection. The experimental results show that the proposed methods can be effective for accurate automated seizure detection and monitoring in clinical care.

The use of computer vision to differentiate valley fever from lung cancer via CT scans of nodules

El Kereamy et al. | Nov 12, 2024

Pulmonary diseases like lung cancer and valley fever pose serious health challenges, making accurate and rapid diagnostics essential. This study developed a MATLAB-based software tool that uses computer vision techniques to differentiate between these diseases by analyzing features of lung nodules in CT scans, achieving higher precision than traditional methods.

The Role of a Mask - Understanding the Performance of Deep Neural Networks to Detect, Segment, and Extract Cellular Nuclei from Microscopy Images

Dasgupta et al. | Jul 06, 2021

Cell segmentation is the task of identifying cell nuclei instances in fluorescence microscopy images. The goal of this paper is to benchmark the performance of representative deep learning techniques for cell nuclei segmentation using standard datasets and common evaluation criteria. This research establishes an important baseline for cell nuclei segmentation, enabling researchers to continually refine and deploy neural models for real-world clinical applications.

Simulations of Cheetah Roaming Demonstrate the Effect of Safety Corridors on Genetic Diversity and Human-Cheetah Conflict

Acton et al. | Apr 02, 2018

Ecological corridors are geographic features designated to allow the movement of wildlife populations between habitats that have been fragmented by human landscapes. Corridors can be a pivotal aspect in wildlife conservation because they preserve a suitable habitat for isolated populations to live and intermingle. Here, two students simulate the effect of introducing a safety corridor for cheetahs, based on real tracking data on cheetahs in Namibia.

Predicting baseball pitcher efficacy using physical pitch characteristics

Oberoi et al. | Jan 11, 2024

Image credit: Antoine Schibler

Here, the authors sought to develop a new metric to evaluate the efficacy of baseball pitchers using machine learning models. They found that the frequency of balls was the most predictive feature for their walks/hits allowed per inning (WHIP) metric. While their machine learning models did not identify a single defining trait, such as high velocity, spin rate, or pitch type, they found that consistently pitching within the strike zone resulted in significantly lower WHIPs.
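For readers unfamiliar with the metric, WHIP is conventionally computed as walks plus hits divided by innings pitched, so lower values are better. The sketch below shows that arithmetic only; the function name `whip` and the sample numbers are illustrative assumptions, not data from the study. It also assumes innings are given as a true decimal value, not in the box-score notation where "6.1" means six and one-third innings.

```python
def whip(walks, hits, innings_pitched):
    """Walks plus hits allowed per inning pitched (lower is better).

    Assumes `innings_pitched` is a true decimal number of innings,
    not box-score notation such as 6.1 for six and one-third.
    """
    if innings_pitched <= 0:
        raise ValueError("innings_pitched must be positive")
    return (walks + hits) / innings_pitched


# Hypothetical season line: 40 walks and 150 hits over 190 innings.
example_whip = whip(walks=40, hits=150, innings_pitched=190)
```

Because walks add directly to the numerator, a pitcher who throws fewer balls tends to post a lower WHIP, which is consistent with the finding reported above.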

Risk assessment modeling for childhood stunting using automated machine learning and demographic analysis

Sirohi et al. | Sep 25, 2022

Over the last few decades, childhood stunting has persisted as a major global challenge. This study hypothesized that TPOT (Tree-based Pipeline Optimization Tool), an AutoML (automated machine learning) tool, would outperform all pre-existing machine learning models and reveal the positive impact of economic prosperity, strong familial traits, and resource attainability on reducing stunting risk. Feature correlation plots revealed that maternal height, wealth indicators, and parental education were universally important features for determining stunting outcomes approximately two years after birth. These results help inform future research by highlighting how demographic, familial, and socio-economic conditions influence stunting, and they provide medical professionals with a deployable risk assessment tool for predicting childhood stunting.
