The Tor network allows individuals to secure their online identities by encrypting their traffic; however, it is vulnerable to fingerprinting attacks that threaten users' online privacy. In this paper, the authors develop a new video fingerprinting model to explore how well video streaming can be fingerprinted in Tor. They found that their model could distinguish which one of 50 videos a user was hypothetically watching on the Tor network with 85% accuracy, demonstrating that video fingerprinting is a serious threat to the privacy of Tor users.
In this article, the authors investigate whether stock selection across various sectors is efficient enough to outperform the overall market. Stocks from 2006 to 2020 were selected across sectors, and their beta values were calculated using the Capital Asset Pricing Model.
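As a rough illustration of the beta calculation mentioned above, the sketch below uses the standard textbook definition of beta, the covariance of a stock's returns with the market's returns divided by the variance of the market's returns. The function names and the return series are hypothetical and for illustration only; the article's actual data and procedure are not reproduced here.

```python
from statistics import mean


def capm_beta(stock_returns, market_returns):
    """Estimate beta as Cov(stock, market) / Var(market)."""
    if len(stock_returns) != len(market_returns) or len(stock_returns) < 2:
        raise ValueError("need two equal-length return series with at least 2 points")
    s_mean = mean(stock_returns)
    m_mean = mean(market_returns)
    # The 1/(n-1) factors of covariance and variance cancel in the ratio.
    cov = sum((s - s_mean) * (m - m_mean)
              for s, m in zip(stock_returns, market_returns))
    var = sum((m - m_mean) ** 2 for m in market_returns)
    if var == 0:
        raise ValueError("market returns have zero variance")
    return cov / var


def capm_expected_return(risk_free_rate, beta, market_return):
    """CAPM relation: E[R] = Rf + beta * (E[Rm] - Rf)."""
    return risk_free_rate + beta * (market_return - risk_free_rate)


# Hypothetical periodic returns (not taken from the article).
market = [0.01, -0.02, 0.03, 0.015, -0.01]
stock = [2 * r for r in market]  # moves twice as much as the market

beta = capm_beta(stock, market)
print(f"beta = {beta:.2f}")
```

A beta above 1 indicates a stock that tends to move more than the market, and a beta below 1 indicates one that tends to move less.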
Using facial recognition as a use-case scenario, we attempt to identify sources of bias in a model developed using transfer learning. To achieve this task, we developed a model based on a pre-trained facial recognition model and scrutinized the accuracy of the model’s image classification against factors such as age, gender, and race to observe whether the model performed better on some demographic groups than others. By identifying the bias and finding potential sources of bias, this work contributes a unique technical perspective, from the view of a small-scale developer, to emerging discussions of accountability and transparency in AI.
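One simple way to check whether a classifier performs better on some demographic groups than others is to compute accuracy separately for each group and compare. The sketch below shows that idea only; the function names, labels, and group tags are hypothetical placeholders, and it is not the evaluation code used in the study.

```python
from collections import defaultdict


def accuracy_by_group(y_true, y_pred, groups):
    """Return {group: accuracy} for predictions split by a group label."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred, group in zip(y_true, y_pred, groups):
        total[group] += 1
        if truth == pred:
            correct[group] += 1
    return {group: correct[group] / total[group] for group in total}


def accuracy_gap(per_group):
    """Difference between the best and worst per-group accuracy."""
    return max(per_group.values()) - min(per_group.values())


# Hypothetical labels and predictions, for illustration only.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 1, 0]
groups = ["A", "A", "A", "B", "B", "B"]

per_group = accuracy_by_group(y_true, y_pred, groups)
print(per_group, accuracy_gap(per_group))
```

A large gap between groups is a signal worth investigating, although it does not by itself identify where the bias comes from.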
In this work, the authors investigate the accuracy with which two different population growth models can predict population growth over time. They apply the Malthusian law and the Logistic law to the US population from 1951 until 2019. To assess how closely each growth model fits the actual population data, a least-squares curve fit was applied, which revealed that the Logistic law of population growth resulted in a smaller sum of squared residuals. These findings are important for ensuring that optimal population growth models are applied to data, as population forecasting affects a country's economic and social structure.
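The comparison described above can be sketched as follows, assuming the usual closed forms of the two laws: P(t) = P0·exp(r·t) for Malthusian growth and P(t) = K / (1 + ((K − P0)/P0)·exp(−r·t)) for logistic growth. The data here are synthetic, generated from a logistic curve, and are not US population figures. A coarse grid search stands in for a proper least-squares solver to keep the example dependency-free; all names and parameter grids are illustrative assumptions.

```python
import math
from itertools import product


def malthusian(t, p0, r):
    """Exponential growth: P(t) = p0 * exp(r * t)."""
    return p0 * math.exp(r * t)


def logistic(t, p0, r, k):
    """Logistic growth with carrying capacity k."""
    return k / (1.0 + ((k - p0) / p0) * math.exp(-r * t))


def ssr(model, params, ts, ys):
    """Sum of squared residuals between the model and the observations."""
    return sum((y - model(t, *params)) ** 2 for t, y in zip(ts, ys))


def grid_fit(model, grids, ts, ys):
    """Brute-force search: return (best_params, best_ssr) over the grid."""
    best = None
    for params in product(*grids):
        err = ssr(model, params, ts, ys)
        if best is None or err < best[1]:
            best = (params, err)
    return best


# Synthetic observations from a logistic curve (illustration only).
ts = list(range(0, 61, 10))
ys = [logistic(t, 100.0, 0.05, 500.0) for t in ts]

rates = [i / 1000 for i in range(1, 101)]          # candidate growth rates
capacities = [300.0, 400.0, 500.0, 600.0, 700.0]   # candidate carrying capacities

mal_params, mal_ssr = grid_fit(malthusian, [[100.0], rates], ts, ys)
log_params, log_ssr = grid_fit(logistic, [[100.0], rates, capacities], ts, ys)
print("Malthusian SSR:", mal_ssr)
print("Logistic SSR:  ", log_ssr)
```

Because the synthetic data were generated from a logistic curve, the logistic model fits them best by construction; with real data, the model with the smaller sum of squared residuals is the closer fit in the least-squares sense.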
Lung cancer is highly fatal, largely due to late diagnoses, but early detection can greatly improve survival. This study developed three models to enhance early diagnosis: an MLP for clinical data, a CNN for imaging data, and a hybrid model combining both.
Here the authors hypothesized that reducing folliculin (FLCN) might affect p62 protein levels in the dorsal hippocampus of mice, given their potential functional connection and p62's role in neurodegenerative diseases. Their study, using western blots and a two-way ANOVA on young wild-type mice, found that p62 levels correlated with FLCN expression, but ultimately concluded that there was no evidence of a functional connection between FLCN and p62 in this specific model.
The global issue of water quality has led to the use of machine learning models, like ANN and SVM, to predict water potability. However, these models can be complex and resource-intensive. This research aimed to find a simpler, more efficient model for water quality prediction.
The mountain chain of the Western Ghats on the Indian peninsula, a UNESCO World Heritage site, is home to about 200 frog species, 89 of which are endemic. Because vocalizations are distinctive to each frog species, they can be used for species recognition. Manually surveying frogs at night during the rain in forests inhabited by elephants and big cats is difficult, so being able to autonomously record ambient soundscapes and identify species is essential. An effective machine learning (ML) species classifier requires substantial training data from this area, but the available dataset of frog vocalizations from this region has only a minimal number of audio recordings per species. Consequently, enhancing an ML model’s performance with limited data is necessary. The goal of this study was to assess data augmentation techniques on this dataset. We analyzed the effects of four data augmentation techniques (Time Shifting, Noise Injection, Spectral Augmentation, and Test-Time Augmentation), individually and in combination, on the frog vocalization data and the public environmental sounds dataset (ESC-50). Combining the data augmentation techniques improved the model's relative accuracy as the size of the dataset decreased. The combination of all four techniques improved the ML model’s classification accuracy on the frog calls dataset by 94%. This study established a data augmentation approach to maximize classification accuracy with sparse frog call recordings, making it possible to build a real-world automated field system for frog species identification. Such a system can significantly help in the conservation of frog species in this vital biodiversity hotspot.
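Two of the four techniques named above, time shifting and noise injection, can be sketched on a raw waveform as shown below. This is a minimal illustration under stated assumptions: the shift is circular, the noise is Gaussian, and the function names, parameters, and test tone are hypothetical. The study's actual augmentation settings are not given here, and spectral augmentation and test-time augmentation are not shown.

```python
import math
import random


def time_shift(samples, shift):
    """Circularly shift a waveform by `shift` samples (positive = later)."""
    if not samples:
        return []
    k = shift % len(samples)
    return samples[-k:] + samples[:-k] if k else list(samples)


def inject_noise(samples, noise_std, rng):
    """Add zero-mean Gaussian noise with standard deviation `noise_std`."""
    return [s + rng.gauss(0.0, noise_std) for s in samples]


def augment(samples, max_shift, noise_std, seed=0):
    """Apply a random circular shift followed by noise injection."""
    rng = random.Random(seed)
    shift = rng.randint(-max_shift, max_shift)
    return inject_noise(time_shift(samples, shift), noise_std, rng)


# Hypothetical input: 100 samples of a 440 Hz tone at an 8 kHz sample rate.
tone = [math.sin(2 * math.pi * 440 * n / 8000) for n in range(100)]

# Each seed yields a different augmented copy of the same recording.
augmented_copies = [augment(tone, max_shift=10, noise_std=0.01, seed=s)
                    for s in range(3)]
print(len(augmented_copies), len(augmented_copies[0]))
```

Generating several perturbed copies of each recording in this way enlarges a small training set without requiring new field recordings.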
Additive manufacturing (AM) is transforming the production of complex metal parts, but challenges like internal cracking can arise, particularly in critical sectors such as aerospace and automotive. Traditional methods to assess cracking susceptibility are costly and time-consuming, prompting the use of machine learning (ML) for more efficient predictions. This study developed a multi-model ML pipeline that predicts solidification cracking susceptibility (SCS) more accurately by considering secondary alloy properties alongside composition. Random Forest models showed the best performance, highlighting a promising direction for future research into SCS quantification.
Rechargeable batteries are playing an increasingly prominent role in our lives due to the ongoing transition from fossil energy sources to green energy. The purpose of this study was to investigate variables that impact the effectiveness of rechargeable batteries. Alkaline (non-rechargeable) and rechargeable batteries share common features that are critical for the operation of a battery. The positive and negative electrodes, also known as the cathode and anode, are where the energy of the battery is stored. The electrolyte facilitates the transfer of cations and anions in a battery to generate electricity. Due to the importance of these components, we felt that a systematic investigation examining the surface area of the cathode and anode, as well as the impact of electrolytes with different properties on battery performance, was justified. Utilizing a copper cathode and an aluminum anode coupled with a water-in-salt electrolyte, a model rechargeable battery system was developed to test two hypotheses: a) increasing the contact area between the electrodes and electrolyte would improve battery capacity, and b) more soluble salt-based electrolytes would improve battery capacity. After soaking in an electrolyte solution, each battery was charged, and its capacity, starting voltage, and ending voltage were measured. The results of this study supported our hypotheses: larger anode and cathode surface areas and more ionic electrolytes, such as sodium chloride, potassium chloride, and potassium sulfate, resulted in superior battery capacity. Incorporating these findings can help maximize the efficiency of commercial rechargeable batteries.