Abstract: In recent years, the development of socially interactive robots has heightened the need for interpretable and trustworthy models capable of recognizing complex human emotions in real time.
Abstract: Automatic modulation recognition (AMR) in underwater acoustic (UWA) communications is limited by scarce training data and channel impairments such as multipath propagation, Doppler shifts, ...
Overview SynthToSoul is a full-stack audio analysis application designed to distinguish between Human-Made and AI-Generated music. By combining a custom Deep Learning classifier with audio ...
Abstract: The identification and analysis of human daily activities has garnered substantial attention in recent years, driven by its expansive applications in areas including healthcare, surveillance ...
Abstract: Handwritten Optical Character Recognition (OCR) for the Tifinagh script remains a challenging task due to the script’s geometric complexity, high similarity among rotated characters, and the ...
Abstract: The rapid progress in audio synthesis technologies allows the generating of artificial audio deep-fakes that sound convincingly real, thus causing security, privacy, and authenticity ...
Abstract: Emotion recognition is essential for improving user experience and interaction quality in human-centered applications. While recent studies have leveraged both event and traditional cameras ...
Abstract: Music genre classification is an important task in music information retrieval with uses in recommendation, indexing and streaming. This paper proposes a CNN-based framework that classifies ...
Abstract: Spiking neural networks (SNNs) are one of the best practices for efficient event-driven object recognition. To achieve high recognition accuracy, existing methods generally accumulate ...
Abstract: Medical image segmentation is increasingly reliant on deep learning techniques, yet the promising performance often come with high annotation costs. This paper introduces Weak-Mamba-UNet, an ...
Abstract: Electroencephalography EEG - based decoding of visual stimuli has gained traction in computational neuroscience and brain-computer interface applications. This study explores the feasibility ...
Abstract: Most existing studies on table tennis training focus on either action recognition or scoring alone, lacking a systematic way to model both tasks together, which limits the ability to provide ...