root, Author at BytesArchive

January 31, 2024

Phoneme-Based Proactive Anti-Eavesdropping with Controlled Recording Privilege

This study presents a new system designed to protect against eavesdropping by jamming microphones with a unique…

January 31, 2024

The article introduces Synchformer, a new model for audio-visual synchronization focused on ‘in-the-wild’ videos, such as those…

January 31, 2024

The research presents AMuSE (Adaptive Multimodal Analysis for Speaker Emotion), a model developed for recognizing individual emotions…

January 31, 2024

Music auto-tagging is key for improving music discovery and recommendation. Existing models in Music Information Retrieval (MIR)…

January 31, 2024

This paper discusses the limitations of current masked audio modeling (MAM) methods and presents a new method…

January 31, 2024

The article presents a framework for continuous target speaker extraction (C-TSE), which aims to refine the process…

January 31, 2024

The paper introduces an algorithm for localizing mono-frequent uniformly moving sound sources, operating entirely in the frequency…

January 31, 2024

The article presents a system that uses spatial-temporal activity for multichannel speaker diarization and separation. The architecture…

January 31, 2024

The article introduces the PBSCSR dataset, a resource for studying composer style recognition in piano sheet music….

January 31, 2024

The article discusses the need for objective metrics in evaluating speech generation. The authors propose new reference-aware…