root, Author at BytesArchive

January 31, 2024

Enhanced Sound Event Localization and Detection in Real 360-degree audio-visual soundscapes

The authors have developed an enhanced audio-visual Sound Event Localization and Detection (SELD) network, improving on the…

January 31, 2024

The article discusses Singing Voice Conversion (SVC), a technology that allows the conversion of one singer’s voice…

January 31, 2024

The paper introduces ESPnet-SPK, a toolkit for training speaker embedding extractors. It offers an open-source platform for…

January 31, 2024

The article introduces AudioSeal, a novel technique designed specifically for localized detection of AI-generated speech, in response…

January 28, 2024

This paper introduces Investigate-Consolidate-Exploit (ICE), a new approach for improving the adaptability and flexibility of AI agents…

January 28, 2024

The article presents ConstraintChecker, a plugin designed to enhance the reasoning capabilities of Large Language Models (LLMs)…

January 28, 2024

The study introduces CMMU, a benchmark tool for evaluating the understanding and reasoning abilities of multi-modal large…

January 28, 2024

The article presents the Uncertainty-Aware Language Agent (UALA), a new framework that leverages uncertainty quantification to improve…

January 28, 2024

Unitxt is an innovative library designed for customizable textual data preparation and evaluation tailored to generative language…

January 28, 2024

The paper focuses on the use of Transformer-based language models, specifically BERT and (Chat)GPT, in detecting semantic…