Press ESC to close

Artificial Intelligence

It is the subfield of computer science that focuses on creating systems capable of intelligent behavior, including problem solving, learning, adaptation, perception, and language understanding.

Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification

root 0

The article introduces a novel audio-video recognition approach called the Audio-Video Transformer (AVT) that uses effective spatio-temporal representation for improved action recognition. The research reduces cross-modality complexity via an audio-video…

Continue reading

DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

root 0

DiffSHEG offers a solution for speech-driven holistic 3D expression and gesture generation. Unlike previous research that focused on individual generation of expression or gesture, DiffSHEG facilitates a joint generation, improving…

Continue reading

Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness

root 0

The article presents ‘FreeTalker’, a novel framework that generates both spontaneous and non-spontaneous speaker motions, thus improving the naturalness and controllability of talking avatars. Unlike previous models, which only considered…

Continue reading