Joint-Attention

Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation

root, October 25, 2023

The article presents a novel approach to real-time spoken language transcription and translation using a streaming Transformer-Transducer (T-T) model. The T-T model can jointly produce many-to-one and one-to-many transcription and…

Computer Vision and Pattern Recognition Sound

Audio-Visual Speaker Verification via Joint Cross-Attention

root, October 1, 2023

The article presents a novel approach to speaker verification, a key technology for person authentication. It focuses on audio-visual fusion, leveraging both faces and voices for more comprehensive information. The…

