We do industrial, academic and open-source research on diverse topics in-the-realm of human-centered AI. Check out some of our recent works.
News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News
We present a fine-tuned LLM framework for verifiable T.V. news QA pairs, surpassing similar base models and enhancing answer contextualization.
Venue: Under review at ICASSP 2025
Published on: Fri Jul 01 2022
1SPU: 1-step speech processing unit
1SPU extends ASR with tagged placeholders for semantic events, achieving improved transcription quality on SLUE and SLURP benchmarks.
Venue: International Conference on Natural Language Processing 2023
Published on: Fri Jul 01 2022
E2E spoken entity extraction for virtual agents
This study refines entity extraction directly from speech, optimizing ASR encoders to transcribe only relevant content in virtual agent dialogs.
Venue: Empirical Methods in Natural Language Processing 2023
Published on: Thu Sep 15 2022
Combining pretrained speech and text encoders for continuous spoken language Processing
We introduce a multi-modal model for token-level classification using cross-modal attention, efficient for single GPU training.
Venue: International Conference on Natural Language Processing 2023
Published on: Thu Sep 15 2022
The Red Hen anonymizer and the red hen protocol for de-identifying audiovisual recordings
The Red Hen Anonymizer enables de-identification of audiovisual data, ensuring privacy while supporting machine learning and research.
Venue: Linguistics Vanguard 2022 (Journal)
Published on: Thu Sep 15 2022