#Speech Recognition
Latest news articles tagged with "Speech Recognition". Follow the timeline of events, related topics, and entities.
Articles (10)
-
πΊπΈ LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families
[USA]
arXiv:2603.20042v1 Announce Type: cross Abstract: Large language models (LLMs) have driven substantial advances in speech language models (SpeechLMs), yielding strong performance in automatic speech ...
Related: #AI Evaluation -
πΊπΈ Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech
[USA]
arXiv:2603.20112v1 Announce Type: cross Abstract: Personalizing Automatic Speech Recognition (ASR) for non-normative speech remains challenging because data collection is labor-intensive and model tr...
Related: #Accessibility Technology -
πΊπΈ ProKWS: Personalized Keyword Spotting via Collaborative Learning of Phonemes and Prosody
[USA]
arXiv:2603.18024v1 Announce Type: cross Abstract: Current keyword spotting systems primarily use phoneme-level matching to distinguish confusable words but ignore user-specific pronunciation traits l...
Related: #Personalized AI -
πΊπΈ SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition
[USA]
arXiv:2603.10005v1 Announce Type: cross Abstract: Many Automatic Speech Recognition (ASR) applications require streaming processing of the audio data. In streaming mode, ASR systems need to start tra...
Related: #AI Technology -
πΊπΈ Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding
[USA]
arXiv:2603.06193v1 Announce Type: cross Abstract: Long-form speech recognition with large encoder-decoder models such as Whisper often exhibit hallucinations, repetition loops, and content omissions....
Related: #AI Technology -
πΊπΈ When Denoising Hinders: Revisiting Zero-Shot ASR with SAM-Audio and Whisper
[USA]
arXiv:2603.04710v1 Announce Type: cross Abstract: Recent advances in automatic speech recognition (ASR) and speech enhancement have led to a widespread assumption that improving perceptual audio qual...
Related: #Audio Processing -
πΊπΈ Training-Free Intelligibility-Guided Observation Addition for Noisy ASR
[USA]
arXiv:2602.20967v1 Announce Type: cross Abstract: Automatic speech recognition (ASR) degrades severely in noisy environments. Although speech enhancement (SE) front-ends effectively suppress backgrou...
Related: #Audio Processing, #Machine Learning, #Noise Reduction -
πΊπΈ Wispr Flow launches an Android app for AI-powered dictation
[USA]
AI-powered dictation startup Wispr Flow has launched its Android app today. The company released its app for Mac and Windows first, then launched on iOS in June 2025. On iOS, users could use Wispr Flo...
Related: #AI Technology, #Mobile Applications, #Venture Funding -
πΊπΈ The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR$\rightarrow$LLM Pipelines?
[USA]
arXiv:2602.17598v1 Announce Type: cross Abstract: Current speech LLMs largely perform implicit ASR: on tasks solvable from a transcript, they are behaviorally and mechanistically equivalent to simple...
Related: #Large Language Models, #Model Architecture Comparison, #Audio Processing, #Efficiency and Cost Analysis -
πΊπΈ SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise
[USA]
arXiv:2602.12783v1 Announce Type: cross Abstract: Spoken query retrieval is an important interaction mode in modern information retrieval. However, existing evaluation datasets are often limited to s...
Related: #Technology Evaluation, #Information Retrieval
Key Entities (6)
- Android (operating system) (1 news)
- Hinglish (1 news)
- Whispering (1 news)
- Speech recognition (1 news)
- Noise reduction (1 news)
- Audio processing (1 news)
About the topic: Speech Recognition
The topic "Speech Recognition" aggregates 10+ news articles from various countries.