Beyond Transcription: Mechanistic Interpretability in ASR

Neta Glazer; Yael Segal-Feldman; Hilit Segev; Aviv Shamsian; Asaf Buchnick; Gill Hetz; Ethan Fetaya; Joseph Keshet; Aviv Navon

doi:10.1609/aaai.v40i44.41073

Authors

Neta Glazer aiOla
Yael Segal-Feldman aiOla
Hilit Segev aiOla
Aviv Shamsian aiOla
Asaf Buchnick aiOla
Gill Hetz aiOla
Ethan Fetaya Bar Ilan University
Joseph Keshet aiOla Technion - Israel Institute of Technology, Technion
Aviv Navon aiOla

DOI:

https://doi.org/10.1609/aaai.v40i44.41073

Abstract

Interpretability methods have recently gained significant attention, particularly in the context of large language models, enabling insights into linguistic representations, error detection, and model behaviors such as hallucinations and repetitions. However, these techniques remain underexplored in automatic speech recognition (ASR), despite their potential to advance both the performance and interpretability of ASR systems. In this work, we adapt and systematically apply established interpretability methods such as logit lens, linear probing, and activation patching, to examine how acoustic and semantic information evolves across layers in ASR systems. Our experiments reveal previously unknown internal dynamics, including specific encoder-decoder interactions responsible for repetition hallucinations and semantic biases encoded deep within acoustic representations. These insights demonstrate the benefits of extending and applying interpretability techniques to speech recognition, opening promising directions for future research on improving model transparency and robustness.

Beyond Transcription: Mechanistic Interpretability in ASR

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information