Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance

Yue Fang; Yuxin Guo; Jiaran Gao; Hongxin Ding; Xinke Jiang; Weibin Liao; Yongxin Xu; Yinghao Zhu; Zhibang Yang; Liantao Ma; Junfeng Zhao; Yasha Wang

doi:10.1609/aaai.v40i36.40325

Authors

Yue Fang Peking University
Yuxin Guo Peking University
Jiaran Gao Peking University
Hongxin Ding Peking University
Xinke Jiang Peking University
Weibin Liao Peking University
Yongxin Xu Peking University
Yinghao Zhu Peking University
Zhibang Yang Peking University
Liantao Ma Peking University
Junfeng Zhao Peking University
Yasha Wang Peking University

DOI:

https://doi.org/10.1609/aaai.v40i36.40325

Abstract

Improving large language models (LLMs) for electronic health record (EHR) reasoning is essential for enabling accurate and generalizable clinical predictions. While LLMs excel at medical text understanding, they underperform on EHR-based prediction tasks due to challenges in modeling temporally structured, high-dimensional data. Existing approaches often rely on hybrid paradigms, where LLMs serve merely as frozen prior retrievers while downstream deep learning (DL) models handle prediction, failing to improve the LLM’s intrinsic reasoning capacity and inheriting the generalization limitations of DL models. To this end, we propose EAG-RL, a novel two-stage training framework designed to intrinsically enhance LLMs’ EHR reasoning ability through expert attention guidance, where expert EHR models refer to task-specific DL models trained on EHR data. Concretely, EAG-RL first constructs high-quality, stepwise reasoning trajectories using expert-guided Monte Carlo Tree Search to effectively initialize the LLM’s policy. Then, EAG-RL further optimizes the policy via reinforcement learning by aligning the LLM’s attention with clinically salient features identified by expert EHR models. Extensive experiments on two real-world EHR datasets show that EAG-RL improves the intrinsic EHR reasoning ability of LLMs by an average of 14.62%, while also enhancing robustness to feature perturbations and generalization to unseen clinical domains. These results demonstrate the practical potential of EAG-RL for real-world deployment in clinical prediction tasks.

Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information