(1)
Pham, T. Truth Behind the Scene: Designing Evaluations Benchmarks to Assess LLMs’ Task-Specific Understanding over Test-Taking Strategies. AAAI 2025, 39, 29596-29598.