Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English

Authors

  • Avinash Anand Indraprastha Institute of Information Technology, Delhi
  • Kritarth Prasad Indraprastha Institute of Information Technology, Delhi
  • Chhavi Kirtani Indraprastha Institute of Information Technology, Delhi
  • Ashwin R Nair Indraprastha Institute of Information Technology, Delhi
  • Manvendra Kumar Nema Indraprastha Institute of Information Technology, Delhi
  • Raj Jaiswal Indraprastha Institute of Information Technology, Delhi
  • Rajiv Ratn Shah Indraprastha Institute of Information Technology, Delhi

DOI:

https://doi.org/10.1609/aaai.v39i22.34509

Abstract

Large Language Models (LLMs) excel in linguistic tasks but struggle with mathematical reasoning, particularly in non- English languages like Hindi. This research aims to en- hance the mathematical reasoning skills of smaller, resource- efficient open-source LLMs in both Hindi and English. We evaluate models like OpenHathi 7B, LLaMA-2 7B, Wizard- Math 7B, Mistral 7B, LLeMMa 7B, MAmmoTH 7B, Gemini Pro, and GPT-4 using zero-shot, few-shot chain-of-thought (CoT) methods, and supervised fine-tuning. Our approach in- corporates curriculum learning, progressively training mod- els on increasingly difficult problems, a novel Decompo- sition Strategy to simplify complex arithmetic operations, and a Structured Solution Design that divides solutions into phases. Our experiments result in notable performance en- hancements. WizardMath 7B exceeds Gemini’s accuracy on English datasets by +6% and matches Gemini’s performance on Hindi datasets. Adopting a bilingual approach that com- bines English and Hindi samples achieves results comparable to individual language models, demonstrating the capability to learn mathematical reasoning in both languages. This re- search highlights the potential for improving mathematical reasoning in open-source LLMs.

Published

2025-04-11

How to Cite

Anand, A., Prasad, K., Kirtani, C., Nair, A. R., Nema, M. K., Jaiswal, R., & Shah, R. R. (2025). Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English. Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), 23415–23423. https://doi.org/10.1609/aaai.v39i22.34509

Issue

Section

AAAI Technical Track on Natural Language Processing I