[2603.24132] MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare
About this article
Abstract page for arXiv paper 2603.24132: MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare
Computer Science > Computation and Language arXiv:2603.24132 (cs) [Submitted on 25 Mar 2026] Title:MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare Authors:Shubham Kumar Nigam, Suparnojit Sarkar, Piyush Patel View a PDF of the paper titled MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare, by Shubham Kumar Nigam and 2 other authors View PDF HTML (experimental) Abstract:Conversational artificial intelligence has the potential to assist users in preliminary medical consultations, particularly in settings where access to healthcare professionals is limited. However, many existing medical dialogue systems operate in a single-turn question--answering paradigm or rely on template-based datasets, limiting conversational realism and multilingual applicability. In this work, we introduce MedAidDialog, a multilingual multi-turn medical dialogue dataset designed to simulate realistic physician--patient consultations. The dataset extends the MDDial corpus by generating synthetic consultations using large language models and further expands them into a parallel multilingual corpus covering seven languages: English, Hindi, Telugu, Tamil, Bengali, Marathi, and Arabic. Building on this dataset, we develop MedAidLM, a conversational medical model trained using parameter-efficient fine-tuning on quantized small language models, enabling deployment without high-end computational infrastructure. Our framework a...