[2603.22777] AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model
About this article
Abstract page for arXiv paper 2603.22777: AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model
Computer Science > Artificial Intelligence arXiv:2603.22777 (cs) [Submitted on 24 Mar 2026] Title:AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model Authors:Yagizhan Bilal Durak, Ahsan Ul Islam, Shahidul Islam, Ashley Morgan-Olvera, Iftekhar Ibne Basith, Syed Hasib Akhter Faruqui View a PDF of the paper titled AgriPestDatabase-v1.0: A Structured Insect Dataset for Training Agricultural Large Language Model, by Yagizhan Bilal Durak and 5 other authors View PDF HTML (experimental) Abstract:Agricultural pest management increasingly relies on timely and accurate access to expert knowledge, yet high quality labeled data and continuous expert support remain limited, particularly for farmers operating in rural regions with unstable/no internet connectivity. At the same time, the rapid growth of AI and LLMs has created new opportunities to deliver practical decision support tools directly to end users in agriculture through compact and deployable systems. This work addresses (i) generating a structured insect information dataset, and (ii) adapting a lightweight LLM model ($\leq$ 7B) by fine tuning it for edge device uses in agricultural pest management. The textual data collection was done by reviewing and collecting information from available pest databases and published manuscripts on nine selected pest species. These structured reports were then reviewed and validated by a domain expert. From these reports, we constructed Q/A pairs...