[2601.08950] ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors

arXiv - AI April 07, 2026 3 min read

About this article

Abstract page for arXiv paper 2601.08950: ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors

Computer Science > Artificial Intelligence arXiv:2601.08950 (cs) [Submitted on 13 Jan 2026 (v1), last revised 6 Apr 2026 (this version, v2)] Title:ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors Authors:Mayank Sharma, Roy Pea, Hari Subramonyam View a PDF of the paper titled ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors, by Mayank Sharma and 2 other authors View PDF HTML (experimental) Abstract:Despite their growing adoption in education, LLMs remain misaligned with the core principle of effective tutoring: the dialogic construction of knowledge. We introduce ConvoLearn, a dataset of 2,134 semi-synthetic tutor-student dialogues operationalizing six dimensions of dialogic tutoring grounded in knowledge-building theory, situated in middle school Earth Science curriculum. We show that dimension-labeled dialogic training data captures meaningful pedagogical signal that generalizes beyond its semi-synthetic domain: scores from a classifier trained on ConvoLearn correlate significantly with expert-coded instructional quality in authentic classrooms across multiple subscales (range r = .118-.258, all p < .05). As a proof of concept, we fine-tune Mistral-7B on ConvoLearn and show that dimension-level fine-tuning can steer a 7B open-weight model toward dialogic tutoring behavior that credentialed teachers rate as competitive with a strong proprietary baseline. With this work, we support the development of AI tutors capable of more dialogic interactions. Subjects:...

Originally published on April 07, 2026. Curated by AI News.

Llms

How do you test AI agents in production? The unpredictability is overwhelming.[D]

I’ve been in QA for almost a decade. My mental model for quality was always: given input X, assert output Y. Now I’m on a team that’s shi...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

Confusing Website

i'm trying to find a video online and couldn't so i asked ChatGPT by describing the video and i was given a link and i'm trying to make s...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

I tested the same prompt across multiple AI models… the differences surprised me

I’ve been experimenting with different AI models lately (ChatGPT, Claude, etc.), and I tried something simple: Using the exact same promp...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Llms

Anthropic gave Claude $100 to go shopping, here’s what the AI ended up buying

Anthropic’s AI experiment showed Claude independently handled 186 deals worth over $4,000, but results varied by model capability, with u...

AI Tools & Products · 5 min · about 7 hours ago

[2601.08950] ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors

About this article

Related Articles

How do you test AI agents in production? The unpredictability is overwhelming.[D]

Confusing Website

I tested the same prompt across multiple AI models… the differences surprised me

Anthropic gave Claude $100 to go shopping, here’s what the AI ended up buying

No comments

Stay updated with AI News