[2602.14643] Arbor: A Framework for Reliable Navigation of Critical Conversation Flows

[2602.14643] Arbor: A Framework for Reliable Navigation of Critical Conversation Flows

arXiv - AI 4 min read Article

Summary

The paper presents Arbor, a framework designed to enhance the navigation of critical conversation flows in high-stakes environments like healthcare, improving accuracy and efficiency.

Why It Matters

Arbor addresses the limitations of traditional monolithic approaches in AI, particularly in healthcare, where accurate decision-making is crucial. By breaking down decision processes into manageable tasks, it allows for better performance even with smaller models, making advanced AI more accessible and reliable in critical applications.

Key Takeaways

  • Arbor improves mean turn accuracy by 29.4 percentage points.
  • It reduces per-turn latency by 57.1%, enhancing efficiency.
  • The framework allows smaller models to perform comparably to larger ones.
  • Decision trees are standardized for dynamic retrieval, improving navigation.
  • Arbor's architecture reduces dependence on model size and capability.

Computer Science > Artificial Intelligence arXiv:2602.14643 (cs) [Submitted on 16 Feb 2026] Title:Arbor: A Framework for Reliable Navigation of Critical Conversation Flows Authors:Luís Silva, Diogo Gonçalves, Catarina Farinha, Clara Matos, Luís Ungaro View a PDF of the paper titled Arbor: A Framework for Reliable Navigation of Critical Conversation Flows, by Lu\'is Silva and 3 other authors View PDF HTML (experimental) Abstract:Large language models struggle to maintain strict adherence to structured workflows in high-stakes domains such as healthcare triage. Monolithic approaches that encode entire decision structures within a single prompt are prone to instruction-following degradation as prompt length increases, including lost-in-the-middle effects and context window overflow. To address this gap, we present Arbor, a framework that decomposes decision tree navigation into specialized, node-level tasks. Decision trees are standardized into an edge-list representation and stored for dynamic retrieval. At runtime, a directed acyclic graph (DAG)-based orchestration mechanism iteratively retrieves only the outgoing edges of the current node, evaluates valid transitions via a dedicated LLM call, and delegates response generation to a separate inference step. The framework is agnostic to the underlying decision logic and model provider. Evaluated against single-prompt baselines across 10 foundation models using annotated turns from real clinical triage conversations. Arbor imp...

Related Articles

Llms

[D] Tested model routing on financial AI datasets — good savings and curious what benchmarks others use.

Ran a benchmark evaluating whether prompt complexity-based routing delivers meaningful savings. Used public HuggingFace datasets. Here's ...

Reddit - Machine Learning · 1 min ·
Llms

[D] AI research on small language models

i'm doing research on some trending fields in AI, currently working on small language models and would love to meet people who are workin...

Reddit - Machine Learning · 1 min ·
Llms

One of The Worst AI's I've Ever Seen

I'm using Gemini just for they gave us a student-free-pro pack. It can't see the images I sent, most of the time it just rewrites the mes...

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone 👋 I've set up a self-hosted API gateway using New-API to manage and distribute Claude Opus 4.6 access across multiple users....

Reddit - Artificial Intelligence · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime