[2603.29450] Few-shot Writer Adaptation via Multimodal In-Context Learning
Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.29450 (cs)
[Submitted on 31 Mar 2026]

Title: Few-shot Writer Adaptation via Multimodal In-Context Learning
Authors: Tom Simon, Stephane Nicolas, Pierrick Tranouez, Clement Chatelain, Thierry Paquet

Abstract: While state-of-the-art Handwritten Text Recognition (HTR) models perform well on standard benchmarks, they frequently struggle with writers exhibiting highly specific styles that are underrepresented in the training data. To handle unseen and atypical writers, writer adaptation techniques personalize HTR models to individual handwriting styles. Leading writer adaptation methods require either offline fine-tuning or parameter updates at inference time, both involving gradient computation and backpropagation, which increase computational costs and demand careful hyperparameter tuning. In this work, we propose a novel context-driven HTR framework inspired by multimodal in-context learning, enabling inference-time writer adaptation using only a few examples from the target writer without any parameter updates. We further demonstrate the impact of context length, design a compact 8M-parameter CNN-Transformer that enables few-shot in-context adaptation, and show that combining context-driven and standard OCR training strategies leads to complementary improvements.
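The abstract's core idea, adapting at inference time by conditioning on a few (line image, transcription) pairs from the target writer rather than updating parameters, can be illustrated with a toy sketch. The encoders and the concatenation scheme below are illustrative assumptions, not the paper's architecture: `encode_image` and `encode_text` stand in for the visual and text encoders, and `build_context` shows how k exemplar pairs might be concatenated in front of the query line so a decoder could attend to the writer's style.

```python
import numpy as np

def encode_image(img):
    # Stand-in visual encoder: mean-pool pixel rows into one feature row
    # per column (a real model would use a CNN backbone).
    return img.mean(axis=0, keepdims=True)

def encode_text(tokens, vocab_size=128):
    # Stand-in text embedding: one normalized column per token id.
    return np.array(tokens, dtype=float).reshape(1, -1) / vocab_size

def build_context(exemplars, query_img):
    """Concatenate k exemplar (image, transcription) pairs followed by
    the query image, forming the in-context input sequence."""
    parts = []
    for img, tokens in exemplars:
        parts.append(encode_image(img))   # exemplar line image
        parts.append(encode_text(tokens)) # its transcription
    parts.append(encode_image(query_img)) # query line to recognize
    return np.concatenate(parts, axis=1)

rng = np.random.default_rng(0)
# Three 8x16 exemplar line images with 3-token transcriptions (toy data).
exemplars = [(rng.random((8, 16)), [5, 9, 2]) for _ in range(3)]
query = rng.random((8, 16))
ctx = build_context(exemplars, query)
# Each exemplar contributes 16 + 3 columns; the query adds 16 more.
print(ctx.shape)  # (1, 73)
```

No gradients or parameter updates are involved: adaptation happens purely through what the model sees in its context window, which is what makes the approach cheap at inference time.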