[2306.14685] DiffSketcher: Text Guided Vector Sketch Synthesis through

[2306.14685] DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

arXiv - AI April 09, 2026 3 min read

About this article

Abstract page for arXiv paper 2306.14685: DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

Computer Science > Computer Vision and Pattern Recognition arXiv:2306.14685 (cs) [Submitted on 26 Jun 2023 (v1), last revised 8 Apr 2026 (this version, v5)] Title:DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models Authors:Ximing Xing, Chuang Wang, Haitao Zhou, Jing Zhang, Qian Yu, Dong Xu View a PDF of the paper titled DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, by Ximing Xing and 5 other authors View PDF HTML (experimental) Abstract:We demonstrate that pre-trained text-to-image diffusion models, despite being trained on raster images, possess a remarkable capacity to guide vector sketch synthesis. In this paper, we introduce DiffSketcher, a novel algorithm for generating vectorized free-hand sketches directly from natural language prompts. Our method optimizes a set of Bézier curves via an extended Score Distillation Sampling (SDS) loss, successfully bridging a raster-level diffusion prior with a parametric vector generator. To further accelerate the generation process, we propose a stroke initialization strategy driven by the diffusion model's intrinsic attention maps. Results show that DiffSketcher produces sketches across varying levels of abstraction while maintaining the structural integrity and essential visual details of the subject. Experiments confirm that our approach yields superior perceptual quality and controllability over existing methods. The code and demo are available at this https URL...

Originally published on April 09, 2026. Curated by AI News.

Machine Learning

Meta AI app climbs to No. 5 on the App Store after Muse Spark launch | TechCrunch

The app was ranking No. 57 on the App Store just before Meta AI's new model launched. Now it's No. 5 — and rising.

TechCrunch - AI · 4 min · about 1 hour ago

Machine Learning

Detecting mirrored selfie images: OCR the best way? [D]

I'm trying to catch backwards "selfie" images before passing them to our VLM text reader and/or face embedding extraction. Since models l...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

Google’s Gemini AI can answer your questions with 3D models and simulations

submitted by /u/tekz [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Machine Learning

Cold start latency on GPU cloud platforms in 2026 — p99 specifically, not p50. Anyone have real data? [D]

doing infrastructure evaluation for inference workloads and running into the same problem everywhere: every platform publishes p50 cold s...

Reddit - Machine Learning · 1 min · about 2 hours ago

[2306.14685] DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models

About this article

Related Articles

Meta AI app climbs to No. 5 on the App Store after Muse Spark launch | TechCrunch

Detecting mirrored selfie images: OCR the best way? [D]

Google’s Gemini AI can answer your questions with 3D models and simulations

Cold start latency on GPU cloud platforms in 2026 — p99 specifically, not p50. Anyone have real data? [D]

No comments

Stay updated with AI News