[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion

[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion

arXiv - Machine Learning 3 min read Article

Summary

The paper introduces HARPOON, a novel method for generating tabular data using generalized manifold guidance, addressing limitations in existing conditional generation techniques.

Why It Matters

HARPOON enhances the generation of tabular data by utilizing manifold theory, which allows for better handling of diverse conditional tasks. This advancement is crucial for applications requiring precise data generation and can significantly improve performance in various machine learning tasks.

Key Takeaways

  • HARPOON extends manifold theory to improve tabular data generation.
  • The method addresses limitations of existing conditional generation techniques.
  • Empirical validation shows strong performance in tasks like imputation.
  • Manifold-aware guidance enhances the ability to meet diverse constraints.
  • HARPOON is applicable across various datasets, showcasing its versatility.

Computer Science > Machine Learning arXiv:2602.07875 (cs) [Submitted on 8 Feb 2026 (v1), last revised 19 Feb 2026 (this version, v2)] Title:Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion Authors:Aditya Shankar, Yuandou Wang, Rihan Hai, Lydia Y. Chen View a PDF of the paper titled Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion, by Aditya Shankar and 3 other authors View PDF HTML (experimental) Abstract:Generating tabular data under conditions is critical to applications requiring precise control over the generative process. Existing methods rely on training-time strategies that do not generalise to unseen constraints during inference, and struggle to handle conditional tasks beyond tabular imputation. While manifold theory offers a principled way to guide generation, current formulations are tied to specific inference-time objectives and are limited to continuous domains. We extend manifold theory to tabular data and expand its scope to handle diverse inference-time objectives. On this foundation, we introduce HARPOON, a tabular diffusion method that guides unconstrained samples along the manifold geometry to satisfy diverse tabular conditions at inference. We validate our theoretical contributions empirically on tasks such as imputation and enforcing inequality constraints, demonstrating HARPOON'S strong performance across diverse datasets and the practical benefits of manifold-aware guidance for tabular data. Code URL: t...

Related Articles

Machine Learning

[P] MCGrad: fix calibration of your ML model in subgroups

Hi r/MachineLearning, We’re open-sourcing MCGrad, a Python package for multicalibration–developed and deployed in production at Meta. Thi...

Reddit - Machine Learning · 1 min ·
Machine Learning

Ml project user give dataset and I give best model [D] [P]

Tl,dr : suggest me a solution to create a ai ml project where user will give his dataset as input and the project should give best model ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] ICML Reviewer Acknowledgement

Hi, I'm a little confused about ICML discussion period Does the period for reviewer acknowledging responses have already ended? One of th...

Reddit - Machine Learning · 1 min ·
Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime