[2602.22699] DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule

[2602.22699] DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule

arXiv - Machine Learning 4 min read Article

Summary

DPSQL+ is a new SQL library designed to enhance data privacy by enforcing differential privacy and a minimum frequency rule, ensuring sensitive information is protected during data analysis.

Why It Matters

As data privacy concerns grow, DPSQL+ addresses the need for secure data analysis tools that comply with governance requirements. By integrating differential privacy with the minimum frequency rule, it offers a robust solution for organizations handling sensitive data.

Key Takeaways

  • DPSQL+ combines differential privacy with a minimum frequency rule for enhanced data protection.
  • The library's modular architecture includes a Validator, Accountant, and Backend for efficient query management.
  • Experiments show DPSQL+ maintains high accuracy across various analytical workloads.

Computer Science > Cryptography and Security arXiv:2602.22699 (cs) [Submitted on 26 Feb 2026] Title:DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule Authors:Tomoya Matsumoto, Shokichi Takakura, Shun Takagi, Satoshi Hasegawa View a PDF of the paper titled DPSQL+: A Differentially Private SQL Library with a Minimum Frequency Rule, by Tomoya Matsumoto and 3 other authors View PDF HTML (experimental) Abstract:SQL is the de facto interface for exploratory data analysis; however, releasing exact query results can expose sensitive information through membership or attribute inference attacks. Differential privacy (DP) provides rigorous privacy guarantees, but in practice, DP alone may not satisfy governance requirements such as the \emph{minimum frequency rule}, which requires each released group (cell) to include contributions from at least $k$ distinct individuals. In this paper, we present \textbf{DPSQL+}, a privacy-preserving SQL library that simultaneously enforces user-level $(\varepsilon,\delta)$-DP and the minimum frequency rule. DPSQL+ adopts a modular architecture consisting of: (i) a \emph{Validator} that statically restricts queries to a DP-safe subset of SQL; (ii) an \emph{Accountant} that consistently tracks cumulative privacy loss across multiple queries; and (iii) a \emph{Backend} that interfaces with various database engines, ensuring portability and extensibility. Experiments on the TPC-H benchmark demonstrate that DPSQL+ achieves pract...

Related Articles

Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
Machine Learning

[R] Are there ML approaches for prioritizing and routing “important” signals across complex systems?

I’ve been reading more about attention mechanisms in transformers and how they effectively learn to weight and prioritize relevant inputs...

Reddit - Machine Learning · 1 min ·
Llms

[P] I trained a language model from scratch for a low resource language and got it running fully on-device on Android (no GPU, demo)

Hi Everybody! I just wanted to share an update on a project I’ve been working on called BULaMU, a family of language models trained (20M,...

Reddit - Machine Learning · 1 min ·
Machine Learning

[R] Structure Over Scale: Memory-First Reasoning and Depth-Pruned Efficiency in Magnus and Seed Architecture Auto-Discovery

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime