Machine Learning Data Science Ai Safety

[2601.22983] PIDSMaker: Building and Evaluating Provenance-based Intrusion Detection Systems

arXiv - Machine Learning February 16, 2026 3 min read Article

Summary

PIDSMaker is an open-source framework designed for building and evaluating provenance-based intrusion detection systems (PIDSs), addressing inconsistencies in prior evaluations.

Why It Matters

This framework enhances the reproducibility and comparability of PIDSs, which are crucial for detecting advanced persistent threats. By standardizing evaluation protocols, PIDSMaker supports researchers in developing more effective security systems, ultimately improving cybersecurity measures.

Key Takeaways

PIDSMaker consolidates eight PIDSs into a modular architecture for better evaluation.
Standardized preprocessing and ground-truth labels enhance reproducibility.
The framework supports rapid prototyping with a YAML-based configuration interface.
Includes utilities for hyperparameter tuning and visualization to improve research methodologies.
Preprocessed datasets and labels are provided to facilitate shared evaluations.

Computer Science > Cryptography and Security arXiv:2601.22983 (cs) [Submitted on 30 Jan 2026 (v1), last revised 13 Feb 2026 (this version, v2)] Title:PIDSMaker: Building and Evaluating Provenance-based Intrusion Detection Systems Authors:Tristan Bilot, Baoxiang Jiang, Thomas Pasquier View a PDF of the paper titled PIDSMaker: Building and Evaluating Provenance-based Intrusion Detection Systems, by Tristan Bilot and 2 other authors View PDF Abstract:Recent provenance-based intrusion detection systems (PIDSs) have demonstrated strong potential for detecting advanced persistent threats (APTs) by applying machine learning to system provenance graphs. However, evaluating and comparing PIDSs remains difficult: prior work uses inconsistent preprocessing pipelines, non-standard dataset splits, and incompatible ground-truth labeling and metrics. These discrepancies undermine reproducibility, impede fair comparison, and impose substantial re-implementation overhead on researchers. We present PIDSMaker, an open-source framework for developing and evaluating PIDSs under consistent protocols. PIDSMaker consolidates eight state-of-the-art systems into a modular, extensible architecture with standardized preprocessing and ground-truth labels, enabling consistent experiments and apples-to-apples comparisons. A YAML-based configuration interface supports rapid prototyping by composing components across systems without code changes. PIDSMaker also includes utilities for ablation studies, hyp...

Read Original Article

[2601.22983] PIDSMaker: Building and Evaluating Provenance-based Intrusion Detection Systems

Summary

Why It Matters

Key Takeaways

Related Articles

Finally Abliterated Sarvam 30B and 105B!

BANKING77-77: New best of 94.61% on the official test set (+0.13pp) over our previous tests 94.48%.

Free tool I built to score dataset quality (LQS) — feedback welcome [D]

Meta’s New AI Model Gives Mark Zuckerberg a Seat at the Big Kid’s Table | WIRED

No comments

Stay updated with AI News