[P] Unix philosophy for ML pipelines: modular, swappable stages with typed contracts
We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...
Text understanding and language tasks
We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...
I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...
Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Abstract page for arXiv paper 2603.05024: Measuring the Fragility of Trust: Devising Credibility Index via Explanation Stability (CIES) f...
Abstract page for arXiv paper 2603.04981: Rethinking Representativeness and Diversity in Dynamic Data Selection
Abstract page for arXiv paper 2603.04951: Retrieval-Augmented Generation with Covariate Time Series
Abstract page for arXiv paper 2603.04868: K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory ...
Abstract page for arXiv paper 2603.04756: MOOSEnger -- a Domain-Specific AI Agent for the MOOSE Ecosystem
Abstract page for arXiv paper 2603.04741: CONE: Embeddings for Complex Numerical Data Preserving Unit and Variable Semantics
Abstract page for arXiv paper 2603.04448: SkillNet: Create, Evaluate, and Connect AI Skills
The ML Prague conference will feature a Taiwan pavilion focused on deepening cooperation.
I'm an AI Engineer currently daily-driving a 16" M1 Pro MBP. It’s been a workhorse, but I’m feeling the bottleneck when running larger lo...
There’s a major risk that OpenClaw will exploit your data and funds. So I built a security focused version in Rust. AMA. I was incredibly...
Abstract page for arXiv paper 2602.10149: Exploring Semantic Labeling Strategies for Third-Party Cybersecurity Risk Assessment Questionna...
Abstract page for arXiv paper 2601.19933: NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference
Abstract page for arXiv paper 2601.04646: Succeeding at Scale: Automated Dataset Construction and Query-Side Adaptation for Multi-Tenant ...
Abstract page for arXiv paper 2601.00361: Deterministic Coreset for Lp Subspace
Abstract page for arXiv paper 2510.02578: FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D l...
Abstract page for arXiv paper 2509.25095: Benchmarking ECG FMs: A Reality Check Across Clinical Tasks
Abstract page for arXiv paper 2508.09844: On the Generalization Limits of Quantum Generative Adversarial Networks with Pure State Generators
Abstract page for arXiv paper 2510.24702: Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
Abstract page for arXiv paper 2510.24178: MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime