[P] Unix philosophy for ML pipelines: modular, swappable stages with typed contracts
We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...
Text understanding and language tasks
I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...
Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Abstract page for arXiv paper 2602.09937: Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?
Abstract page for arXiv paper 2506.05634: AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization
Abstract page for arXiv paper 2510.09782: The Geometry of Reasoning: Flowing Logics in Representation Space
Abstract page for arXiv paper 2505.15643: Optimal Best-Arm Identification under Fixed Confidence with Multiple Optima
Abstract page for arXiv paper 2505.13033: TSPulse: Tiny Pre-Trained Models with Disentangled Representations for Rapid Time-Series Analysis
Abstract page for arXiv paper 2503.07638: Leveraging Taxonomy Similarity for Next Activity Prediction in Patient Treatment
Abstract page for arXiv paper 2506.08321: LeanTutor: Towards a Verified AI Mathematical Proof Tutor
Abstract page for arXiv paper 2505.21668: R1-Code-Interpreter: LLMs Reason with Code via Supervised and Multi-stage Reinforcement Learning
Abstract page for arXiv paper 2504.20505: MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities o...
Abstract page for arXiv paper 2310.04925: Crystal-GFN: sampling crystals with desirable properties and constraints
Abstract page for arXiv paper 2603.04353: A Constrained RL Approach for Cost-Efficient Delivery of Latency-Sensitive Applications
Abstract page for arXiv paper 2603.04348: RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Repo...
Abstract page for arXiv paper 2603.04317: World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurr...
Abstract page for arXiv paper 2603.04321: SPRINT: Semi-supervised Prototypical Representation for Few-Shot Class-Incremental Tabular Lear...
Abstract page for arXiv paper 2603.04204: Beyond Mixtures and Products for Ensemble Aggregation: A Likelihood Perspective on Generalized ...
Abstract page for arXiv paper 2603.04293: LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance
Abstract page for arXiv paper 2603.04005: Training-Free Rate-Distortion-Perception Traversal With Diffusion
Abstract page for arXiv paper 2603.04158: GarmentPile++: Affordance-Driven Cluttered Garments Retrieval with Vision-Language Reasoning
Abstract page for arXiv paper 2603.03843: Invariance-Based Dynamic Regret Minimization
Abstract page for arXiv paper 2603.04037: DQE-CIR: Distinctive Query Embeddings through Learnable Attribute Weights and Target Relative N...