[P] Unix philosophy for ML pipelines: modular, swappable stages with typed contracts
We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...
Text understanding and language tasks
We built an open-source prototype that applies Unix philosophy to retrieval pipelines. Each stage (PII redaction, chunking, dedup, embedd...
I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...
Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Abstract page for arXiv paper 2603.04805: Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation
Abstract page for arXiv paper 2603.04799: Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm
Abstract page for arXiv paper 2603.04772: TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings
Abstract page for arXiv paper 2603.04743: DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
Abstract page for arXiv paper 2603.04718: AI-Assisted Moot Courts: Simulating Justice-Specific Questioning in Oral Arguments
Abstract page for arXiv paper 2603.04663: Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency H...
Abstract page for arXiv paper 2603.04659: GIANT - Global Path Integration and Attentive Graph Networks for Multi-Agent Trajectory Planning
Abstract page for arXiv paper 2603.04597: Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning
Abstract page for arXiv paper 2603.04532: Still Fresh? Evaluating Temporal Drift in Retrieval Benchmarks
Abstract page for arXiv paper 2603.04450: MPBMC: Multi-Property Bounded Model Checking with GNN-guided Clustering
Abstract page for arXiv paper 2603.04443: AMV-L: Lifecycle-Managed Agent Memory for Tail-Latency Control in Long-Running LLM Systems
Abstract page for arXiv paper 2603.04429: What Is Missing: Interpretable Ratings for Large Language Model Outputs
Abstract page for arXiv paper 2603.04422: FedEMA-Distill: Exponential Moving Average Guided Knowledge Distillation for Robust Federated L...
Abstract page for arXiv paper 2603.04421: Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?
Abstract page for arXiv paper 2603.04410: SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models
Abstract page for arXiv paper 2603.04406: CTRL-RAG: Contrastive Likelihood Reward Based Reinforcement Learning for Context-Faithful RAG M...
Abstract page for arXiv paper 2603.04403: FinRetrieval: A Benchmark for Financial Data Retrieval by AI Agents
Abstract page for arXiv paper 2603.05295: WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces
Abstract page for arXiv paper 2603.05225: AI+HW 2035: Shaping the Next Decade
Abstract page for arXiv paper 2603.05129: MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty C...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime