[P] Using YouTube as a data source (lessons from building a coffee domain dataset)
I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...
Text understanding and language tasks
I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...
Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Abstract page for arXiv paper 2601.22440: AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Value...
Abstract page for arXiv paper 2602.07075: LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
Abstract page for arXiv paper 2601.23236: YuriiFormer: A Suite of Nesterov-Accelerated Transformers
Abstract page for arXiv paper 2512.04551: Multi-Loss Learning for Speech Emotion Recognition with Energy-Adaptive Mixup and Frame-Level A...
Abstract page for arXiv paper 2509.08177: Quadrotor Navigation using Reinforcement Learning with Privileged Information
Abstract page for arXiv paper 2511.21033: Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning
Abstract page for arXiv paper 2510.08966: Beyond Prefixes: Graph-as-Memory Cross-Attention for Knowledge Graph Completion with Large Lang...
Abstract page for arXiv paper 2506.07915: A Signal Contract for Online Language Grounding and Discovery in Decision-Making
Abstract page for arXiv paper 2505.04997: Foam-Agent: Towards Automated Intelligent CFD Workflows
Abstract page for arXiv paper 2404.16721: Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods
Abstract page for arXiv paper 2603.05504: RoboPocket: Improve Robot Policies Instantly with Your Phone
Abstract page for arXiv paper 2603.05471: Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval
Abstract page for arXiv paper 2603.05438: Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model
Abstract page for arXiv paper 2603.05210: Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding
Abstract page for arXiv paper 2603.05167: C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reas...
Abstract page for arXiv paper 2603.05114: UniPAR: A Unified Framework for Pedestrian Attribute Recognition
Abstract page for arXiv paper 2603.05057: MUTEX: Leveraging Multilingual Transformers and Conditional Random Fields for Enhanced Urdu Tox...
Abstract page for arXiv paper 2603.04910: VPWEM: Non-Markovian Visuomotor Policy with Working and Episodic Memory
Abstract page for arXiv paper 2603.04890: FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation
Abstract page for arXiv paper 2603.04811: Meta-D: Metadata-Aware Architectures for Brain Tumor Analysis and Missing-Modality Segmentation
Abstract page for arXiv paper 2603.04805: Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime