Agents Can Now Propose and Deploy Their Own Code Changes
150 clones yesterday. 43 stars in 3 days. Every agent framework you've used (LangChain, LangGraph, Claude Code) assumes agents are tools ...
Text understanding and language tasks
Abstract page for arXiv paper 2603.17839: How do LLMs Compute Verbal Confidence
Abstract page for arXiv paper 2602.03584: $V_0$: A Generalist Value Model for Any Policy at State Zero
Abstract page for arXiv paper 2603.01494: Inference-Time Safety For Code LLMs Via Retrieval-Augmented Revision
Abstract page for arXiv paper 2603.01493: PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval
Abstract page for arXiv paper 2603.01465: Non-Markovian Long-Horizon Robot Manipulation via Keyframe Chaining
Abstract page for arXiv paper 2603.01305: AG-VAS: Anchor-Guided Zero-Shot Visual Anomaly Segmentation with Large Multimodal Models
Abstract page for arXiv paper 2603.01281: Spectral Attention Steering for Prompt Highlighting
Abstract page for arXiv paper 2603.01632: DeLo: Dual Decomposed Low-Rank Experts Collaboration for Continual Missing Modality Learning
Abstract page for arXiv paper 2603.01589: SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond
Abstract page for arXiv paper 2603.01241: TARSE: Test-Time Adaptation via Retrieval of Skills and Experience for Reasoning Agents
Abstract page for arXiv paper 2603.01224: Monocular 3D Object Position Estimation with VLMs for Human-Robot Interaction
Abstract page for arXiv paper 2603.01195: VisNec: Measuring and Leveraging Visual Necessity for Multimodal Instruction Tuning
Abstract page for arXiv paper 2603.01309: PAC Guarantees for Reinforcement Learning: Sample Complexity, Coverage, and Structure
Abstract page for arXiv paper 2603.01096: Unified Vision-Language Modeling via Concept Space Alignment
Abstract page for arXiv paper 2603.01297: I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift
Abstract page for arXiv paper 2603.01048: RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair
Abstract page for arXiv paper 2603.01193: Operator Learning Using Weak Supervision from Walk-on-Spheres
Abstract page for arXiv paper 2603.00978: EraseAnything++: Enabling Concept Erasure in Rectified Flow Transformers Leveraging Multi-Objec...
Abstract page for arXiv paper 2603.01097: Understanding LoRA as Knowledge Memory: An Empirical Analysis
Abstract page for arXiv paper 2603.01047: Evaluating GFlowNet from partial episodes for stable and flexible policy-based training
Abstract page for arXiv paper 2603.00924: Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains
Abstract page for arXiv paper 2603.00997: DWAFM: Dynamic Weighted Graph Structure Embedding Integrated with Attention and Frequency-Domai...