Bluesky’s new app is an AI for customizing your feed | The Verge
Eventually Attie will be able to vibe code entire apps for the AT Protocol.
GPT, Claude, Gemini, and other LLMs
Eventually Attie will be able to vibe code entire apps for the AT Protocol.
Link: https://m.youtube.com/watch?v=1sd26pWhfmg The Linux exploit is especially interesting because it was introduced in 2003 and was nev...
Inspired by Andrej Karpathy's AutoResearch, I built a system where Claude Code acts as an autonomous ML researcher on tabular binary clas...
In building apps with Anthropic AI assistant Claude, users regularly receive a message that their daily limit has been reached and must w...
Sephora has launched an AI-powered shopping app within ChatGPT, offering a new personalised beauty discovery experience.
Google seems to understand that the current AI landscape has users jumping from one model to the next, looking for...
Abstract page for arXiv paper 2603.18788: Mi:dm K 2.5 Pro
Abstract page for arXiv paper 2603.17729: SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition
Abstract page for arXiv paper 2602.01047: Residual Decoding: Mitigating Hallucinations in Large Vision-Language Models via History-Aware ...
Abstract page for arXiv paper 2602.07023: Behavioral Consistency Validation for LLM Agents: An Analysis of Trading-Style Switching throug...
Abstract page for arXiv paper 2601.13719: Hierarchical Long Video Understanding with Audiovisual Entity Cohesion and Agentic Search
Abstract page for arXiv paper 2603.13606: NCCL EP: Towards a Unified Expert Parallel Communication API for NCCL
Abstract page for arXiv paper 2601.07315: VLM-CAD: VLM-Optimized Collaborative Agent Design Workflow for Analog Circuit Sizing
Abstract page for arXiv paper 2512.22387: AI-Generated Code Is Not Reproducible (Yet): An Empirical Study of Dependency Gaps in LLM-Based...
Abstract page for arXiv paper 2512.02487: Masking Matters: Unlocking the Spatial Reasoning Capabilities of LLMs for 3D Scene-Language Und...
Abstract page for arXiv paper 2510.26865: Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
Abstract page for arXiv paper 2511.05919: Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
Abstract page for arXiv paper 2510.23421: Quantifying Systemic Vulnerability in the Foundation Model Industry
Abstract page for arXiv paper 2511.12449: MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Un...
Abstract page for arXiv paper 2510.15994: MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
Abstract page for arXiv paper 2510.14967: Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Sear...
Abstract page for arXiv paper 2510.01483: VL-KnG: Persistent Spatiotemporal Knowledge Graphs from Egocentric Video for Embodied Scene Und...
Abstract page for arXiv paper 2509.20502: MARS: toward more efficient multi-agent collaboration for LLM reasoning
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime