Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

OpenAI now lets teams make custom bots that can do work on their own | The Verge

OpenAI is bringing “workspace” AI agents to users of its Business, Enterprise, Edu, and Teachers plans that can perform business tasks in...

The Verge - AI · 4 min

My Unsupervised Compliance Layer Project

A bit of context: my work has mostly been around building agentic pipelines. I really love the craft. My latest side project was a delibe...

Reddit - Artificial Intelligence · 1 min

I’m 17 and built an AI that flirts, remembers you, watches your shows, and replies to your reels…

V3 is done and it’s getting… weird. This thing now: auto-replies to DMs with tone adjustment, reads images, transcribes voice notes, repli...

Reddit - Artificial Intelligence · 1 min

All Content

[2603.04277] VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments

Abstract page for arXiv paper 2603.04277: VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments

arXiv - AI · 4 min

[2603.04259] When AI Fails, What Works? A Data-Driven Taxonomy of Real-World AI Risk Mitigation Strategies

Abstract page for arXiv paper 2603.04259: When AI Fails, What Works? A Data-Driven Taxonomy of Real-World AI Risk Mitigation Strategies

arXiv - AI · 4 min

[2603.04222] PRAM-R: A Perception-Reasoning-Action-Memory Framework with LLM-Guided Modality Routing for Adaptive Autonomous Driving

Abstract page for arXiv paper 2603.04222: PRAM-R: A Perception-Reasoning-Action-Memory Framework with LLM-Guided Modality Routing for Ada...

arXiv - AI · 3 min

[2603.04165] PlaneCycle: Training-Free 2D-to-3D Lifting of Foundation Models Without Adapters

Abstract page for arXiv paper 2603.04165: PlaneCycle: Training-Free 2D-to-3D Lifting of Foundation Models Without Adapters

arXiv - AI · 3 min

[2603.04177] CodeTaste: Can LLMs Generate Human-Level Code Refactorings?

Abstract page for arXiv paper 2603.04177: CodeTaste: Can LLMs Generate Human-Level Code Refactorings?

arXiv - AI · 3 min

[2603.04128] Crab$^{+}$: A Scalable and Unified Audio-Visual Scene Understanding Model with Explicit Cooperation

Abstract page for arXiv paper 2603.04128: Crab$^{+}$: A Scalable and Unified Audio-Visual Scene Understanding Model with Explicit Coopera...

arXiv - AI · 4 min

[2603.04162] Bielik-Q2-Sharp: A Comparative Study of Extreme 2-bit Quantization Methods for a Polish 11B Language Model

Abstract page for arXiv paper 2603.04162: Bielik-Q2-Sharp: A Comparative Study of Extreme 2-bit Quantization Methods for a Polish 11B Lan...

arXiv - AI · 3 min

[2603.04069] Monitoring Emergent Reward Hacking During Generation via Internal Activations

Abstract page for arXiv paper 2603.04069: Monitoring Emergent Reward Hacking During Generation via Internal Activations

arXiv - AI · 4 min

[2603.03683] CONCUR: Benchmarking LLMs for Concurrent Code Generation

Abstract page for arXiv paper 2603.03683: CONCUR: Benchmarking LLMs for Concurrent Code Generation

arXiv - Machine Learning · 4 min

[2603.04002] Discriminative Perception via Anchored Description for Reasoning Segmentation

Abstract page for arXiv paper 2603.04002: Discriminative Perception via Anchored Description for Reasoning Segmentation

arXiv - AI · 4 min

[2603.03589] stratum: A System Infrastructure for Massive Agent-Centric ML Workloads

Abstract page for arXiv paper 2603.03589: stratum: A System Infrastructure for Massive Agent-Centric ML Workloads

arXiv - Machine Learning · 4 min

[2603.03983] GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

Abstract page for arXiv paper 2603.03983: GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

arXiv - AI · 3 min

[2603.03583] ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

Abstract page for arXiv paper 2603.03583: ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer

arXiv - Machine Learning · 3 min

[2603.03964] BLOCK: An Open-Source Bi-Stage MLLM Character-to-Skin Pipeline for Minecraft

Abstract page for arXiv paper 2603.03964: BLOCK: An Open-Source Bi-Stage MLLM Character-to-Skin Pipeline for Minecraft

arXiv - AI · 3 min

[2603.03915] Rethinking Role-Playing Evaluation: Anonymous Benchmarking and a Systematic Study of Personality Effects

Abstract page for arXiv paper 2603.03915: Rethinking Role-Playing Evaluation: Anonymous Benchmarking and a Systematic Study of Personalit...

arXiv - AI · 3 min

[2603.03897] IROSA: Interactive Robot Skill Adaptation using Natural Language

Abstract page for arXiv paper 2603.03897: IROSA: Interactive Robot Skill Adaptation using Natural Language

arXiv - AI · 3 min

[2603.03881] On the Suitability of LLM-Driven Agents for Dark Pattern Audits

Abstract page for arXiv paper 2603.03881: On the Suitability of LLM-Driven Agents for Dark Pattern Audits

arXiv - AI · 4 min

[2603.03336] Prompt-Dependent Ranking of Large Language Models with Uncertainty Quantification

Abstract page for arXiv paper 2603.03336: Prompt-Dependent Ranking of Large Language Models with Uncertainty Quantification

arXiv - Machine Learning · 4 min

[2603.03310] Entropic-Time Inference: Self-Organizing Large Language Model Decoding Beyond Attention

Abstract page for arXiv paper 2603.03310: Entropic-Time Inference: Self-Organizing Large Language Model Decoding Beyond Attention

arXiv - Machine Learning · 3 min

[2603.03823] SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

Abstract page for arXiv paper 2603.03823: SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

arXiv - AI · 3 min