Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others | TechCrunch

Learn how to use Spotify, Canva, Figma, Expedia, and other apps directly in ChatGPT.

TechCrunch - AI · 10 min · about 2 hours ago

Llms

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

AI Tools & Products · 7 min · about 8 hours ago

Llms

Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains

AI Tools & Products · 5 min · about 8 hours ago

All Content

Llms

[2601.11527] "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

Abstract page for arXiv paper 2601.11527: "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

arXiv - AI · 3 min · about 1 month ago

Llms

[2601.11063] EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robot Collaboration

Abstract page for arXiv paper 2601.11063: EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robo...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.08393] Controlled LLM Training on Spectral Sphere

Abstract page for arXiv paper 2601.08393: Controlled LLM Training on Spectral Sphere

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.04548] Identifying Good and Bad Neurons for Task-Level Controllable LLMs

Abstract page for arXiv paper 2601.04548: Identifying Good and Bad Neurons for Task-Level Controllable LLMs

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.02663] When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

Abstract page for arXiv paper 2601.02663: When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.15163] MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Abstract page for arXiv paper 2512.15163: MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP...

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.14391] RePo: Language Models with Context Re-Positioning

Abstract page for arXiv paper 2512.14391: RePo: Language Models with Context Re-Positioning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2512.13586] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Abstract page for arXiv paper 2512.13586: ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.21399] Steering Awareness: Models Can Be Trained to Detect Activation Steering

Abstract page for arXiv paper 2511.21399: Steering Awareness: Models Can Be Trained to Detect Activation Steering

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.16786] Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

Abstract page for arXiv paper 2511.16786: Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

Abstract page for arXiv paper 2511.03153: RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

Abstract page for arXiv paper 2511.01870: CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.27173] FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Error Correction

Abstract page for arXiv paper 2510.27173: FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Erro...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.22503] LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

Abstract page for arXiv paper 2510.22503: LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.20333] GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

Abstract page for arXiv paper 2510.20333: GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Envi...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.18876] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Abstract page for arXiv paper 2510.18876: Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.16714] SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes

Abstract page for arXiv paper 2510.16714: SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes

arXiv - AI · 3 min · about 1 month ago

Llms

[2510.16688] Pursuing Minimal Sufficiency in Spatial Reasoning

Abstract page for arXiv paper 2510.16688: Pursuing Minimal Sufficiency in Spatial Reasoning

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.00507] Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Abstract page for arXiv paper 2510.00507: Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.25149] Pretraining Large Language Models with NVFP4

Abstract page for arXiv paper 2509.25149: Pretraining Large Language Models with NVFP4

arXiv - Machine Learning · 5 min · about 1 month ago

Previous Page 92 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others | TechCrunch

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains

All Content

[2601.11527] "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

[2601.11063] EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robot Collaboration

[2601.08393] Controlled LLM Training on Spectral Sphere

[2601.04548] Identifying Good and Bad Neurons for Task-Level Controllable LLMs

[2601.02663] When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

[2512.15163] MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

[2512.14391] RePo: Language Models with Context Re-Positioning

[2512.13586] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

[2511.21399] Steering Awareness: Models Can Be Trained to Detect Activation Steering

[2511.16786] Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

[2510.27173] FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Error Correction

[2510.22503] LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

[2510.20333] GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

[2510.18876] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

[2510.16714] SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes

[2510.16688] Pursuing Minimal Sufficiency in Spatial Reasoning

[2510.00507] Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

[2509.25149] Pretraining Large Language Models with NVFP4

Related Topics

Stay updated with AI News