Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2601.22451] Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validation Framework

Abstract page for arXiv paper 2601.22451: Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validat...

arXiv - AI · 4 min · 3 minutes ago

Llms

[2601.21463] Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs

Abstract page for arXiv paper 2601.21463: Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs

arXiv - AI · 4 min · 3 minutes ago

Llms

[2601.16206] Computer Environments Elicit General Agentic Intelligence in LLMs

Abstract page for arXiv paper 2601.16206: Computer Environments Elicit General Agentic Intelligence in LLMs

arXiv - AI · 4 min · 3 minutes ago

All Content

Llms

[2603.04692] Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Models Using Multi-Dataset Embeddings

Abstract page for arXiv paper 2603.04692: Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Mod...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.04606] PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion

Abstract page for arXiv paper 2603.04606: PDE foundation model-accelerated inverse estimation of system parameters in inertial confinemen...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.04545] An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

Abstract page for arXiv paper 2603.04545: An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.04478] Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation

Abstract page for arXiv paper 2603.04478: Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teac...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.07075] LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

Abstract page for arXiv paper 2602.07075: LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.23236] YuriiFormer: A Suite of Nesterov-Accelerated Transformers

Abstract page for arXiv paper 2601.23236: YuriiFormer: A Suite of Nesterov-Accelerated Transformers

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.21149] Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

Abstract page for arXiv paper 2601.21149: Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.16333] Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextually Important Moments

Abstract page for arXiv paper 2601.16333: Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextuall...

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.14327] Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLM

Abstract page for arXiv paper 2601.14327: Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLM

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.11527] "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

Abstract page for arXiv paper 2601.11527: "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

arXiv - AI · 3 min · about 1 month ago

Llms

[2601.11063] EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robot Collaboration

Abstract page for arXiv paper 2601.11063: EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robo...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.08393] Controlled LLM Training on Spectral Sphere

Abstract page for arXiv paper 2601.08393: Controlled LLM Training on Spectral Sphere

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.04548] Identifying Good and Bad Neurons for Task-Level Controllable LLMs

Abstract page for arXiv paper 2601.04548: Identifying Good and Bad Neurons for Task-Level Controllable LLMs

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.02663] When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

Abstract page for arXiv paper 2601.02663: When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.15163] MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Abstract page for arXiv paper 2512.15163: MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP...

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.14391] RePo: Language Models with Context Re-Positioning

Abstract page for arXiv paper 2512.14391: RePo: Language Models with Context Re-Positioning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2512.13586] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Abstract page for arXiv paper 2512.13586: ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.21399] Steering Awareness: Models Can Be Trained to Detect Activation Steering

Abstract page for arXiv paper 2511.21399: Steering Awareness: Models Can Be Trained to Detect Activation Steering

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.16786] Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

Abstract page for arXiv paper 2511.16786: Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

Abstract page for arXiv paper 2511.03153: RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

arXiv - AI · 4 min · about 1 month ago

Previous Page 126 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

[2601.22451] Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validation Framework

[2601.21463] Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs

[2601.16206] Computer Environments Elicit General Agentic Intelligence in LLMs

All Content

[2603.04692] Engineering Regression Without Real-Data Training: Domain Adaptation for Tabular Foundation Models Using Multi-Dataset Embeddings

[2603.04606] PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion

[2603.04545] An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

[2603.04478] Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation

[2602.07075] LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

[2601.23236] YuriiFormer: A Suite of Nesterov-Accelerated Transformers

[2601.21149] Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

[2601.16333] Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextually Important Moments

[2601.14327] Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLM

[2601.11527] "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

[2601.11063] EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robot Collaboration

[2601.08393] Controlled LLM Training on Spectral Sphere

[2601.04548] Identifying Good and Bad Neurons for Task-Level Controllable LLMs

[2601.02663] When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

[2512.15163] MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

[2512.14391] RePo: Language Models with Context Re-Positioning

[2512.13586] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

[2511.21399] Steering Awareness: Models Can Be Trained to Detect Activation Steering

[2511.16786] Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

Related Topics

Stay updated with AI News