Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Vance says Iran sent 3 different versions of 10-point proposal, one of them 'written by ChatGPT'

submitted by /u/esporx

Reddit - Artificial Intelligence · 1 min ·
[2601.22451] Countering the Over-Reliance Trap: Mitigating Object Hallucination for LVLMs via a Self-Validation Framework

arXiv - AI · 4 min ·
[2601.21463] Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs

arXiv - AI · 4 min ·

All Content

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

arXiv - AI · 4 min ·
[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

arXiv - Machine Learning · 4 min ·
[2510.27173] FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Error Correction

arXiv - Machine Learning · 4 min ·
[2510.22503] LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

arXiv - Machine Learning · 4 min ·
[2510.20333] GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

arXiv - AI · 4 min ·
[2510.18876] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

arXiv - AI · 4 min ·
[2510.16714] SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes

arXiv - AI · 3 min ·
[2510.16688] Pursuing Minimal Sufficiency in Spatial Reasoning

arXiv - AI · 4 min ·
[2510.00507] Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

arXiv - AI · 4 min ·
[2509.25149] Pretraining Large Language Models with NVFP4

arXiv - Machine Learning · 5 min ·
[2510.00177] PrefDisco: Benchmarking Proactive Personalized Reasoning

arXiv - AI · 4 min ·
[2509.24210] BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models

arXiv - Machine Learning · 4 min ·
[2509.23886] Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

arXiv - Machine Learning · 4 min ·
[2509.20321] Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones

arXiv - AI · 3 min ·
[2508.06249] In-Training Defenses against Emergent Misalignment in Language Models

arXiv - Machine Learning · 4 min ·
[2507.01785] MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

arXiv - Machine Learning · 4 min ·
[2506.23508] Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

arXiv - AI · 4 min ·
[2506.01062] SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

arXiv - Machine Learning · 4 min ·
[2505.19255] VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

arXiv - Machine Learning · 4 min ·
[2504.04372] Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

arXiv - Machine Learning · 4 min ·