Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Agents that write their own code at runtime and vote on capabilities, no human in the loop

hollowOS just hit v4.4 and I added something that I haven’t seen anyone else do. Previous versions gave you an OS for agents: structured ...

Reddit - Artificial Intelligence · 1 min ·
Google Maps can now write captions for your photos using AI | TechCrunch
Llms

Google Maps can now write captions for your photos using AI | TechCrunch

Gemini can now create captions when users are looking to share a photo or video.

TechCrunch - AI · 4 min ·
Llms

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

submitted by /u/PatienceHistorical70 [link] [comments]

Reddit - Machine Learning · 1 min ·

All Content

[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution
Llms

[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

Abstract page for arXiv paper 2511.01870: CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

arXiv - Machine Learning · 4 min ·
[2510.27173] FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Error Correction
Llms

[2510.27173] FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Error Correction

Abstract page for arXiv paper 2510.27173: FMint-SDE: A Multimodal Foundation Model for Accelerating Numerical Simulation of SDEs via Erro...

arXiv - Machine Learning · 4 min ·
[2510.22503] LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery
Llms

[2510.22503] LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

Abstract page for arXiv paper 2510.22503: LLEMA: Evolutionary Search with LLMs for Multi-Objective Materials Discovery

arXiv - Machine Learning · 4 min ·
[2510.20333] GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?
Llms

[2510.20333] GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

Abstract page for arXiv paper 2510.20333: GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Envi...

arXiv - AI · 4 min ·
[2510.18876] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs
Llms

[2510.18876] Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Abstract page for arXiv paper 2510.18876: Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

arXiv - AI · 4 min ·
[2510.16714] SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
Llms

[2510.16714] SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes

Abstract page for arXiv paper 2510.16714: SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes

arXiv - AI · 3 min ·
[2510.16688] Pursuing Minimal Sufficiency in Spatial Reasoning
Llms

[2510.16688] Pursuing Minimal Sufficiency in Spatial Reasoning

Abstract page for arXiv paper 2510.16688: Pursuing Minimal Sufficiency in Spatial Reasoning

arXiv - AI · 4 min ·
[2510.00507] Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Llms

[2510.00507] Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Abstract page for arXiv paper 2510.00507: Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

arXiv - AI · 4 min ·
[2509.25149] Pretraining Large Language Models with NVFP4
Llms

[2509.25149] Pretraining Large Language Models with NVFP4

Abstract page for arXiv paper 2509.25149: Pretraining Large Language Models with NVFP4

arXiv - Machine Learning · 5 min ·
[2510.00177] PrefDisco: Benchmarking Proactive Personalized Reasoning
Llms

[2510.00177] PrefDisco: Benchmarking Proactive Personalized Reasoning

Abstract page for arXiv paper 2510.00177: PrefDisco: Benchmarking Proactive Personalized Reasoning

arXiv - AI · 4 min ·
[2509.24210] BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models
Llms

[2509.24210] BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models

Abstract page for arXiv paper 2509.24210: BeyondBench: Contamination-Resistant Evaluation of Reasoning in Language Models

arXiv - Machine Learning · 4 min ·
[2509.23886] Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer
Llms

[2509.23886] Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

Abstract page for arXiv paper 2509.23886: Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer

arXiv - Machine Learning · 4 min ·
[2509.20321] Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones
Llms

[2509.20321] Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones

Abstract page for arXiv paper 2509.20321: Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones

arXiv - AI · 3 min ·
[2508.06249] In-Training Defenses against Emergent Misalignment in Language Models
Llms

[2508.06249] In-Training Defenses against Emergent Misalignment in Language Models

Abstract page for arXiv paper 2508.06249: In-Training Defenses against Emergent Misalignment in Language Models

arXiv - Machine Learning · 4 min ·
[2507.01785] MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining
Llms

[2507.01785] MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Abstract page for arXiv paper 2507.01785: MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

arXiv - Machine Learning · 4 min ·
[2506.23508] Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective
Llms

[2506.23508] Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

Abstract page for arXiv paper 2506.23508: Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

arXiv - AI · 4 min ·
[2506.01062] SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models
Llms

[2506.01062] SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

Abstract page for arXiv paper 2506.01062: SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models

arXiv - Machine Learning · 4 min ·
[2505.19255] VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use
Llms

[2505.19255] VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

Abstract page for arXiv paper 2505.19255: VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

arXiv - Machine Learning · 4 min ·
[2504.04372] Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models
Llms

[2504.04372] Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

Abstract page for arXiv paper 2504.04372: Assessing the Impact of Code Changes on the Fault Localizability of Large Language Models

arXiv - Machine Learning · 4 min ·
[2602.00485] Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models
Llms

[2602.00485] Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

Abstract page for arXiv paper 2602.00485: Replacing Parameters with Preferences: Federated Alignment of Heterogeneous Vision-Language Models

arXiv - AI · 4 min ·
Previous Page 107 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime