Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

Google Maps can now write captions for your photos using AI | TechCrunch

Gemini can now create captions when users are looking to share a photo or video.

TechCrunch - AI · 4 min · 18 minutes ago

Llms

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

submitted by /u/PatienceHistorical70 [link] [comments]

Reddit - Machine Learning · 1 min · about 2 hours ago

Llms

Stop Overcomplicating AI Workflows. This Is the Simple Framework

I’ve been working on building an agentic AI workflow system for business use cases and one thing became very clear very quickly. This is ...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

All Content

Llms

[2603.04606] PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion

Abstract page for arXiv paper 2603.04606: PDE foundation model-accelerated inverse estimation of system parameters in inertial confinemen...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.04545] An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

Abstract page for arXiv paper 2603.04545: An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.04478] Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation

Abstract page for arXiv paper 2603.04478: Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teac...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.07075] LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

Abstract page for arXiv paper 2602.07075: LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.23236] YuriiFormer: A Suite of Nesterov-Accelerated Transformers

Abstract page for arXiv paper 2601.23236: YuriiFormer: A Suite of Nesterov-Accelerated Transformers

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.21149] Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

Abstract page for arXiv paper 2601.21149: Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.16333] Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextually Important Moments

Abstract page for arXiv paper 2601.16333: Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextuall...

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.14327] Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLM

Abstract page for arXiv paper 2601.14327: Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLM

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.11527] "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

Abstract page for arXiv paper 2601.11527: "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

arXiv - AI · 3 min · about 1 month ago

Llms

[2601.11063] EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robot Collaboration

Abstract page for arXiv paper 2601.11063: EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robo...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.08393] Controlled LLM Training on Spectral Sphere

Abstract page for arXiv paper 2601.08393: Controlled LLM Training on Spectral Sphere

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.04548] Identifying Good and Bad Neurons for Task-Level Controllable LLMs

Abstract page for arXiv paper 2601.04548: Identifying Good and Bad Neurons for Task-Level Controllable LLMs

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.02663] When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

Abstract page for arXiv paper 2601.02663: When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.15163] MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

Abstract page for arXiv paper 2512.15163: MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP...

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.14391] RePo: Language Models with Context Re-Positioning

Abstract page for arXiv paper 2512.14391: RePo: Language Models with Context Re-Positioning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2512.13586] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Abstract page for arXiv paper 2512.13586: ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.21399] Steering Awareness: Models Can Be Trained to Detect Activation Steering

Abstract page for arXiv paper 2511.21399: Steering Awareness: Models Can Be Trained to Detect Activation Steering

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.16786] Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

Abstract page for arXiv paper 2511.16786: Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

Abstract page for arXiv paper 2511.03153: RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

Abstract page for arXiv paper 2511.01870: CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 106 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Google Maps can now write captions for your photos using AI | TechCrunch

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

Stop Overcomplicating AI Workflows. This Is the Simple Framework

All Content

[2603.04606] PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion

[2603.04545] An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs

[2603.04478] Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation

[2602.07075] LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

[2601.23236] YuriiFormer: A Suite of Nesterov-Accelerated Transformers

[2601.21149] Mobility-Embedded POIs: Learning What A Place Is and How It Is Used from Human Movement

[2601.16333] Where is the multimodal goal post? On the Ability of Foundation Models to Recognize Contextually Important Moments

[2601.14327] Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLM

[2601.11527] "What if she doesn't feel the same?" What Happens When We Ask AI for Relationship Advice

[2601.11063] EmboTeam: Grounding LLM Reasoning into Reactive Behavior Trees via PDDL for Embodied Multi-Robot Collaboration

[2601.08393] Controlled LLM Training on Spectral Sphere

[2601.04548] Identifying Good and Bad Neurons for Task-Level Controllable LLMs

[2601.02663] When Do Tools and Planning Help Large Language Models Think? A Cost- and Latency-Aware Benchmark

[2512.15163] MCP-SafetyBench: A Benchmark for Safety Evaluation of Large Language Models with Real-World MCP Servers

[2512.14391] RePo: Language Models with Context Re-Positioning

[2512.13586] ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

[2511.21399] Steering Awareness: Models Can Be Trained to Detect Activation Steering

[2511.16786] Revisiting Multimodal KV Cache Compression: A Frequency-Domain-Guided Outlier-KV-Aware Approach

[2511.03153] RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring

[2511.01870] CytoNet: A Foundation Model for the Human Cerebral Cortex at Cellular Resolution

Related Topics

Stay updated with AI News