What I learned about multi-agent coordination running 9 specialized Claude agents
I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...
I've been reviewing how various AI memory systems evaluate their performance and noticed a fundamental issue with cross-system comparison...
In the early days of large language models (LLMs), we grew accustomed to massive 10x jumps in reasoning and coding capability with every ...
Hey there, we’re sharing KidGym, an interactive 2D grid-based benchmark for evaluating MLLMs in continuous, trajectory-based interaction,...
Anthropic has updated Claude to perform tasks autonomously in its Claude Code and Cowork tools by using your computer for you.
Agile Robots will incorporate Google DeepMind's robotics foundation models into its bots while collecting data for the AI research lab.
For those of you who aren't familiar with SurfSense, SurfSense is an open-source alternative to NotebookLM for teams. It connects any LLM...
After using ChatGPT to help me get more focused and productive in the morning, I used it to do the same in the afternoon.
Google CEO Sundar Pichai aims to establish Gemini as the leading AI technology, though specific details on the plan are not available.
Abstract page for arXiv paper 2603.12055: Continual Learning with Vision-Language Models via Semantic-Geometry Preservation
Abstract page for arXiv paper 2602.10273: Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning
Abstract page for arXiv paper 2602.10218: ACE-RTL: When Agentic Context Evolution Meets RTL-Specialized LLMs
Abstract page for arXiv paper 2602.00004: C$^2$-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models
Abstract page for arXiv paper 2508.08441: SpectraLLM: Uncovering the Ability of LLMs for Molecule Structure Elucidation from Multi-Spectral
Abstract page for arXiv paper 2411.16196: Learn from Foundation Model: Fruit Detection Model without Manual Annotation
Abstract page for arXiv paper 2603.17074: PRISM: Demystifying Retention and Interaction in Mid-Training
Abstract page for arXiv paper 2603.08104: Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
Abstract page for arXiv paper 2602.03773: Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
Abstract page for arXiv paper 2601.03385: SIGMA: Scalable Spectral Insights for LLM Model Collapse
Abstract page for arXiv paper 2512.19735: Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting
Abstract page for arXiv paper 2512.10656: Token Sample Complexity of Attention
Abstract page for arXiv paper 2509.24302: LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces
Abstract page for arXiv paper 2509.21861: SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning