Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

What I learned about multi-agent coordination running 9 specialized Claude agents

I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...

Reddit - Artificial Intelligence · 1 min ·
Llms

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

I've been reviewing how various AI memory systems evaluate their performance and noticed a fundamental issue with cross-system comparison...

Reddit - Machine Learning · 1 min ·
Shifting to AI model customization is an architectural imperative | MIT Technology Review
Llms

Shifting to AI model customization is an architectural imperative | MIT Technology Review

In the early days of large language models (LLMs), we grew accustomed to massive 10x jumps in reasoning and coding capability with every ...

MIT Technology Review · 6 min ·

All Content

Llms

[R] Evaluating MLLMs with Child-Inspired Cognitive Tasks

Hey there, we’re sharing KidGym, an interactive 2D grid-based benchmark for evaluating MLLMs in continuous, trajectory-based interaction,...

Reddit - Machine Learning · 1 min ·
Anthropic’s Claude Code and Cowork can control your computer | The Verge
Llms

Anthropic’s Claude Code and Cowork can control your computer | The Verge

Anthropic has updated Claude to perform tasks in its Code and Cowork AI tools autonomously by using your computer for you.

The Verge - AI · 4 min ·
Agile Robots becomes the latest robotics company to partner with Google DeepMind | TechCrunch
Llms

Agile Robots becomes the latest robotics company to partner with Google DeepMind | TechCrunch

Agile Robots will incorporate Google DeepMind's robotics foundation models into its bots while collecting data for the AI research lab.

TechCrunch - AI · 4 min ·
Llms

Open Source Alternative to NotebookLM

For those of you who aren't familiar with SurfSense, SurfSense is an open-source alternative to NotebookLM for teams. It connects any LLM...

Reddit - Artificial Intelligence · 1 min ·
This 30-minute ChatGPT routine fixed my afternoon slump and changed everything
Llms

This 30-minute ChatGPT routine fixed my afternoon slump and changed everything

After using ChatGPT to help get me more focused and productive in the morning, I used it to do the same thing in the afternoon.

AI Tools & Products · 9 min ·
Llms

Google CEO Sundar Pichai’s plan to make Gemini the only AI that matters

Google CEO Sundar Pichai aims to establish Gemini as the leading AI technology, though specific details on the plan are not available.

AI Tools & Products · 1 min ·
[2603.12055] Continual Learning with Vision-Language Models via Semantic-Geometry Preservation
Llms

[2603.12055] Continual Learning with Vision-Language Models via Semantic-Geometry Preservation

Abstract page for arXiv paper 2603.12055: Continual Learning with Vision-Language Models via Semantic-Geometry Preservation

arXiv - Machine Learning · 4 min ·
[2602.10273] Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning
Llms

[2602.10273] Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning

Abstract page for arXiv paper 2602.10273: Power-SMC: Low-Latency Sequence-Level Power Sampling for Training-Free LLM Reasoning

arXiv - Machine Learning · 4 min ·
[2602.10218] ACE-RTL: When Agentic Context Evolution Meets RTL-Specialized LLMs
Llms

[2602.10218] ACE-RTL: When Agentic Context Evolution Meets RTL-Specialized LLMs

Abstract page for arXiv paper 2602.10218: ACE-RTL: When Agentic Context Evolution Meets RTL-Specialized LLMs

arXiv - Machine Learning · 3 min ·
[2602.00004] C$^2$-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models
Llms

[2602.00004] C$^2$-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models

Abstract page for arXiv paper 2602.00004: C$^2$-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models

arXiv - Machine Learning · 4 min ·
[2508.08441] SpectraLLM: Uncovering the Ability of LLMs for Molecule Structure Elucidation from Multi-Spectral
Llms

[2508.08441] SpectraLLM: Uncovering the Ability of LLMs for Molecule Structure Elucidation from Multi-Spectral

Abstract page for arXiv paper 2508.08441: SpectraLLM: Uncovering the Ability of LLMs for Molecule Structure Elucidation from Multi-Spectral

arXiv - Machine Learning · 4 min ·
[2411.16196] Learn from Foundation Model: Fruit Detection Model without Manual Annotation
Llms

[2411.16196] Learn from Foundation Model: Fruit Detection Model without Manual Annotation

Abstract page for arXiv paper 2411.16196: Learn from Foundation Model: Fruit Detection Model without Manual Annotation

arXiv - Machine Learning · 4 min ·
[2603.17074] PRISM: Demystifying Retention and Interaction in Mid-Training
Llms

[2603.17074] PRISM: Demystifying Retention and Interaction in Mid-Training

Abstract page for arXiv paper 2603.17074: PRISM: Demystifying Retention and Interaction in Mid-Training

arXiv - Machine Learning · 4 min ·
[2603.08104] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
Llms

[2603.08104] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

Abstract page for arXiv paper 2603.08104: Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

arXiv - Machine Learning · 4 min ·
[2602.03773] Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
Llms

[2602.03773] Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Abstract page for arXiv paper 2602.03773: Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

arXiv - Machine Learning · 4 min ·
[2601.03385] SIGMA: Scalable Spectral Insights for LLM Model Collapse
Llms

[2601.03385] SIGMA: Scalable Spectral Insights for LLM Model Collapse

Abstract page for arXiv paper 2601.03385: SIGMA: Scalable Spectral Insights for LLM Model Collapse

arXiv - Machine Learning · 4 min ·
[2512.19735] Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting
Llms

[2512.19735] Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting

Abstract page for arXiv paper 2512.19735: Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting

arXiv - Machine Learning · 4 min ·
[2512.10656] Token Sample Complexity of Attention
Llms

[2512.10656] Token Sample Complexity of Attention

Abstract page for arXiv paper 2512.10656: Token Sample Complexity of Attention

arXiv - Machine Learning · 4 min ·
[2509.24302] LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces
Llms

[2509.24302] LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces

Abstract page for arXiv paper 2509.24302: LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces

arXiv - Machine Learning · 4 min ·
[2509.21861] SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning
Llms

[2509.21861] SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

Abstract page for arXiv paper 2509.21861: SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

arXiv - Machine Learning · 4 min ·
Previous Page 41 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime