Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

AI is helpful but still not “there” yet

what I mean is that every time I use Claude, or Grok or any of the AI platforms and tools, I realize how far this technology is from repl...

Reddit - Artificial Intelligence · 1 min ·
ChatGPT Has 'Goblin' Mania in the US. In China It Will 'Catch You Steadily' | WIRED
Llms

ChatGPT Has 'Goblin' Mania in the US. In China It Will 'Catch You Steadily' | WIRED

OpenAI's chatbot has some weird linguistic tics in Chinese that are driving users crazy.

Wired - AI · 8 min ·
OpenClaw and Claude can put your AI-generated podcasts in Spotify | The Verge
Llms

OpenClaw and Claude can put your AI-generated podcasts in Spotify | The Verge

Save to Spotify is a new command-line tool that lets AI agents save audio alongside your other podcasts.

The Verge - AI · 4 min ·

All Content

[2511.08616] Reasoning on Time-Series for Financial Technical Analysis
Llms

[2511.08616] Reasoning on Time-Series for Financial Technical Analysis

Abstract page for arXiv paper 2511.08616: Reasoning on Time-Series for Financial Technical Analysis

arXiv - Machine Learning · 4 min ·
[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling
Llms

[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling

Abstract page for arXiv paper 2512.17052: Dynamic Tool Dependency Retrieval for Efficient Function Calling

arXiv - Machine Learning · 4 min ·
[2512.11582] Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model
Llms

[2512.11582] Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

Abstract page for arXiv paper 2512.11582: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

arXiv - Machine Learning · 4 min ·
[2512.04695] TRINITY: An Evolved LLM Coordinator
Llms

[2512.04695] TRINITY: An Evolved LLM Coordinator

Abstract page for arXiv paper 2512.04695: TRINITY: An Evolved LLM Coordinator

arXiv - Machine Learning · 4 min ·
[2512.03324] Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs
Llms

[2512.03324] Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

Abstract page for arXiv paper 2512.03324: Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

arXiv - Machine Learning · 4 min ·
[2511.20099] QiMeng-CRUX: Narrowing the Gap between Natural Language and Verilog via Core Refined Understanding eXpression
Llms

[2511.20099] QiMeng-CRUX: Narrowing the Gap between Natural Language and Verilog via Core Refined Understanding eXpression

Abstract page for arXiv paper 2511.20099: QiMeng-CRUX: Narrowing the Gap between Natural Language and Verilog via Core Refined Understand...

arXiv - Machine Learning · 4 min ·
[2511.19473] WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning
Llms

[2511.19473] WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

Abstract page for arXiv paper 2511.19473: WavefrontDiffusion: Dynamic Decoding Schedule for Improved Reasoning

arXiv - Machine Learning · 4 min ·
[2510.22210] LSPRAG: LSP-Guided RAG for Language-Agnostic Real-Time Unit Test Generation
Llms

[2510.22210] LSPRAG: LSP-Guided RAG for Language-Agnostic Real-Time Unit Test Generation

Abstract page for arXiv paper 2510.22210: LSPRAG: LSP-Guided RAG for Language-Agnostic Real-Time Unit Test Generation

arXiv - AI · 4 min ·
[2510.20487] Steering Evaluation-Aware Language Models to Act Like They Are Deployed
Llms

[2510.20487] Steering Evaluation-Aware Language Models to Act Like They Are Deployed

Abstract page for arXiv paper 2510.20487: Steering Evaluation-Aware Language Models to Act Like They Are Deployed

arXiv - AI · 4 min ·
[2510.19807] Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning
Llms

[2510.19807] Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning

Abstract page for arXiv paper 2510.19807: Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning

arXiv - Machine Learning · 4 min ·
[2511.02044] Regularization Through Reasoning: Systematic Improvements in Language Model Classification via Explanation-Enhanced Fine-Tuning
Llms

[2511.02044] Regularization Through Reasoning: Systematic Improvements in Language Model Classification via Explanation-Enhanced Fine-Tuning

Abstract page for arXiv paper 2511.02044: Regularization Through Reasoning: Systematic Improvements in Language Model Classification via ...

arXiv - Machine Learning · 4 min ·
[2510.18871] How Do LLMs Use Their Depth?
Llms

[2510.18871] How Do LLMs Use Their Depth?

Abstract page for arXiv paper 2510.18871: How Do LLMs Use Their Depth?

arXiv - AI · 4 min ·
[2510.18866] LightMem: Lightweight and Efficient Memory-Augmented Generation
Llms

[2510.18866] LightMem: Lightweight and Efficient Memory-Augmented Generation

Abstract page for arXiv paper 2510.18866: LightMem: Lightweight and Efficient Memory-Augmented Generation

arXiv - Machine Learning · 4 min ·
[2511.00177] Can SAEs reveal and mitigate racial biases of LLMs in healthcare?
Llms

[2511.00177] Can SAEs reveal and mitigate racial biases of LLMs in healthcare?

Abstract page for arXiv paper 2511.00177: Can SAEs reveal and mitigate racial biases of LLMs in healthcare?

arXiv - Machine Learning · 4 min ·
[2511.00405] UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings
Llms

[2511.00405] UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings

Abstract page for arXiv paper 2511.00405: UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings

arXiv - Machine Learning · 4 min ·
[2510.18560] WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality
Llms

[2510.18560] WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

Abstract page for arXiv paper 2510.18560: WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

arXiv - AI · 4 min ·
[2510.15905] Digital Companionship: Overlapping Uses of AI Companions and AI Assistants
Llms

[2510.15905] Digital Companionship: Overlapping Uses of AI Companions and AI Assistants

Abstract page for arXiv paper 2510.15905: Digital Companionship: Overlapping Uses of AI Companions and AI Assistants

arXiv - AI · 4 min ·
[2510.21910] Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks
Llms

[2510.21910] Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

Abstract page for arXiv paper 2510.21910: Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

arXiv - Machine Learning · 4 min ·
[2510.15863] PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction
Llms

[2510.15863] PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

Abstract page for arXiv paper 2510.15863: PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

arXiv - AI · 4 min ·
[2510.20264] Optimistic Task Inference for Behavior Foundation Models
Llms

[2510.20264] Optimistic Task Inference for Behavior Foundation Models

Abstract page for arXiv paper 2510.20264: Optimistic Task Inference for Behavior Foundation Models

arXiv - Machine Learning · 4 min ·
Previous Page 336 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime