Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Over the last few days I was collecting free or low-cost AI tools that are actually useful if you want to build stuff, not just try rando...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

LLMs

Claude Mythos and misguided open-weight fearmongering

AI Tools & Products · 9 min · about 5 hours ago

LLMs

Anthropic Agrees to Rent CoreWeave AI Capacity to Power Claude

AI Tools & Products · 1 min · about 5 hours ago

All Content

LLMs

[2510.19807] Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning

Abstract page for arXiv paper 2510.19807: Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2511.02044] Regularization Through Reasoning: Systematic Improvements in Language Model Classification via Explanation-Enhanced Fine-Tuning

Abstract page for arXiv paper 2511.02044: Regularization Through Reasoning: Systematic Improvements in Language Model Classification via ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.18871] How Do LLMs Use Their Depth?

Abstract page for arXiv paper 2510.18871: How Do LLMs Use Their Depth?

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.18866] LightMem: Lightweight and Efficient Memory-Augmented Generation

Abstract page for arXiv paper 2510.18866: LightMem: Lightweight and Efficient Memory-Augmented Generation

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2511.00177] Can SAEs reveal and mitigate racial biases of LLMs in healthcare?

Abstract page for arXiv paper 2511.00177: Can SAEs reveal and mitigate racial biases of LLMs in healthcare?

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2511.00405] UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings

Abstract page for arXiv paper 2511.00405: UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.18560] WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

Abstract page for arXiv paper 2510.18560: WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.15905] Digital Companionship: Overlapping Uses of AI Companions and AI Assistants

Abstract page for arXiv paper 2510.15905: Digital Companionship: Overlapping Uses of AI Companions and AI Assistants

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.21910] Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

Abstract page for arXiv paper 2510.21910: Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.15863] PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

Abstract page for arXiv paper 2510.15863: PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.20264] Optimistic Task Inference for Behavior Foundation Models

Abstract page for arXiv paper 2510.20264: Optimistic Task Inference for Behavior Foundation Models

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.21314] A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization

Abstract page for arXiv paper 2510.21314: A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.13888] Reliable Fine-Grained Evaluation of Natural Language Math Proofs

Abstract page for arXiv paper 2510.13888: Reliable Fine-Grained Evaluation of Natural Language Math Proofs

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.17206] Soft-Masked Diffusion Language Models

Abstract page for arXiv paper 2510.17206: Soft-Masked Diffusion Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.18245] Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

Abstract page for arXiv paper 2510.18245: Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.10066] OBsmith: LLM-Powered JavaScript Obfuscator Testing

Abstract page for arXiv paper 2510.10066: OBsmith: LLM-Powered JavaScript Obfuscator Testing

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.09462] Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Abstract page for arXiv paper 2510.09462: Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models

Abstract page for arXiv paper 2510.13117: On the Reasoning Abilities of Masked Diffusion Language Models

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Abstract page for arXiv paper 2510.07940: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.06377] Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data

Abstract page for arXiv paper 2510.06377: Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data

arXiv - Machine Learning · 4 min · about 1 month ago
