Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try rando...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

Claude Mythos and misguided open-weight fearmongering

AI Tools & Products · 9 min · about 6 hours ago

Llms

Anthropic Agrees to Rent CoreWeave AI Capacity to Power Claude

AI Tools & Products · 1 min · about 6 hours ago

All Content

Llms

[2510.06292] ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations

Abstract page for arXiv paper 2510.06292: ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.05064] Boomerang Distillation Enables Zero-Shot Model Size Interpolation

Abstract page for arXiv paper 2510.05064: Boomerang Distillation Enables Zero-Shot Model Size Interpolation

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.05174] Emergent Coordination in Multi-Agent Language Models

Abstract page for arXiv paper 2510.05174: Emergent Coordination in Multi-Agent Language Models

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.05132] Training Large Language Models To Reason In Parallel With Global Forking Tokens

Abstract page for arXiv paper 2510.05132: Training Large Language Models To Reason In Parallel With Global Forking Tokens

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.05069] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Abstract page for arXiv paper 2510.05069: SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

Abstract page for arXiv paper 2510.05109: Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on B...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.04682] TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

Abstract page for arXiv paper 2510.04682: TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.04067] What Scales in Cross-Entropy Scaling Law?

Abstract page for arXiv paper 2510.04067: What Scales in Cross-Entropy Scaling Law?

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Abstract page for arXiv paper 2510.02209: StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.03253] Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

Abstract page for arXiv paper 2510.03253: Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.02999] Untargeted Jailbreak Attack

Abstract page for arXiv paper 2510.02999: Untargeted Jailbreak Attack

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.02245] ExGRPO: Learning to Reason from Experience

Abstract page for arXiv paper 2510.02245: ExGRPO: Learning to Reason from Experience

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.01051] GEM: A Gym for Agentic LLMs

Abstract page for arXiv paper 2510.01051: GEM: A Gym for Agentic LLMs

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.00819] Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

Abstract page for arXiv paper 2510.00819: Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.25678] Massively Multimodal Foundation Models: A Framework for Capturing Interactions with Specialized Mixture-of-Experts

Abstract page for arXiv paper 2509.25678: Massively Multimodal Foundation Models: A Framework for Capturing Interactions with Specialized...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.00041] Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness

Abstract page for arXiv paper 2510.00041: Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.26601] MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

Abstract page for arXiv paper 2509.26601: MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

Abstract page for arXiv paper 2509.26432: AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.26346] EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Abstract page for arXiv paper 2509.26346: EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.24198] Negative Pre-activations Differentiate Syntax

Abstract page for arXiv paper 2509.24198: Negative Pre-activations Differentiate Syntax

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 158 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Claude Mythos and misguided open-weight fearmongering

Anthropic Agrees to Rent CoreWeave AI Capacity to Power Claude

All Content

[2510.06292] ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations

[2510.05064] Boomerang Distillation Enables Zero-Shot Model Size Interpolation

[2510.05174] Emergent Coordination in Multi-Agent Language Models

[2510.05132] Training Large Language Models To Reason In Parallel With Global Forking Tokens

[2510.05069] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

[2510.04682] TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

[2510.04067] What Scales in Cross-Entropy Scaling Law?

[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

[2510.03253] Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

[2510.02999] Untargeted Jailbreak Attack

[2510.02245] ExGRPO: Learning to Reason from Experience

[2510.01051] GEM: A Gym for Agentic LLMs

[2510.00819] Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

[2509.25678] Massively Multimodal Foundation Models: A Framework for Capturing Interactions with Specialized Mixture-of-Experts

[2510.00041] Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness

[2509.26601] MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

[2509.26346] EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

[2509.24198] Negative Pre-activations Differentiate Syntax

Related Topics

Stay updated with AI News