Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

I Asked ChatGPT 500 Questions. Here Are the Ads I Saw Most Often | WIRED

Ads are rolling out across the US on ChatGPT’s free tier. I asked OpenAI's bot 500 questions to see what these ads were like and how they...

Wired - AI · 9 min · 20 minutes ago

Llms

Abacus.Ai Claw LLM consumes an incredible amount of credit without any usage :(

Three days ago, I clicked the "Deploy OpenClaw In Seconds" button to get an overview of the new service, but I didn't build any automatio...

Reddit - Artificial Intelligence · 1 min · 20 minutes ago

Llms

Google’s Gemini AI app debuts in Hong Kong

Tech giant’s chatbot service tops Apple’s app store chart in the city.

AI Tools & Products · 2 min · about 2 hours ago

All Content

Llms

[2509.24296] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

Abstract page for arXiv paper 2509.24296: DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

arXiv - AI · 4 min · about 7 hours ago

Llms

[2509.19354] GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response

Abstract page for arXiv paper 2509.19354: GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response

arXiv - AI · 3 min · about 7 hours ago

Llms

[2508.15090] Mapping the Course for Prompt-based Structured Prediction

Abstract page for arXiv paper 2508.15090: Mapping the Course for Prompt-based Structured Prediction

arXiv - AI · 3 min · about 7 hours ago

Llms

[2507.20423] CodeNER: Code Prompting for Named Entity Recognition

Abstract page for arXiv paper 2507.20423: CodeNER: Code Prompting for Named Entity Recognition

arXiv - AI · 4 min · about 7 hours ago

Llms

[2410.12476] Retrieval-Reasoning Large Language Model-based Synthetic Clinical Trial Generation

Abstract page for arXiv paper 2410.12476: Retrieval-Reasoning Large Language Model-based Synthetic Clinical Trial Generation

arXiv - Machine Learning · 4 min · about 7 hours ago

Llms

[2506.14861] BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models

Abstract page for arXiv paper 2506.14861: BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models

arXiv - AI · 4 min · about 7 hours ago

Llms

[2506.13734] Instruction Following by Principled Boosting Attention of Large Language Models

Abstract page for arXiv paper 2506.13734: Instruction Following by Principled Boosting Attention of Large Language Models

arXiv - Machine Learning · 4 min · about 7 hours ago

Llms

[2506.12104] DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

Abstract page for arXiv paper 2506.12104: DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

arXiv - AI · 4 min · about 7 hours ago

Llms

[2505.24840] The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition

Abstract page for arXiv paper 2505.24840: The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition

arXiv - Machine Learning · 3 min · about 7 hours ago

Llms

[2410.15281] LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends

Abstract page for arXiv paper 2410.15281: LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments...

arXiv - AI · 4 min · about 7 hours ago

Llms

[2410.10700] LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

Abstract page for arXiv paper 2410.10700: LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

arXiv - AI · 4 min · about 7 hours ago

Llms

[2408.13366] CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers

Abstract page for arXiv paper 2408.13366: CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers

arXiv - Machine Learning · 3 min · about 7 hours ago

Llms

[2406.07737] The Future of AI-Driven Software Engineering

Abstract page for arXiv paper 2406.07737: The Future of AI-Driven Software Engineering

arXiv - Machine Learning · 3 min · about 7 hours ago

Llms

[2603.23610] Environment Maps: Structured Environmental Representations for Long-Horizon Agents

Abstract page for arXiv paper 2603.23610: Environment Maps: Structured Environmental Representations for Long-Horizon Agents

arXiv - AI · 4 min · about 7 hours ago

Llms

[2601.04426] XGrammar-2: Efficient Dynamic Structured Generation Engine for Agentic LLMs

Abstract page for arXiv paper 2601.04426: XGrammar-2: Efficient Dynamic Structured Generation Engine for Agentic LLMs

arXiv - AI · 3 min · about 7 hours ago

Llms

[2603.08561] RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

Abstract page for arXiv paper 2603.08561: RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

arXiv - AI · 4 min · about 7 hours ago

Llms

[2511.07436] Analysing Environmental Efficiency in AI for X-Ray Diagnosis

Abstract page for arXiv paper 2511.07436: Analysing Environmental Efficiency in AI for X-Ray Diagnosis

arXiv - AI · 4 min · about 7 hours ago

Llms

[2510.18087] Planned Diffusion

Abstract page for arXiv paper 2510.18087: Planned Diffusion

arXiv - AI · 4 min · about 7 hours ago

Llms

[2509.23768] From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Abstract page for arXiv paper 2509.23768: From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

arXiv - AI · 3 min · about 7 hours ago

Llms

[2512.18951] Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models

Abstract page for arXiv paper 2512.18951: Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models

arXiv - Machine Learning · 3 min · about 7 hours ago

Previous Page 2 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

I Asked ChatGPT 500 Questions. Here Are the Ads I Saw Most Often | WIRED

Abacus.Ai Claw LLM consumes an incredible amount of credit without any usage :(

Google’s Gemini AI app debuts in Hong Kong

All Content

[2509.24296] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models

[2509.19354] GeoResponder: Towards Building Geospatial LLMs for Time-Critical Disaster Response

[2508.15090] Mapping the Course for Prompt-based Structured Prediction

[2507.20423] CodeNER: Code Prompting for Named Entity Recognition

[2410.12476] Retrieval-Reasoning Large Language Model-based Synthetic Clinical Trial Generation

[2506.14861] BMFM-RNA: whole-cell expression decoding improves transcriptomic foundation models

[2506.13734] Instruction Following by Principled Boosting Attention of Large Language Models

[2506.12104] DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

[2505.24840] The LLM Bottleneck: Why Open-Source Vision LLMs Struggle with Hierarchical Visual Recognition

[2410.15281] LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends

[2410.10700] LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

[2408.13366] CodeRefine: A Pipeline for Enhancing LLM-Generated Code Implementations of Research Papers

[2406.07737] The Future of AI-Driven Software Engineering

[2603.23610] Environment Maps: Structured Environmental Representations for Long-Horizon Agents

[2601.04426] XGrammar-2: Efficient Dynamic Structured Generation Engine for Agentic LLMs

[2603.08561] RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

[2511.07436] Analysing Environmental Efficiency in AI for X-Ray Diagnosis

[2510.18087] Planned Diffusion

[2509.23768] From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

[2512.18951] Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models

Related Topics

Stay updated with AI News