Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

Plus: The FBI says a recent hack of its wiretap tools poses a national security risk, attackers stole Cisco source code as part of an ong...

Wired - AI · 9 min · 1 minute ago

Llms

People anxious about deviating from what AI tells them to do?

My friend came over yesterday to dye her hair. She had asked ChatGPT for the 'correct' way to do it. Chat told her to dye the ends first,...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

ChatGPT on trial: A landmark test of AI liability in the practice of law

AI Tools & Products · about 3 hours ago

All Content

Llms

[2603.20882] RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation

Abstract page for arXiv paper 2603.20882: RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for...

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

Abstract page for arXiv paper 2603.20854: SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20851] Can ChatGPT Really Understand Modern Chinese Poetry?

Abstract page for arXiv paper 2603.20851: Can ChatGPT Really Understand Modern Chinese Poetry?

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20843] HiCI: Hierarchical Construction-Integration for Long-Context Attention

Abstract page for arXiv paper 2603.20843: HiCI: Hierarchical Construction-Integration for Long-Context Attention

arXiv - Machine Learning · 3 min · 11 days ago

Llms

[2603.20730] Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks

Abstract page for arXiv paper 2603.20730: Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks

arXiv - AI · 4 min · 11 days ago

Llms

[2603.20673] PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

Abstract page for arXiv paper 2603.20673: PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20642] Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models

Abstract page for arXiv paper 2603.20642: Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geomet...

arXiv - AI · 4 min · 11 days ago

Llms

[2603.20637] AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing

Abstract page for arXiv paper 2603.20637: AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and M...

arXiv - AI · 4 min · 11 days ago

Llms

[2603.20586] MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

Abstract page for arXiv paper 2603.20586: MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20562] Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

Abstract page for arXiv paper 2603.20562: Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20531] Epistemic Observability in Language Models

Abstract page for arXiv paper 2603.20531: Epistemic Observability in Language Models

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.20514] Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study

Abstract page for arXiv paper 2603.20514: Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Sett...

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20513] ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Reformulation

Abstract page for arXiv paper 2603.20513: ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Re...

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20508] Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?

Abstract page for arXiv paper 2603.20508: Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?

arXiv - AI · 4 min · 11 days ago

Llms

[2603.20466] Diffutron: A Masked Diffusion Language Model for Turkish Language

Abstract page for arXiv paper 2603.20466: Diffutron: A Masked Diffusion Language Model for Turkish Language

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20450] Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Abstract page for arXiv paper 2603.20450: Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

arXiv - Machine Learning · 4 min · 11 days ago

Llms

[2603.20449] Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

Abstract page for arXiv paper 2603.20449: Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

arXiv - AI · 4 min · 11 days ago

Llms

[2603.20433] ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning Ability

Abstract page for arXiv paper 2603.20433: ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning ...

arXiv - AI · 3 min · 11 days ago

Llms

[2603.20432] Coding Agents are Effective Long-Context Processors

Abstract page for arXiv paper 2603.20432: Coding Agents are Effective Long-Context Processors

arXiv - AI · 4 min · 11 days ago

Llms

[2603.20406] Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation

Abstract page for arXiv paper 2603.20406: Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Trans...

arXiv - Machine Learning · 4 min · 11 days ago

Previous Page 69 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

People anxious about deviating from what AI tells them to do?

ChatGPT on trial: A landmark test of AI liability in the practice of law

All Content

[2603.20882] RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation

[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

[2603.20851] Can ChatGPT Really Understand Modern Chinese Poetry?

[2603.20843] HiCI: Hierarchical Construction-Integration for Long-Context Attention

[2603.20730] Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks

[2603.20673] PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

[2603.20642] Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models

[2603.20637] AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing

[2603.20586] MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

[2603.20562] Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

[2603.20531] Epistemic Observability in Language Models

[2603.20514] Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study

[2603.20513] ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Reformulation

[2603.20508] Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?

[2603.20466] Diffutron: A Masked Diffusion Language Model for Turkish Language

[2603.20450] Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

[2603.20449] Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

[2603.20433] ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning Ability

[2603.20432] Coding Agents are Effective Long-Context Processors

[2603.20406] Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation

Related Topics

Stay updated with AI News