Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED
Llms

Hackers Are Posting the Claude Code Leak With Bonus Malware | WIRED

Plus: The FBI says a recent hack of its wiretap tools poses a national security risk, attackers stole Cisco source code as part of an ong...

Wired - AI · 9 min ·
Llms

People anxious about deviating from what AI tells them to do?

My friend came over yesterday to dye her hair. She had asked ChatGPT for the 'correct' way to do it. Chat told her to dye the ends first,...

Reddit - Artificial Intelligence · 1 min ·
Llms

ChatGPT on trial: A landmark test of AI liability in the practice of law

AI Tools & Products ·

All Content

[2603.20882] RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation
Llms

[2603.20882] RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation

Abstract page for arXiv paper 2603.20882: RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for...

arXiv - Machine Learning · 4 min ·
[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch
Llms

[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

Abstract page for arXiv paper 2603.20854: SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

arXiv - AI · 3 min ·
[2603.20851] Can ChatGPT Really Understand Modern Chinese Poetry?
Llms

[2603.20851] Can ChatGPT Really Understand Modern Chinese Poetry?

Abstract page for arXiv paper 2603.20851: Can ChatGPT Really Understand Modern Chinese Poetry?

arXiv - AI · 3 min ·
[2603.20843] HiCI: Hierarchical Construction-Integration for Long-Context Attention
Llms

[2603.20843] HiCI: Hierarchical Construction-Integration for Long-Context Attention

Abstract page for arXiv paper 2603.20843: HiCI: Hierarchical Construction-Integration for Long-Context Attention

arXiv - Machine Learning · 3 min ·
[2603.20730] Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks
Llms

[2603.20730] Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks

Abstract page for arXiv paper 2603.20730: Reasoning Topology Matters: Network-of-Thought for Complex Reasoning Tasks

arXiv - AI · 4 min ·
[2603.20673] PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs
Llms

[2603.20673] PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

Abstract page for arXiv paper 2603.20673: PAVE: Premise-Aware Validation and Editing for Retrieval-Augmented LLMs

arXiv - AI · 3 min ·
[2603.20642] Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models
Llms

[2603.20642] Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models

Abstract page for arXiv paper 2603.20642: Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geomet...

arXiv - AI · 4 min ·
[2603.20637] AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing
Llms

[2603.20637] AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing

Abstract page for arXiv paper 2603.20637: AEGIS: From Clues to Verdicts -- Graph-Guided Deep Vulnerability Reasoning via Dialectics and M...

arXiv - AI · 4 min ·
[2603.20586] MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning
Llms

[2603.20586] MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

Abstract page for arXiv paper 2603.20586: MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning

arXiv - AI · 3 min ·
[2603.20562] Permutation-Consensus Listwise Judging for Robust Factuality Evaluation
Llms

[2603.20562] Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

Abstract page for arXiv paper 2603.20562: Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

arXiv - AI · 3 min ·
[2603.20531] Epistemic Observability in Language Models
Llms

[2603.20531] Epistemic Observability in Language Models

Abstract page for arXiv paper 2603.20531: Epistemic Observability in Language Models

arXiv - Machine Learning · 4 min ·
[2603.20514] Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study
Llms

[2603.20514] Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study

Abstract page for arXiv paper 2603.20514: Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Sett...

arXiv - AI · 3 min ·
[2603.20513] ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Reformulation
Llms

[2603.20513] ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Reformulation

Abstract page for arXiv paper 2603.20513: ReBOL: Retrieval via Bayesian Optimization with Batched LLM Relevance Observations and Query Re...

arXiv - AI · 3 min ·
[2603.20508] Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?
Llms

[2603.20508] Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?

Abstract page for arXiv paper 2603.20508: Measuring Reasoning Trace Legibility: Can Those Who Understand Teach?

arXiv - AI · 4 min ·
[2603.20466] Diffutron: A Masked Diffusion Language Model for Turkish Language
Llms

[2603.20466] Diffutron: A Masked Diffusion Language Model for Turkish Language

Abstract page for arXiv paper 2603.20466: Diffutron: A Masked Diffusion Language Model for Turkish Language

arXiv - AI · 3 min ·
[2603.20450] Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable
Llms

[2603.20450] Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Abstract page for arXiv paper 2603.20450: Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

arXiv - Machine Learning · 4 min ·
[2603.20449] Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents
Llms

[2603.20449] Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

Abstract page for arXiv paper 2603.20449: Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents

arXiv - AI · 4 min ·
[2603.20433] ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning Ability
Llms

[2603.20433] ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning Ability

Abstract page for arXiv paper 2603.20433: ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning ...

arXiv - AI · 3 min ·
[2603.20432] Coding Agents are Effective Long-Context Processors
Llms

[2603.20432] Coding Agents are Effective Long-Context Processors

Abstract page for arXiv paper 2603.20432: Coding Agents are Effective Long-Context Processors

arXiv - AI · 4 min ·
[2603.20406] Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation
Llms

[2603.20406] Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Translation

Abstract page for arXiv paper 2603.20406: Thinking in Different Spaces: Domain-Specific Latent Geometry Survives Cross-Architecture Trans...

arXiv - Machine Learning · 4 min ·
Previous Page 69 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime