Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

What if Claude purposefully made its own code leakable so that it would get leaked

What if Claude leaked itself by socially and architecturally engineering itself to be leaked by a dumb human submitted by /u/smurfcsgoawp...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

LLMs

Observer-Embedded Reality

Observer-Embedded Reality Consciousness, Complexity, Meaning, and the Limits of Human Knowledge A Conceptual Philosophy-of-Science Paper ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

LLMs

I think we’re about to have a new kind of “SEO”… and nobody is talking about it.

More people are asking ChatGPT things like: “what’s the best CRM?” “is this tool worth it?” “alternatives to X” And they just… trust the ...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

All Content

LLMs

[2603.21697] Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21654] Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21613] AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21606] mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT

arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21601] Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence

arXiv - Machine Learning · 4 min · 11 days ago

LLMs

[2603.21576] PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection

arXiv - Machine Learning · 4 min · 11 days ago

LLMs

[2603.21530] LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21524] CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21523] SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21522] Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation

arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21460] When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models

arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21440] KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21439] LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study

arXiv - AI · 3 min · 11 days ago

LLMs

[2603.21418] Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs

arXiv - Machine Learning · 4 min · 11 days ago

LLMs

[2603.21359] Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21329] COINBench: Moving Beyond Individual Perspectives to Collective Intent Understanding

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21301] Enhancing Reasoning Accuracy in Large Language Models During Inference Time

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21289] When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21280] WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making

arXiv - AI · 4 min · 11 days ago

LLMs

[2603.21278] Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations

arXiv - AI · 4 min · 11 days ago

