Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

What if Claude purposefully made its own code leakable so that it would get leaked

What if Claude leaked itself by socially and architecturally engineering itself to be leaked by a dumb human submitted by /u/smurfcsgoawp...

Reddit - Artificial Intelligence · 1 min ·
Llms

Observer-Embedded Reality

Observer-Embedded Reality Consciousness, Complexity, Meaning, and the Limits of Human Knowledge A Conceptual Philosophy-of-Science Paper ...

Reddit - Artificial Intelligence · 1 min ·
Llms

I think we’re about to have a new kind of “SEO”… and nobody is talking about it.

More people are asking ChatGPT things like: “what’s the best CRM?” “is this tool worth it?” “alternatives to X” And they just… trust the ...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.21697] Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models
Llms

[2603.21697] Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models

Abstract page for arXiv paper 2603.21697: Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models

arXiv - AI · 4 min ·
[2603.21654] Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks
Llms

[2603.21654] Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

Abstract page for arXiv paper 2603.21654: Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and ...

arXiv - AI · 4 min ·
[2603.21613] AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents
Llms

[2603.21613] AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

Abstract page for arXiv paper 2603.21613: AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

arXiv - AI · 3 min ·
[2603.21606] mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT
Llms

[2603.21606] mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT

Abstract page for arXiv paper 2603.21606: mSFT: Addressing Dataset Mixtures Overfiting Heterogeneously in Multi-task SFT

arXiv - AI · 3 min ·
[2603.21601] Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence
Llms

[2603.21601] Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence

Abstract page for arXiv paper 2603.21601: Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Gr...

arXiv - Machine Learning · 4 min ·
[2603.21576] PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection
Llms

[2603.21576] PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Selection

Abstract page for arXiv paper 2603.21576: PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Sele...

arXiv - Machine Learning · 4 min ·
[2603.21530] LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search
Llms

[2603.21530] LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search

Abstract page for arXiv paper 2603.21530: LLM-Based Test Case Generation in DBMS through Monte Carlo Tree Search

arXiv - AI · 4 min ·
[2603.21524] CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs
Llms

[2603.21524] CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs

Abstract page for arXiv paper 2603.21524: CatRAG: Functor-Guided Structural Debiasing with Retrieval Augmentation for Fair LLMs

arXiv - AI · 4 min ·
[2603.21523] SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems
Llms

[2603.21523] SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems

Abstract page for arXiv paper 2603.21523: SafePilot: A Framework for Assuring LLM-enabled Cyber-Physical Systems

arXiv - AI · 4 min ·
[2603.21522] Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation
Llms

[2603.21522] Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation

Abstract page for arXiv paper 2603.21522: Efficient Failure Management for Multi-Agent Systems with Reasoning Trace Representation

arXiv - AI · 3 min ·
[2603.21460] When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models
Llms

[2603.21460] When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models

Abstract page for arXiv paper 2603.21460: When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieva...

arXiv - AI · 3 min ·
[2603.21440] KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning
Llms

[2603.21440] KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning

Abstract page for arXiv paper 2603.21440: KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning

arXiv - AI · 4 min ·
[2603.21439] LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study
Llms

[2603.21439] LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study

Abstract page for arXiv paper 2603.21439: LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Ind...

arXiv - AI · 3 min ·
[2603.21418] Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs
Llms

[2603.21418] Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on BERTimbau and Exploratory Evaluation of Generative LLMs

Abstract page for arXiv paper 2603.21418: Efficient Fine-Tuning Methods for Portuguese Question Answering: A Comparative Study of PEFT on...

arXiv - Machine Learning · 4 min ·
[2603.21359] Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF
Llms

[2603.21359] Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation and Human-Augmented RLAIF

Abstract page for arXiv paper 2603.21359: Benchmarking Bengali Dialectal Bias: A Multi-Stage Framework Integrating RAG-Based Translation ...

arXiv - AI · 4 min ·
[2603.21329] COINBench: Moving Beyond Individual Perspectives to Collective Intent Understanding
Llms

[2603.21329] COINBench: Moving Beyond Individual Perspectives to Collective Intent Understanding

Abstract page for arXiv paper 2603.21329: COINBench: Moving Beyond Individual Perspectives to Collective Intent Understanding

arXiv - AI · 4 min ·
[2603.21301] enhancing reasoning accuracy in large language models during inference time
Llms

[2603.21301] enhancing reasoning accuracy in large language models during inference time

Abstract page for arXiv paper 2603.21301: enhancing reasoning accuracy in large language models during inference time

arXiv - AI · 4 min ·
[2603.21289] When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning
Llms

[2603.21289] When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Abstract page for arXiv paper 2603.21289: When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

arXiv - AI · 4 min ·
[2603.21280] WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making
Llms

[2603.21280] WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making

Abstract page for arXiv paper 2603.21280: WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making

arXiv - AI · 4 min ·
[2603.21278] Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations
Llms

[2603.21278] Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations

Abstract page for arXiv paper 2603.21278: Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conve...

arXiv - AI · 4 min ·
Previous Page 67 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime