Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

People anxious about deviating from what AI tells them to do?

My friend came over yesterday to dye her hair. She had asked ChatGPT for the 'correct' way to do it. Chat told her to dye the ends first,...

Reddit - Artificial Intelligence · 1 min ·
Llms

What if Claude purposefully made its own code leakable so that it would get leaked

What if Claude leaked itself by socially and architecturally engineering itself to be leaked by a dumb human submitted by /u/smurfcsgoawp...

Reddit - Artificial Intelligence · 1 min ·
Llms

Observer-Embedded Reality

Observer-Embedded Reality Consciousness, Complexity, Meaning, and the Limits of Human Knowledge A Conceptual Philosophy-of-Science Paper ...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.21278] Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations
Llms

[2603.21278] Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conversations

Abstract page for arXiv paper 2603.21278: Conversation Tree Architecture: A Structured Framework for Context-Aware Multi-Branch LLM Conve...

arXiv - AI · 4 min ·
[2603.21276] Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity
Llms

[2603.21276] Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity

Abstract page for arXiv paper 2603.21276: Aggregation Alignment for Federated Learning with Mixture-of-Experts under Data Heterogeneity

arXiv - Machine Learning · 4 min ·
[2603.21232] QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression
Llms

[2603.21232] QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression

Abstract page for arXiv paper 2603.21232: QMoP: Query Guided Mixture-of-Projector for Efficient Visual Token Compression

arXiv - AI · 4 min ·
[2603.21178] LLM-based Automated Architecture View Generation: Where Are We Now?
Llms

[2603.21178] LLM-based Automated Architecture View Generation: Where Are We Now?

Abstract page for arXiv paper 2603.21178: LLM-based Automated Architecture View Generation: Where Are We Now?

arXiv - AI · 4 min ·
[2603.21177] Prompt replay: speeding up grpo with on-policy reuse of high-signal prompts
Llms

[2603.21177] Prompt replay: speeding up grpo with on-policy reuse of high-signal prompts

Abstract page for arXiv paper 2603.21177: Prompt replay: speeding up grpo with on-policy reuse of high-signal prompts

arXiv - Machine Learning · 4 min ·
[2603.21175] Reward Sharpness-Aware Fine-Tuning for Diffusion Models
Llms

[2603.21175] Reward Sharpness-Aware Fine-Tuning for Diffusion Models

Abstract page for arXiv paper 2603.21175: Reward Sharpness-Aware Fine-Tuning for Diffusion Models

arXiv - Machine Learning · 3 min ·
[2603.21149] Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains
Llms

[2603.21149] Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains

Abstract page for arXiv paper 2603.21149: Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based...

arXiv - AI · 3 min ·
[2603.21016] Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO
Llms

[2603.21016] Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO

Abstract page for arXiv paper 2603.21016: Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO

arXiv - Machine Learning · 3 min ·
[2603.21011] ALL-FEM: Agentic Large Language models Fine-tuned for Finite Element Methods
Llms

[2603.21011] ALL-FEM: Agentic Large Language models Fine-tuned for Finite Element Methods

Abstract page for arXiv paper 2603.21011: ALL-FEM: Agentic Large Language models Fine-tuned for Finite Element Methods

arXiv - Machine Learning · 4 min ·
[2603.21006] How AI Systems Think About Education: Analyzing Latent Preference Patterns in Large Language Models
Llms

[2603.21006] How AI Systems Think About Education: Analyzing Latent Preference Patterns in Large Language Models

Abstract page for arXiv paper 2603.21006: How AI Systems Think About Education: Analyzing Latent Preference Patterns in Large Language Mo...

arXiv - AI · 3 min ·
[2603.20991] Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds
Llms

[2603.20991] Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds

Abstract page for arXiv paper 2603.20991: Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and F...

arXiv - Machine Learning · 4 min ·
[2603.20976] Detection of adversarial intent in Human-AI teams using LLMs
Llms

[2603.20976] Detection of adversarial intent in Human-AI teams using LLMs

Abstract page for arXiv paper 2603.20976: Detection of adversarial intent in Human-AI teams using LLMs

arXiv - Machine Learning · 4 min ·
[2603.20965] Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification
Llms

[2603.20965] Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification

Abstract page for arXiv paper 2603.20965: Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification

arXiv - AI · 4 min ·
[2603.20957] Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models
Llms

[2603.20957] Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Language Models

Abstract page for arXiv paper 2603.20957: Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Lan...

arXiv - AI · 4 min ·
[2603.20939] User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction
Llms

[2603.20939] User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction

Abstract page for arXiv paper 2603.20939: User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented I...

arXiv - AI · 4 min ·
[2603.20933] AC4A: Access Control for Agents
Llms

[2603.20933] AC4A: Access Control for Agents

Abstract page for arXiv paper 2603.20933: AC4A: Access Control for Agents

arXiv - AI · 4 min ·
[2603.20899] Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach
Llms

[2603.20899] Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach

Abstract page for arXiv paper 2603.20899: Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach

arXiv - AI · 3 min ·
[2603.20882] RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation
Llms

[2603.20882] RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation

Abstract page for arXiv paper 2603.20882: RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for...

arXiv - Machine Learning · 4 min ·
[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch
Llms

[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

Abstract page for arXiv paper 2603.20854: SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

arXiv - AI · 3 min ·
[2603.20851] Can ChatGPT Really Understand Modern Chinese Poetry?
Llms

[2603.20851] Can ChatGPT Really Understand Modern Chinese Poetry?

Abstract page for arXiv paper 2603.20851: Can ChatGPT Really Understand Modern Chinese Poetry?

arXiv - AI · 3 min ·
Previous Page 68 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime