Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Asked Google Gemini about Ai Agency

I asked Google Gemini what it would do if it would have agency. I find reply quite interesting: That is a fair critique. The previous lis...

Reddit - Artificial Intelligence · 1 min · 5 minutes ago

Llms

Could the best LLM be able to generate a symbolic AI that is superior to itself, or is there something superior about matrices vs graphs?

Deep neural network AIs have beaten symbolic AIs across the board on many tasks, but is there a chance that symbolic AIs written by DNNs(...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

I uploaded my consciousness paper to Gemini: “Beyond Quantum Microtubules: Consciousness as Substrate-Independent Architecture.” Then I s...

Reddit - Artificial Intelligence · 1 min · about 9 hours ago

All Content

Llms

[2603.00113] Position: AI Agents Are Not (Yet) a Panacea for Social Simulation

Abstract page for arXiv paper 2603.00113: Position: AI Agents Are Not (Yet) a Panacea for Social Simulation

arXiv - AI · 3 min · 2 months ago

Llms

[2603.00086] Iterative LLM-based improvement for French Clinical Interview Transcription and Speaker Diarization

Abstract page for arXiv paper 2603.00086: Iterative LLM-based improvement for French Clinical Interview Transcription and Speaker Diariza...

arXiv - AI · 3 min · 2 months ago

Llms

[2603.00076] The Value Sensitivity Gap: How Clinical Large Language Models Respond to Patient Preference Statements in Shared Decision-Making

Abstract page for arXiv paper 2603.00076: The Value Sensitivity Gap: How Clinical Large Language Models Respond to Patient Preference Sta...

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.00077] Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

Abstract page for arXiv paper 2603.00077: Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

arXiv - AI · 4 min · 2 months ago

Llms

[2603.00059] Stochastic Parrots or Singing in Harmony? Testing Five Leading LLMs for their Ability to Replicate a Human Survey with Synthetic Data

Abstract page for arXiv paper 2603.00059: Stochastic Parrots or Singing in Harmony? Testing Five Leading LLMs for their Ability to Replic...

arXiv - AI · 4 min · 2 months ago

Llms

[2603.00055] M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

Abstract page for arXiv paper 2603.00055: M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framew...

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.00054] Expert Divergence Learning for MoE-based Language Models

Abstract page for arXiv paper 2603.00054: Expert Divergence Learning for MoE-based Language Models

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.00051] LitBench: A Graph-Centric Large Language Model Benchmarking Tool For Literature Tasks

Abstract page for arXiv paper 2603.00051: LitBench: A Graph-Centric Large Language Model Benchmarking Tool For Literature Tasks

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.00048] MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models

Abstract page for arXiv paper 2603.00048: MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models

arXiv - AI · 4 min · 2 months ago

Llms

[2603.00045] Breaking the Factorization Barrier in Diffusion Language Models

Abstract page for arXiv paper 2603.00045: Breaking the Factorization Barrier in Diffusion Language Models

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.00042] Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment

Abstract page for arXiv paper 2603.00042: Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.00039] CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation

Abstract page for arXiv paper 2603.00039: CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.00026] ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

Abstract page for arXiv paper 2603.00026: ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

arXiv - AI · 4 min · 2 months ago

Llms

[2603.00024] Personalization Increases Affective Alignment but Has Role-Dependent Effects on Epistemic Independence in LLMs

Abstract page for arXiv paper 2603.00024: Personalization Increases Affective Alignment but Has Role-Dependent Effects on Epistemic Indep...

arXiv - AI · 4 min · 2 months ago

Llms

[2603.02119] Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning

Abstract page for arXiv paper 2603.02119: Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.02123] Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

Abstract page for arXiv paper 2603.02123: Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

arXiv - AI · 4 min · 2 months ago

Llms

[2603.02070] Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning

Abstract page for arXiv paper 2603.02070: Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations i...

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01822] Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models

Abstract page for arXiv paper 2603.01822: Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01952] LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations

Abstract page for arXiv paper 2603.01952: LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic ...

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01783] GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Abstract page for arXiv paper 2603.01783: GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

arXiv - AI · 4 min · 2 months ago

Previous Page 319 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Asked Google Gemini about Ai Agency

Could the best LLM be able to generate a symbolic AI that is superior to itself, or is there something superior about matrices vs graphs?

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

All Content

[2603.00113] Position: AI Agents Are Not (Yet) a Panacea for Social Simulation

[2603.00086] Iterative LLM-based improvement for French Clinical Interview Transcription and Speaker Diarization

[2603.00076] The Value Sensitivity Gap: How Clinical Large Language Models Respond to Patient Preference Statements in Shared Decision-Making

[2603.00077] Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

[2603.00059] Stochastic Parrots or Singing in Harmony? Testing Five Leading LLMs for their Ability to Replicate a Human Survey with Synthetic Data

[2603.00055] M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

[2603.00054] Expert Divergence Learning for MoE-based Language Models

[2603.00051] LitBench: A Graph-Centric Large Language Model Benchmarking Tool For Literature Tasks

[2603.00048] MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models

[2603.00045] Breaking the Factorization Barrier in Diffusion Language Models

[2603.00042] Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment

[2603.00039] CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation

[2603.00026] ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

[2603.00024] Personalization Increases Affective Alignment but Has Role-Dependent Effects on Epistemic Independence in LLMs

[2603.02119] Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning

[2603.02123] Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

[2603.02070] Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning

[2603.01822] Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models

[2603.01952] LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations

[2603.01783] GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Related Topics

Stay updated with AI News