Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

Asked Google Gemini about AI Agency

I asked Google Gemini what it would do if it had agency. I find the reply quite interesting: That is a fair critique. The previous lis...

Reddit - Artificial Intelligence · 1 min ·

Could the best LLM be able to generate a symbolic AI that is superior to itself, or is there something superior about matrices vs graphs?

Deep neural network AIs have beaten symbolic AIs across the board on many tasks, but is there a chance that symbolic AIs written by DNNs(...

Reddit - Artificial Intelligence · 1 min ·

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

I uploaded my consciousness paper to Gemini: “Beyond Quantum Microtubules: Consciousness as Substrate-Independent Architecture.” Then I s...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.00113] Position: AI Agents Are Not (Yet) a Panacea for Social Simulation

Abstract page for arXiv paper 2603.00113: Position: AI Agents Are Not (Yet) a Panacea for Social Simulation

arXiv - AI · 3 min ·

[2603.00086] Iterative LLM-based improvement for French Clinical Interview Transcription and Speaker Diarization

Abstract page for arXiv paper 2603.00086: Iterative LLM-based improvement for French Clinical Interview Transcription and Speaker Diariza...

arXiv - AI · 3 min ·

[2603.00076] The Value Sensitivity Gap: How Clinical Large Language Models Respond to Patient Preference Statements in Shared Decision-Making

Abstract page for arXiv paper 2603.00076: The Value Sensitivity Gap: How Clinical Large Language Models Respond to Patient Preference Sta...

arXiv - Machine Learning · 3 min ·

[2603.00077] Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

Abstract page for arXiv paper 2603.00077: Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

arXiv - AI · 4 min ·

[2603.00059] Stochastic Parrots or Singing in Harmony? Testing Five Leading LLMs for their Ability to Replicate a Human Survey with Synthetic Data

Abstract page for arXiv paper 2603.00059: Stochastic Parrots or Singing in Harmony? Testing Five Leading LLMs for their Ability to Replic...

arXiv - AI · 4 min ·

[2603.00055] M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framework for Industrial Anomaly Detection

Abstract page for arXiv paper 2603.00055: M3-AD: Reflection-aware Multi-modal, Multi-category, and Multi-dimensional Benchmark and Framew...

arXiv - Machine Learning · 3 min ·

[2603.00054] Expert Divergence Learning for MoE-based Language Models

Abstract page for arXiv paper 2603.00054: Expert Divergence Learning for MoE-based Language Models

arXiv - Machine Learning · 4 min ·

[2603.00051] LitBench: A Graph-Centric Large Language Model Benchmarking Tool For Literature Tasks

Abstract page for arXiv paper 2603.00051: LitBench: A Graph-Centric Large Language Model Benchmarking Tool For Literature Tasks

arXiv - Machine Learning · 4 min ·

[2603.00048] MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models

Abstract page for arXiv paper 2603.00048: MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models

arXiv - AI · 4 min ·

[2603.00045] Breaking the Factorization Barrier in Diffusion Language Models

Abstract page for arXiv paper 2603.00045: Breaking the Factorization Barrier in Diffusion Language Models

arXiv - Machine Learning · 4 min ·

[2603.00042] Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment

Abstract page for arXiv paper 2603.00042: Maximizing the Spectral Energy Gain in Sub-1-Bit LLMs via Latent Geometry Alignment

arXiv - Machine Learning · 3 min ·

[2603.00039] CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation

Abstract page for arXiv paper 2603.00039: CARE: Confounder-Aware Aggregation for Reliable LLM Evaluation

arXiv - Machine Learning · 3 min ·

[2603.00026] ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

Abstract page for arXiv paper 2603.00026: ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

arXiv - AI · 4 min ·

[2603.00024] Personalization Increases Affective Alignment but Has Role-Dependent Effects on Epistemic Independence in LLMs

Abstract page for arXiv paper 2603.00024: Personalization Increases Affective Alignment but Has Role-Dependent Effects on Epistemic Indep...

arXiv - AI · 4 min ·

[2603.02119] Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning

Abstract page for arXiv paper 2603.02119: Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning

arXiv - Machine Learning · 3 min ·

[2603.02123] Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

Abstract page for arXiv paper 2603.02123: Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

arXiv - AI · 4 min ·

[2603.02070] Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning

Abstract page for arXiv paper 2603.02070: Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations i...

arXiv - AI · 3 min ·

[2603.01822] Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models

Abstract page for arXiv paper 2603.01822: Emerging Human-like Strategies for Semantic Memory Foraging in Large Language Models

arXiv - AI · 4 min ·

[2603.01952] LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations

Abstract page for arXiv paper 2603.01952: LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic ...

arXiv - AI · 3 min ·

[2603.01783] GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

Abstract page for arXiv paper 2603.01783: GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

arXiv - AI · 4 min ·