Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Asked Google Gemini about Ai Agency

I asked Google Gemini what it would do if it would have agency. I find reply quite interesting: That is a fair critique. The previous lis...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

Could the best LLM be able to generate a symbolic AI that is superior to itself, or is there something superior about matrices vs graphs?

Deep neural network AIs have beaten symbolic AIs across the board on many tasks, but is there a chance that symbolic AIs written by DNNs(...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Llms

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

I uploaded my consciousness paper to Gemini: “Beyond Quantum Microtubules: Consciousness as Substrate-Independent Architecture.” Then I s...

Reddit - Artificial Intelligence · 1 min · about 11 hours ago

All Content

Llms

[2603.01712] FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

Abstract page for arXiv paper 2603.01712: FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.01641] Learning Structured Reasoning via Tractable Trajectory Control

Abstract page for arXiv paper 2603.01641: Learning Structured Reasoning via Tractable Trajectory Control

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01608] Evaluating and Understanding Scheming Propensity in LLM Agents

Abstract page for arXiv paper 2603.01608: Evaluating and Understanding Scheming Propensity in LLM Agents

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01607] CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework

Abstract page for arXiv paper 2603.01607: CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounde...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.01557] Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring

Abstract page for arXiv paper 2603.01557: Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01562] RubricBench: Aligning Model-Generated Rubrics with Human Standards

Abstract page for arXiv paper 2603.01562: RubricBench: Aligning Model-Generated Rubrics with Human Standards

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01548] Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents

Abstract page for arXiv paper 2603.01548: Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01488] LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

Abstract page for arXiv paper 2603.01488: LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01486] Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study

Abstract page for arXiv paper 2603.01486: Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01481] Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents

Abstract page for arXiv paper 2603.01481: Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Indus...

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01464] ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning

Abstract page for arXiv paper 2603.01464: ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained ...

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01437] Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering

Abstract page for arXiv paper 2603.01437: Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01421] SciDER: Scientific Data-centric End-to-end Researcher

Abstract page for arXiv paper 2603.01421: SciDER: Scientific Data-centric End-to-end Researcher

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01416] Securing the Floor and Raising the Ceiling: A Merging-based Paradigm for Multi-modal Search Agents

Abstract page for arXiv paper 2603.01416: Securing the Floor and Raising the Ceiling: A Merging-based Paradigm for Multi-modal Search Agents

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01396] HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts

Abstract page for arXiv paper 2603.01396: HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01410] GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning

Abstract page for arXiv paper 2603.01410: GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Gra...

arXiv - AI · 4 min · 2 months ago

Llms

[2603.01409] MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning

Abstract page for arXiv paper 2603.01409: MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2603.01375] Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

Abstract page for arXiv paper 2603.01375: Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2603.01227] The Lattice Representation Hypothesis of Large Language Models

Abstract page for arXiv paper 2603.01227: The Lattice Representation Hypothesis of Large Language Models

arXiv - AI · 3 min · 2 months ago

Llms

[2603.01209] Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics

Abstract page for arXiv paper 2603.01209: Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics

arXiv - Machine Learning · 4 min · 2 months ago

Previous Page 320 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Asked Google Gemini about Ai Agency

Could the best LLM be able to generate a symbolic AI that is superior to itself, or is there something superior about matrices vs graphs?

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

All Content

[2603.01712] FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

[2603.01641] Learning Structured Reasoning via Tractable Trajectory Control

[2603.01608] Evaluating and Understanding Scheming Propensity in LLM Agents

[2603.01607] CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework

[2603.01557] Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring

[2603.01562] RubricBench: Aligning Model-Generated Rubrics with Human Standards

[2603.01548] Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents

[2603.01488] LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

[2603.01486] Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study

[2603.01481] Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents

[2603.01464] ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning

[2603.01437] Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering

[2603.01421] SciDER: Scientific Data-centric End-to-end Researcher

[2603.01416] Securing the Floor and Raising the Ceiling: A Merging-based Paradigm for Multi-modal Search Agents

[2603.01396] HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts

[2603.01410] GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning

[2603.01409] MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning

[2603.01375] Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

[2603.01227] The Lattice Representation Hypothesis of Large Language Models

[2603.01209] Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics

Related Topics

Stay updated with AI News