Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Asked Google Gemini about Ai Agency

I asked Google Gemini what it would do if it would have agency. I find reply quite interesting: That is a fair critique. The previous lis...

Reddit - Artificial Intelligence · 1 min ·
Llms

Could the best LLM be able to generate a symbolic AI that is superior to itself, or is there something superior about matrices vs graphs?

Deep neural network AIs have beaten symbolic AIs across the board on many tasks, but is there a chance that symbolic AIs written by DNNs(...

Reddit - Artificial Intelligence · 1 min ·
Llms

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

I uploaded my consciousness paper to Gemini: “Beyond Quantum Microtubules: Consciousness as Substrate-Independent Architecture.” Then I s...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.01712] FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents
Llms

[2603.01712] FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

Abstract page for arXiv paper 2603.01712: FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

arXiv - Machine Learning · 4 min ·
[2603.01641] Learning Structured Reasoning via Tractable Trajectory Control
Llms

[2603.01641] Learning Structured Reasoning via Tractable Trajectory Control

Abstract page for arXiv paper 2603.01641: Learning Structured Reasoning via Tractable Trajectory Control

arXiv - AI · 3 min ·
[2603.01608] Evaluating and Understanding Scheming Propensity in LLM Agents
Llms

[2603.01608] Evaluating and Understanding Scheming Propensity in LLM Agents

Abstract page for arXiv paper 2603.01608: Evaluating and Understanding Scheming Propensity in LLM Agents

arXiv - AI · 4 min ·
[2603.01607] CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework
Llms

[2603.01607] CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework

Abstract page for arXiv paper 2603.01607: CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounde...

arXiv - Machine Learning · 4 min ·
[2603.01557] Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring
Llms

[2603.01557] Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring

Abstract page for arXiv paper 2603.01557: Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring

arXiv - AI · 4 min ·
[2603.01562] RubricBench: Aligning Model-Generated Rubrics with Human Standards
Llms

[2603.01562] RubricBench: Aligning Model-Generated Rubrics with Human Standards

Abstract page for arXiv paper 2603.01562: RubricBench: Aligning Model-Generated Rubrics with Human Standards

arXiv - AI · 3 min ·
[2603.01548] Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents
Llms

[2603.01548] Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents

Abstract page for arXiv paper 2603.01548: Graph-Based Self-Healing Tool Routing for Cost-Efficient LLM Agents

arXiv - AI · 4 min ·
[2603.01488] LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning
Llms

[2603.01488] LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

Abstract page for arXiv paper 2603.01488: LLM-assisted Semantic Option Discovery for Facilitating Adaptive Deep Reinforcement Learning

arXiv - AI · 3 min ·
[2603.01486] Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study
Llms

[2603.01486] Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study

Abstract page for arXiv paper 2603.01486: Agentic Multi-Source Grounding for Enhanced Query Intent Understanding: A DoorDash Case Study

arXiv - AI · 4 min ·
[2603.01481] Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents
Llms

[2603.01481] Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales Agents

Abstract page for arXiv paper 2603.01481: Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Indus...

arXiv - AI · 3 min ·
[2603.01464] ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning
Llms

[2603.01464] ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained via Reinforcement Learning

Abstract page for arXiv paper 2603.01464: ProtRLSearch: A Multi-Round Multimodal Protein Search Agent with Large Language Models Trained ...

arXiv - AI · 4 min ·
[2603.01437] Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering
Llms

[2603.01437] Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering

Abstract page for arXiv paper 2603.01437: Decoding Answers Before Chain-of-Thought: Evidence from Pre-CoT Probes and Activation Steering

arXiv - AI · 4 min ·
[2603.01421] SciDER: Scientific Data-centric End-to-end Researcher
Llms

[2603.01421] SciDER: Scientific Data-centric End-to-end Researcher

Abstract page for arXiv paper 2603.01421: SciDER: Scientific Data-centric End-to-end Researcher

arXiv - AI · 3 min ·
[2603.01416] Securing the Floor and Raising the Ceiling: A Merging-based Paradigm for Multi-modal Search Agents
Llms

[2603.01416] Securing the Floor and Raising the Ceiling: A Merging-based Paradigm for Multi-modal Search Agents

Abstract page for arXiv paper 2603.01416: Securing the Floor and Raising the Ceiling: A Merging-based Paradigm for Multi-modal Search Agents

arXiv - AI · 4 min ·
[2603.01396] HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts
Llms

[2603.01396] HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts

Abstract page for arXiv paper 2603.01396: HarmonyCell: Automating Single-Cell Perturbation Modeling under Semantic and Distribution Shifts

arXiv - AI · 3 min ·
[2603.01410] GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning
Llms

[2603.01410] GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Graph Reasoning

Abstract page for arXiv paper 2603.01410: GraphScout: Empowering Large Language Models with Intrinsic Exploration Ability for Agentic Gra...

arXiv - AI · 4 min ·
[2603.01409] MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning
Llms

[2603.01409] MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning

Abstract page for arXiv paper 2603.01409: MIST-RL: Mutation-based Incremental Suite Testing via Reinforcement Learning

arXiv - Machine Learning · 4 min ·
[2603.01375] Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation
Llms

[2603.01375] Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

Abstract page for arXiv paper 2603.01375: Words & Weights: Streamlining Multi-Turn Interactions via Co-Adaptation

arXiv - Machine Learning · 3 min ·
[2603.01227] The Lattice Representation Hypothesis of Large Language Models
Llms

[2603.01227] The Lattice Representation Hypothesis of Large Language Models

Abstract page for arXiv paper 2603.01227: The Lattice Representation Hypothesis of Large Language Models

arXiv - AI · 3 min ·
[2603.01209] Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics
Llms

[2603.01209] Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics

Abstract page for arXiv paper 2603.01209: Agents Learn Their Runtime: Interpreter Persistence as Training-Time Semantics

arXiv - Machine Learning · 4 min ·
Previous Page 320 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime