Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

8 free AI courses from Anthropic’s Claude platform with certificates

AI News - General ·
Llms

How is mythos mythos ? [D]

Hello, I’ve been seeing discussions about “Mythos AI” showing behaviors that seem far beyond simple text prediction—like accessing inform...

Reddit - Machine Learning · 1 min ·
Llms

Claude developer hosts Christian leaders for AI summit

AI Tools & Products ·

All Content

[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models
Llms

[2603.01454] VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

Abstract page for arXiv paper 2603.01454: VidDoS: Universal Denial-of-Service Attack on Video-based Large Language Models

arXiv - AI · 3 min ·
[2603.01438] Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents
Llms

[2603.01438] Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing Agents

Abstract page for arXiv paper 2603.01438: Enhancing Persona Following at Decoding Time via Dynamic Importance Estimation for Role-Playing...

arXiv - AI · 4 min ·
[2603.01385] Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning
Llms

[2603.01385] Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

Abstract page for arXiv paper 2603.01385: Toward Graph-Tokenizing Large Language Models with Reconstructive Graph Instruction Tuning

arXiv - AI · 4 min ·
[2603.01780] D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation
Llms

[2603.01780] D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

Abstract page for arXiv paper 2603.01780: D3LM: A Discrete DNA Diffusion Language Model for Bidirectional DNA Understanding and Generation

arXiv - Machine Learning · 4 min ·
[2603.01761] Modular Memory is the Key to Continual Learning Agents
Llms

[2603.01761] Modular Memory is the Key to Continual Learning Agents

Abstract page for arXiv paper 2603.01761: Modular Memory is the Key to Continual Learning Agents

arXiv - Machine Learning · 4 min ·
[2603.01759] Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning
Llms

[2603.01759] Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

Abstract page for arXiv paper 2603.01759: Meta-Learning Hyperparameters for Parameter Efficient Fine-Tuning

arXiv - Machine Learning · 4 min ·
[2603.01343] PanCanBench: A Comprehensive Benchmark for Evaluating Large Language Models in Pancreatic Oncology
Llms

[2603.01343] PanCanBench: A Comprehensive Benchmark for Evaluating Large Language Models in Pancreatic Oncology

Abstract page for arXiv paper 2603.01343: PanCanBench: A Comprehensive Benchmark for Evaluating Large Language Models in Pancreatic Oncology

arXiv - AI · 4 min ·
[2603.01752] Causal Circuit Tracing Reveals Distinct Computational Architectures in Single-Cell Foundation Models: Inhibitory Dominance, Biological Coherence, and Cross-Model Convergence
Llms

[2603.01752] Causal Circuit Tracing Reveals Distinct Computational Architectures in Single-Cell Foundation Models: Inhibitory Dominance, Biological Coherence, and Cross-Model Convergence

Abstract page for arXiv paper 2603.01752: Causal Circuit Tracing Reveals Distinct Computational Architectures in Single-Cell Foundation M...

arXiv - Machine Learning · 3 min ·
[2603.01331] MetaState: Persistent Working Memory for Discrete Diffusion Language Models
Llms

[2603.01331] MetaState: Persistent Working Memory for Discrete Diffusion Language Models

Abstract page for arXiv paper 2603.01331: MetaState: Persistent Working Memory for Discrete Diffusion Language Models

arXiv - Machine Learning · 4 min ·
[2603.01692] Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search
Llms

[2603.01692] Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search

Abstract page for arXiv paper 2603.01692: Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search

arXiv - Machine Learning · 4 min ·
[2603.01254] LLM Self-Explanations Fail Semantic Invariance
Llms

[2603.01254] LLM Self-Explanations Fail Semantic Invariance

Abstract page for arXiv paper 2603.01254: LLM Self-Explanations Fail Semantic Invariance

arXiv - AI · 3 min ·
[2603.01252] Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation
Llms

[2603.01252] Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation

Abstract page for arXiv paper 2603.01252: Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation

arXiv - AI · 3 min ·
[2603.01589] SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond
Llms

[2603.01589] SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Abstract page for arXiv paper 2603.01589: SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

arXiv - AI · 4 min ·
[2603.01246] Defensive Refusal Bias: How Safety Alignment Fails Cyber Defenders
Llms

[2603.01246] Defensive Refusal Bias: How Safety Alignment Fails Cyber Defenders

Abstract page for arXiv paper 2603.01246: Defensive Refusal Bias: How Safety Alignment Fails Cyber Defenders

arXiv - AI · 4 min ·
[2603.01239] Self-Anchoring Calibration Drift in Large Language Models: How Multi-Turn Conversations Reshape Model Confidence
Llms

[2603.01239] Self-Anchoring Calibration Drift in Large Language Models: How Multi-Turn Conversations Reshape Model Confidence

Abstract page for arXiv paper 2603.01239: Self-Anchoring Calibration Drift in Large Language Models: How Multi-Turn Conversations Reshape...

arXiv - AI · 4 min ·
[2603.01563] LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models
Llms

[2603.01563] LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

Abstract page for arXiv paper 2603.01563: LFPO: Likelihood-Free Policy Optimization for Masked Diffusion Models

arXiv - Machine Learning · 4 min ·
[2603.01224] Monocular 3D Object Position Estimation with VLMs for Human-Robot Interaction
Llms

[2603.01224] Monocular 3D Object Position Estimation with VLMs for Human-Robot Interaction

Abstract page for arXiv paper 2603.01224: Monocular 3D Object Position Estimation with VLMs for Human-Robot Interaction

arXiv - Machine Learning · 3 min ·
[2603.01501] GAC: Stabilizing Asynchronous RL Training for LLMs via Gradient Alignment Control
Llms

[2603.01501] GAC: Stabilizing Asynchronous RL Training for LLMs via Gradient Alignment Control

Abstract page for arXiv paper 2603.01501: GAC: Stabilizing Asynchronous RL Training for LLMs via Gradient Alignment Control

arXiv - Machine Learning · 3 min ·
[2603.01185] Token-level Data Selection for Safe LLM Fine-tuning
Llms

[2603.01185] Token-level Data Selection for Safe LLM Fine-tuning

Abstract page for arXiv paper 2603.01185: Token-level Data Selection for Safe LLM Fine-tuning

arXiv - AI · 3 min ·
[2603.01170] ATLAS: AI-Assisted Threat-to-Assertion Learning for System-on-Chip Security Verification
Llms

[2603.01170] ATLAS: AI-Assisted Threat-to-Assertion Learning for System-on-Chip Security Verification

Abstract page for arXiv paper 2603.01170: ATLAS: AI-Assisted Threat-to-Assertion Learning for System-on-Chip Security Verification

arXiv - AI · 3 min ·
Previous Page 168 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime