Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

[2603.08899] ConFu: Contemplate the Future for Better Speculative Sampling

Abstract page for arXiv paper 2603.08899: ConFu: Contemplate the Future for Better Speculative Sampling

arXiv - Machine Learning · 4 min · 5 minutes ago

Llms

[2602.00052] AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Trial Workflows

Abstract page for arXiv paper 2602.00052: AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Tr...

arXiv - Machine Learning · 4 min · 5 minutes ago

Llms

[2601.07160] AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units

Abstract page for arXiv paper 2601.07160: AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units

arXiv - Machine Learning · 4 min · 5 minutes ago

All Content

Llms

[2601.21895] Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

Abstract page for arXiv paper 2601.21895: Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

Abstract page for arXiv paper 2602.08324: Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings

Abstract page for arXiv paper 2602.05735: CSRv2: Unlocking Ultra-Sparse Embeddings

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.04369] Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis

Abstract page for arXiv paper 2602.04369: Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.02742] Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

Abstract page for arXiv paper 2602.02742: Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.02555] Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

Abstract page for arXiv paper 2602.02555: Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Rein...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.08937] When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being

Abstract page for arXiv paper 2512.08937: When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being

arXiv - AI · 4 min · about 2 months ago

Llms

[2601.20838] Reward Models Inherit Value Biases from Pretraining

Abstract page for arXiv paper 2601.20838: Reward Models Inherit Value Biases from Pretraining

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2601.20088] Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

Abstract page for arXiv paper 2601.20088: Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.03794] AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition

Abstract page for arXiv paper 2512.03794: AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.01822] InnoGym: Benchmarking the Innovation Potential of AI Agents

Abstract page for arXiv paper 2512.01822: InnoGym: Benchmarking the Innovation Potential of AI Agents

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2511.21740] A cross-species neural foundation model for end-to-end speech decoding

Abstract page for arXiv paper 2511.21740: A cross-species neural foundation model for end-to-end speech decoding

arXiv - AI · 4 min · about 2 months ago

Llms

[2511.21722] German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Population-Aligned LLM Studies

Abstract page for arXiv paper 2511.21722: German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Populatio...

arXiv - AI · 4 min · about 2 months ago

Llms

[2601.18753] HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

Abstract page for arXiv paper 2601.18753: HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2511.10985] When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets

Abstract page for arXiv paper 2511.10985: When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets

arXiv - AI · 4 min · about 2 months ago

Llms

[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression

Abstract page for arXiv paper 2601.04786: AgentOCR: Reimagining Agent History via Optical Self-Compression

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2511.08616] Reasoning on Time-Series for Financial Technical Analysis

Abstract page for arXiv paper 2511.08616: Reasoning on Time-Series for Financial Technical Analysis

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling

Abstract page for arXiv paper 2512.17052: Dynamic Tool Dependency Retrieval for Efficient Function Calling

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.11582] Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

Abstract page for arXiv paper 2512.11582: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2512.04695] TRINITY: An Evolved LLM Coordinator

Abstract page for arXiv paper 2512.04695: TRINITY: An Evolved LLM Coordinator

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 216 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

[2603.08899] ConFu: Contemplate the Future for Better Speculative Sampling

[2602.00052] AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Trial Workflows

[2601.07160] AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units

All Content

[2601.21895] Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings

[2602.04369] Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis

[2602.02742] Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

[2602.02555] Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

[2512.08937] When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being

[2601.20838] Reward Models Inherit Value Biases from Pretraining

[2601.20088] Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

[2512.03794] AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition

[2512.01822] InnoGym: Benchmarking the Innovation Potential of AI Agents

[2511.21740] A cross-species neural foundation model for end-to-end speech decoding

[2511.21722] German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Population-Aligned LLM Studies

[2601.18753] HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

[2511.10985] When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets

[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression

[2511.08616] Reasoning on Time-Series for Financial Technical Analysis

[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling

[2512.11582] Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

[2512.04695] TRINITY: An Evolved LLM Coordinator

Related Topics

Stay updated with AI News