Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

[2603.08899] ConFu: Contemplate the Future for Better Speculative Sampling
Llms

[2603.08899] ConFu: Contemplate the Future for Better Speculative Sampling

Abstract page for arXiv paper 2603.08899: ConFu: Contemplate the Future for Better Speculative Sampling

arXiv - Machine Learning · 4 min ·
[2602.00052] AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Trial Workflows
Llms

[2602.00052] AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Trial Workflows

Abstract page for arXiv paper 2602.00052: AI-assisted Protocol Information Extraction For Improved Accuracy and Efficiency in Clinical Tr...

arXiv - Machine Learning · 4 min ·
[2601.07160] AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units
Llms

[2601.07160] AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units

Abstract page for arXiv paper 2601.07160: AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units

arXiv - Machine Learning · 4 min ·

All Content

[2601.21895] Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
Llms

[2601.21895] Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

Abstract page for arXiv paper 2601.21895: Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

arXiv - AI · 4 min ·
[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
Llms

[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

Abstract page for arXiv paper 2602.08324: Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

arXiv - Machine Learning · 4 min ·
[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings
Llms

[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings

Abstract page for arXiv paper 2602.05735: CSRv2: Unlocking Ultra-Sparse Embeddings

arXiv - Machine Learning · 4 min ·
[2602.04369] Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis
Llms

[2602.04369] Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis

Abstract page for arXiv paper 2602.04369: Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis

arXiv - Machine Learning · 4 min ·
[2602.02742] Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding
Llms

[2602.02742] Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

Abstract page for arXiv paper 2602.02742: Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

arXiv - Machine Learning · 3 min ·
[2602.02555] Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards
Llms

[2602.02555] Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

Abstract page for arXiv paper 2602.02555: Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Rein...

arXiv - Machine Learning · 4 min ·
[2512.08937] When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being
Llms

[2512.08937] When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being

Abstract page for arXiv paper 2512.08937: When AI Gives Advice: Evaluating AI and Human Responses to Online Advice-Seeking for Well-Being

arXiv - AI · 4 min ·
[2601.20838] Reward Models Inherit Value Biases from Pretraining
Llms

[2601.20838] Reward Models Inherit Value Biases from Pretraining

Abstract page for arXiv paper 2601.20838: Reward Models Inherit Value Biases from Pretraining

arXiv - Machine Learning · 4 min ·
[2601.20088] Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery
Llms

[2601.20088] Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

Abstract page for arXiv paper 2601.20088: Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery

arXiv - Machine Learning · 4 min ·
[2512.03794] AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition
Llms

[2512.03794] AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition

Abstract page for arXiv paper 2512.03794: AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition

arXiv - Machine Learning · 4 min ·
[2512.01822] InnoGym: Benchmarking the Innovation Potential of AI Agents
Llms

[2512.01822] InnoGym: Benchmarking the Innovation Potential of AI Agents

Abstract page for arXiv paper 2512.01822: InnoGym: Benchmarking the Innovation Potential of AI Agents

arXiv - Machine Learning · 4 min ·
[2511.21740] A cross-species neural foundation model for end-to-end speech decoding
Llms

[2511.21740] A cross-species neural foundation model for end-to-end speech decoding

Abstract page for arXiv paper 2511.21740: A cross-species neural foundation model for end-to-end speech decoding

arXiv - AI · 4 min ·
[2511.21722] German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Population-Aligned LLM Studies
Llms

[2511.21722] German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Population-Aligned LLM Studies

Abstract page for arXiv paper 2511.21722: German General Social Survey Personas: A Survey-Derived Persona Prompt Collection for Populatio...

arXiv - AI · 4 min ·
[2601.18753] HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs
Llms

[2601.18753] HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

Abstract page for arXiv paper 2601.18753: HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs

arXiv - Machine Learning · 4 min ·
[2511.10985] When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets
Llms

[2511.10985] When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets

Abstract page for arXiv paper 2511.10985: When Data is the Algorithm: A Systematic Study and Curation of Preference Optimization Datasets

arXiv - AI · 4 min ·
[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression
Llms

[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression

Abstract page for arXiv paper 2601.04786: AgentOCR: Reimagining Agent History via Optical Self-Compression

arXiv - Machine Learning · 4 min ·
[2511.08616] Reasoning on Time-Series for Financial Technical Analysis
Llms

[2511.08616] Reasoning on Time-Series for Financial Technical Analysis

Abstract page for arXiv paper 2511.08616: Reasoning on Time-Series for Financial Technical Analysis

arXiv - Machine Learning · 4 min ·
[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling
Llms

[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling

Abstract page for arXiv paper 2512.17052: Dynamic Tool Dependency Retrieval for Efficient Function Calling

arXiv - Machine Learning · 4 min ·
[2512.11582] Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model
Llms

[2512.11582] Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

Abstract page for arXiv paper 2512.11582: Brain-Semantoks: Learning Semantic Tokens of Brain Dynamics with a Self-Distilled Foundation Model

arXiv - Machine Learning · 4 min ·
[2512.04695] TRINITY: An Evolved LLM Coordinator
Llms

[2512.04695] TRINITY: An Evolved LLM Coordinator

Abstract page for arXiv paper 2512.04695: TRINITY: An Evolved LLM Coordinator

arXiv - Machine Learning · 4 min ·
Previous Page 216 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime