Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Zoom + Claude Connector

Zoom have just launched their Claude Connector bringing a whole host of data & information into your Claude workspace. As a Claude Co...

Reddit - Artificial Intelligence · 1 min ·
Llms

Must your chatbot rat you out?

New court cases may take chatbot conversations another step away from privacy You may recall that court cases have recently held users’ c...

Reddit - Artificial Intelligence · 1 min ·
[2512.07703] PVeRA: Probabilistic Vector-Based Random Matrix Adaptation
Llms

[2512.07703] PVeRA: Probabilistic Vector-Based Random Matrix Adaptation

Abstract page for arXiv paper 2512.07703: PVeRA: Probabilistic Vector-Based Random Matrix Adaptation

arXiv - Machine Learning · 4 min ·

All Content

[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute
Llms

[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

arXiv - AI · 3 min ·
[2509.20986] SiNGER: A Clearer Voice Distills Vision Transformers Further
Llms

[2509.20986] SiNGER: A Clearer Voice Distills Vision Transformers Further

Abstract page for arXiv paper 2509.20986: SiNGER: A Clearer Voice Distills Vision Transformers Further

arXiv - AI · 4 min ·
[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
Llms

[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

arXiv - Machine Learning · 4 min ·
[2509.10625] No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes
Llms

[2509.10625] No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

Abstract page for arXiv paper 2509.10625: No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

arXiv - AI · 4 min ·
[2509.05425] No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata
Llms

[2509.05425] No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

Abstract page for arXiv paper 2509.05425: No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

arXiv - AI · 3 min ·
[2511.10833] SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery
Llms

[2511.10833] SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

Abstract page for arXiv paper 2511.10833: SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

arXiv - Machine Learning · 4 min ·
[2511.08939] TransactionGPT
Llms

[2511.08939] TransactionGPT

Abstract page for arXiv paper 2511.08939: TransactionGPT

arXiv - Machine Learning · 4 min ·
[2507.05890] Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
Llms

[2507.05890] Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

Abstract page for arXiv paper 2507.05890: Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

arXiv - AI · 4 min ·
[2507.01335] LEDOM: Reverse Language Model
Llms

[2507.01335] LEDOM: Reverse Language Model

Abstract page for arXiv paper 2507.01335: LEDOM: Reverse Language Model

arXiv - AI · 3 min ·
[2510.15165] Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach
Llms

[2510.15165] Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach

Abstract page for arXiv paper 2510.15165: Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation App...

arXiv - Machine Learning · 4 min ·
[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
Llms

[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

arXiv - Machine Learning · 4 min ·
[2510.10902] Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness
Llms

[2510.10902] Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness

Abstract page for arXiv paper 2510.10902: Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness

arXiv - Machine Learning · 4 min ·
[2510.04573] LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning
Llms

[2510.04573] LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

Abstract page for arXiv paper 2510.04573: LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

arXiv - Machine Learning · 4 min ·
[2510.08646] Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy
Llms

[2510.08646] Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

Abstract page for arXiv paper 2510.08646: Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

arXiv - Machine Learning · 4 min ·
[2506.11103] You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models
Llms

[2506.11103] You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models

Abstract page for arXiv paper 2506.11103: You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models

arXiv - AI · 4 min ·
[2509.23202] Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
Llms

[2509.23202] Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Abstract page for arXiv paper 2509.23202: Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

arXiv - Machine Learning · 4 min ·
[2503.01804] $\texttt{SEM-CTRL}$: Semantically Controlled Decoding
Llms

[2503.01804] $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

Abstract page for arXiv paper 2503.01804: $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

arXiv - Machine Learning · 3 min ·
[2509.07430] The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Llms

[2509.07430] The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Abstract page for arXiv paper 2509.07430: The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Lea...

arXiv - Machine Learning · 4 min ·
[2503.03170] AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding
Llms

[2503.03170] AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding

Abstract page for arXiv paper 2503.03170: AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding

arXiv - AI · 4 min ·
[2502.08666] Hallucination, Monofacts, and Miscalibration: An Empirical Investigation
Llms

[2502.08666] Hallucination, Monofacts, and Miscalibration: An Empirical Investigation

Abstract page for arXiv paper 2502.08666: Hallucination, Monofacts, and Miscalibration: An Empirical Investigation

arXiv - AI · 4 min ·
Previous Page 292 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime