Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Mira Murati’s deposition pulled back the curtain on Sam Altman’s ouster | The Verge

Thanks to Musk v. Altman, the public is getting a concrete look at details of Sam Altman’s ouster from OpenAI, much of it centered on for...

The Verge - AI · 11 min · about 1 hour ago

Llms

Diffusion for generating/editing ASTs? [D]

I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitati...

Reddit - Machine Learning · 1 min · about 2 hours ago

Llms

ChatGPT’s ‘Trusted Contact’ will alert loved ones of safety concerns | The Verge

OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assign an emergency contact for mental health and s...

The Verge - AI · 4 min · about 2 hours ago

All Content

Llms

[2509.24385] Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

Abstract page for arXiv paper 2509.24385: Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

arXiv - AI · 4 min · 2 months ago

Llms

[2509.24282] SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents

Abstract page for arXiv paper 2509.24282: SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents

arXiv - AI · 4 min · 2 months ago

Llms

[2509.24245] Prompt and Parameter Co-Optimization for Large Language Models

Abstract page for arXiv paper 2509.24245: Prompt and Parameter Co-Optimization for Large Language Models

arXiv - AI · 4 min · 2 months ago

Llms

[2509.24203] Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

Abstract page for arXiv paper 2509.24203: Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRP...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.21029] FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction

Abstract page for arXiv paper 2509.21029: FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.23383] Train Once, Answer All: Many Pretraining Experiments for the Cost of One

Abstract page for arXiv paper 2509.23383: Train Once, Answer All: Many Pretraining Experiments for the Cost of One

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.22611] Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

Abstract page for arXiv paper 2509.22611: Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.22299] HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space

Abstract page for arXiv paper 2509.22299: HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.22134] Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

Abstract page for arXiv paper 2509.22134: Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

arXiv - AI · 4 min · 2 months ago

Llms

[2508.07697] Semantic-Enhanced Time-Series Forecasting via Large Language Models

Abstract page for arXiv paper 2508.07697: Semantic-Enhanced Time-Series Forecasting via Large Language Models

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2508.07638] Data Selection for LLM Alignment Using Fine-Grained Preferences

Abstract page for arXiv paper 2508.07638: Data Selection for LLM Alignment Using Fine-Grained Preferences

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2508.04097] Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

Abstract page for arXiv paper 2508.04097: Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2508.04865] Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

Abstract page for arXiv paper 2508.04865: Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Lear...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.15888] Distribution-Aligned Decoding for Efficient LLM Task Adaptation

Abstract page for arXiv paper 2509.15888: Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv - AI · 4 min · 2 months ago

Llms

[2507.18553] The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Abstract page for arXiv paper 2507.18553: The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2507.06567] SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

Abstract page for arXiv paper 2507.06567: SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.05608] BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints

Abstract page for arXiv paper 2509.05608: BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Finger...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2509.04784] Post-training Large Language Models for Diverse High-Quality Responses

Abstract page for arXiv paper 2509.04784: Post-training Large Language Models for Diverse High-Quality Responses

arXiv - AI · 3 min · 2 months ago

Llms

[2508.18672] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Abstract page for arXiv paper 2508.18672: Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.20746] Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

Abstract page for arXiv paper 2506.20746: Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

arXiv - Machine Learning · 4 min · 2 months ago

Previous Page 339 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Mira Murati’s deposition pulled back the curtain on Sam Altman’s ouster | The Verge

Diffusion for generating/editing ASTs? [D]

ChatGPT’s ‘Trusted Contact’ will alert loved ones of safety concerns | The Verge

All Content

[2509.24385] Vid-LLM: A Compact Video-based 3D Multimodal LLM with Reconstruction-Reasoning Synergy

[2509.24282] SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents

[2509.24245] Prompt and Parameter Co-Optimization for Large Language Models

[2509.24203] Group-Relative REINFORCE Is Secretly an Off-Policy Algorithm: Demystifying Some Myths About GRPO and Its Friends

[2509.21029] FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction

[2509.23383] Train Once, Answer All: Many Pretraining Experiments for the Cost of One

[2509.22611] Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

[2509.22299] HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space

[2509.22134] Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

[2508.07697] Semantic-Enhanced Time-Series Forecasting via Large Language Models

[2508.07638] Data Selection for LLM Alignment Using Fine-Grained Preferences

[2508.04097] Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

[2508.04865] Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

[2509.15888] Distribution-Aligned Decoding for Efficient LLM Task Adaptation

[2507.18553] The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

[2507.06567] SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

[2509.05608] BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints

[2509.04784] Post-training Large Language Models for Diverse High-Quality Responses

[2508.18672] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

[2506.20746] Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

Related Topics

Stay updated with AI News