Large Language Models

GPT, Claude, Gemini, and other LLMs


Top This Week

LLMs

I Cut Claude API Costs by 50% Using This Self Modifying Agentic System

I've been developing a self-modifying AI agent system that effectively cuts my Claude API usage in half. Claude thinks and then I basical...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

LLMs

Sentient OS: a custom on-device vision LLM that understands your entire digital life (every screenshot, note, file, email...), while your device charges overnight. Talk to your data, get proactive reminders, and explore knowledge graphs!

99% of "AI" apps are just GPT wrappers that pipe your data to cloud LLMs and call it a product. No one's ever created an intelligence lay...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

LLMs

What to build while we still have access to cheap AI?

AI companies are subsidizing access the same way Uber subsidized rides and AWS subsidized compute in the early days - burning cash to gra...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

LLMs

[2509.21029] FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction

Abstract page for arXiv paper 2509.21029: FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.23383] Train Once, Answer All: Many Pretraining Experiments for the Cost of One

Abstract page for arXiv paper 2509.23383: Train Once, Answer All: Many Pretraining Experiments for the Cost of One

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.22611] Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

Abstract page for arXiv paper 2509.22611: Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.22299] HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space

Abstract page for arXiv paper 2509.22299: HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.22134] Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

Abstract page for arXiv paper 2509.22134: Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

arXiv - AI · 4 min · about 2 months ago

LLMs

[2508.07697] Semantic-Enhanced Time-Series Forecasting via Large Language Models

Abstract page for arXiv paper 2508.07697: Semantic-Enhanced Time-Series Forecasting via Large Language Models

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2508.07638] Data Selection for LLM Alignment Using Fine-Grained Preferences

Abstract page for arXiv paper 2508.07638: Data Selection for LLM Alignment Using Fine-Grained Preferences

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2508.04097] Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

Abstract page for arXiv paper 2508.04097: Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2508.04865] Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

Abstract page for arXiv paper 2508.04865: Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Lear...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.15888] Distribution-Aligned Decoding for Efficient LLM Task Adaptation

Abstract page for arXiv paper 2509.15888: Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv - AI · 4 min · about 2 months ago

LLMs

[2507.18553] The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Abstract page for arXiv paper 2507.18553: The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2507.06567] SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

Abstract page for arXiv paper 2507.06567: SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.05608] BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints

Abstract page for arXiv paper 2509.05608: BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Finger...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2509.04784] Post-training Large Language Models for Diverse High-Quality Responses

Abstract page for arXiv paper 2509.04784: Post-training Large Language Models for Diverse High-Quality Responses

arXiv - AI · 3 min · about 2 months ago

LLMs

[2508.18672] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Abstract page for arXiv paper 2508.18672: Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2506.20746] Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

Abstract page for arXiv paper 2506.20746: Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2506.15872] Hidden Breakthroughs in Language Model Training

Abstract page for arXiv paper 2506.15872: Hidden Breakthroughs in Language Model Training

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2508.11999] MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding

Abstract page for arXiv paper 2508.11999: MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understan...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2508.06526] PiKV: KV Cache Management System for Mixture of Experts

Abstract page for arXiv paper 2508.06526: PiKV: KV Cache Management System for Mixture of Experts

arXiv - AI · 4 min · about 2 months ago

LLMs

[2506.15307] SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

Abstract page for arXiv paper 2506.15307: SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

arXiv - Machine Learning · 4 min · about 2 months ago
