Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

I Cut Claude API Costs by 50% Using This Self Modifying Agentic System

I've been developing a self-modifying AI agent system that effectively cuts my Claude API usage in half. Claude thinks and then I basical...

Reddit - Artificial Intelligence · 1 min ·
Llms

Sentient OS: a custom on-device vision LLM that understands your entire digital life (every screenshot, note, file, email...), while your device charges overnight. Talk to your data, get proactive reminders, and explore knowledge graphs!

99% of "AI" apps are just GPT wrappers that pipe your data to cloud LLMs and call it a product. No one's ever created an intelligence lay...

Reddit - Artificial Intelligence · 1 min ·
Llms

What to build while we still have access to cheap AI?

AI companies are subsidizing access the same way Uber subsidized rides and AWS subsidized compute in the early days - burning cash to gra...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2509.21029] FORCE: Transferable Visual Jailbreaking Attacks via Feature Over-Reliance CorrEction
Llms

arXiv - Machine Learning · 4 min ·
[2509.23383] Train Once, Answer All: Many Pretraining Experiments for the Cost of One
Llms

arXiv - Machine Learning · 4 min ·
[2509.22611] Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning
Llms

arXiv - Machine Learning · 4 min ·
[2509.22299] HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space
Llms

arXiv - Machine Learning · 4 min ·
[2509.22134] Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding
Llms

arXiv - AI · 4 min ·
[2508.07697] Semantic-Enhanced Time-Series Forecasting via Large Language Models
Llms

arXiv - Machine Learning · 4 min ·
[2508.07638] Data Selection for LLM Alignment Using Fine-Grained Preferences
Llms

arXiv - Machine Learning · 4 min ·
[2508.04097] Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks
Llms

arXiv - Machine Learning · 4 min ·
[2508.04865] Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment
Llms

arXiv - Machine Learning · 4 min ·
[2509.15888] Distribution-Aligned Decoding for Efficient LLM Task Adaptation
Llms

arXiv - AI · 4 min ·
[2507.18553] The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm
Llms

arXiv - Machine Learning · 4 min ·
[2507.06567] SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference
Llms

arXiv - Machine Learning · 4 min ·
[2509.05608] BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints
Llms

arXiv - Machine Learning · 4 min ·
[2509.04784] Post-training Large Language Models for Diverse High-Quality Responses
Llms

arXiv - AI · 3 min ·
[2508.18672] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Llms

arXiv - Machine Learning · 4 min ·
[2506.20746] Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers
Llms

arXiv - Machine Learning · 4 min ·
[2506.15872] Hidden Breakthroughs in Language Model Training
Llms

arXiv - Machine Learning · 3 min ·
[2508.11999] MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding
Llms

arXiv - Machine Learning · 4 min ·
[2508.06526] PiKV: KV Cache Management System for Mixture of Experts
Llms

arXiv - AI · 4 min ·
[2506.15307] SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC
Llms

arXiv - Machine Learning · 4 min ·