Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

Claude Code x n8n

Hi everyone, I’ve been exploring MCP and integrating tools like n8n with Claude Code, and I’m trying to understand how practical this rea...

Reddit - Artificial Intelligence · 1 min ·

LLM comprehension question

Basically, does anyone else also get a really strange sense of lingering confusion and non-comprehension when an LLM explains a complex c...

Reddit - Artificial Intelligence · 1 min ·

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try rando...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2508.07638] Data Selection for LLM Alignment Using Fine-Grained Preferences

Abstract page for arXiv paper 2508.07638: Data Selection for LLM Alignment Using Fine-Grained Preferences

arXiv - Machine Learning · 4 min ·
[2508.04097] Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

Abstract page for arXiv paper 2508.04097: Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks

arXiv - Machine Learning · 4 min ·
[2508.04865] Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment

Abstract page for arXiv paper 2508.04865: Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Lear...

arXiv - Machine Learning · 4 min ·
[2509.15888] Distribution-Aligned Decoding for Efficient LLM Task Adaptation

Abstract page for arXiv paper 2509.15888: Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv - AI · 4 min ·
[2507.18553] The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Abstract page for arXiv paper 2507.18553: The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

arXiv - Machine Learning · 4 min ·
[2507.06567] SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

Abstract page for arXiv paper 2507.06567: SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

arXiv - Machine Learning · 4 min ·
[2509.05608] BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints

Abstract page for arXiv paper 2509.05608: BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Finger...

arXiv - Machine Learning · 4 min ·
[2509.04784] Post-training Large Language Models for Diverse High-Quality Responses

Abstract page for arXiv paper 2509.04784: Post-training Large Language Models for Diverse High-Quality Responses

arXiv - AI · 3 min ·
[2508.18672] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Abstract page for arXiv paper 2508.18672: Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

arXiv - Machine Learning · 4 min ·
[2506.20746] Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

Abstract page for arXiv paper 2506.20746: Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers

arXiv - Machine Learning · 4 min ·
[2506.15872] Hidden Breakthroughs in Language Model Training

Abstract page for arXiv paper 2506.15872: Hidden Breakthroughs in Language Model Training

arXiv - Machine Learning · 3 min ·
[2508.11999] MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding

Abstract page for arXiv paper 2508.11999: MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understan...

arXiv - Machine Learning · 4 min ·
[2508.06526] PiKV: KV Cache Management System for Mixture of Experts

Abstract page for arXiv paper 2508.06526: PiKV: KV Cache Management System for Mixture of Experts

arXiv - AI · 4 min ·
[2506.15307] SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

Abstract page for arXiv paper 2506.15307: SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

arXiv - Machine Learning · 4 min ·
[2506.14003] Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs

Abstract page for arXiv paper 2506.14003: Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs

arXiv - Machine Learning · 4 min ·
[2507.15852] Advancing Complex Video Object Segmentation via Progressive Concept Construction

Abstract page for arXiv paper 2507.15852: Advancing Complex Video Object Segmentation via Progressive Concept Construction

arXiv - AI · 4 min ·
[2507.04219] Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

Abstract page for arXiv paper 2507.04219: Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

arXiv - Machine Learning · 4 min ·
[2506.02939] QKV Projections Require a Fraction of Their Memory

Abstract page for arXiv paper 2506.02939: QKV Projections Require a Fraction of Their Memory

arXiv - Machine Learning · 3 min ·
[2506.20666] Cognitive models can reveal interpretable value trade-offs in language models

Abstract page for arXiv paper 2506.20666: Cognitive models can reveal interpretable value trade-offs in language models

arXiv - AI · 4 min ·
[2506.18841] LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Abstract page for arXiv paper 2506.18841: LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

arXiv - AI · 4 min ·