Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

What is the basic minimum while you prompt

I have realised Claude answers as best as you prompt it. And I suck at it. 😂 I have tried role playing you are top 1% etc and adding cons...

Reddit - Artificial Intelligence · 1 min · 29 minutes ago

Llms

We open-sourced our AI agent config management tool — 888 stars, nearly 100 forks — requesting community feedback

We've been building Caliber to solve AI agent configuration management and released our full setup as open source. The response has been ...

Reddit - Artificial Intelligence · 1 min · 29 minutes ago

Llms

"Prompt Engineering" certs are a joke. So we built a FREE Agentic AI Practitioner Exam that actually forces you to build working swarms to pass.

Hey Everyone, If you look at the AI education space right now, it’s flooded with basic "Prompt Engineering" certificates that you can pas...

Reddit - Artificial Intelligence · 1 min · 29 minutes ago

All Content

Llms

[2506.15872] Hidden Breakthroughs in Language Model Training

Abstract page for arXiv paper 2506.15872: Hidden Breakthroughs in Language Model Training

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2508.11999] MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding

Abstract page for arXiv paper 2508.11999: MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understan...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2508.06526] PiKV: KV Cache Management System for Mixture of Experts

Abstract page for arXiv paper 2508.06526: PiKV: KV Cache Management System for Mixture of Experts

arXiv - AI · 4 min · 2 months ago

Llms

[2506.15307] SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

Abstract page for arXiv paper 2506.15307: SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.14003] Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs

Abstract page for arXiv paper 2506.14003: Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2507.15852] Advancing Complex Video Object Segmentation via Progressive Concept Construction

Abstract page for arXiv paper 2507.15852: Advancing Complex Video Object Segmentation via Progressive Concept Construction

arXiv - AI · 4 min · 2 months ago

Llms

[2507.04219] Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

Abstract page for arXiv paper 2507.04219: Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.02939] QKV Projections Require a Fraction of Their Memory

Abstract page for arXiv paper 2506.02939: QKV Projections Require a Fraction of Their Memory

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2506.20666] Cognitive models can reveal interpretable value trade-offs in language models

Abstract page for arXiv paper 2506.20666: Cognitive models can reveal interpretable value trade-offs in language models

arXiv - AI · 4 min · 2 months ago

Llms

[2506.18841] LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Abstract page for arXiv paper 2506.18841: LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

arXiv - AI · 4 min · 2 months ago

Llms

[2505.19590] Learning to Reason without External Rewards

Abstract page for arXiv paper 2505.19590: Learning to Reason without External Rewards

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2506.13474] Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning

Abstract page for arXiv paper 2506.13474: Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.15498] SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling

Abstract page for arXiv paper 2506.15498: SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervisio...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2505.18116] NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Abstract page for arXiv paper 2505.18116: NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.10085] VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models

Abstract page for arXiv paper 2506.10085: VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models

arXiv - AI · 4 min · 2 months ago

Llms

[2505.16122] Plan and Budget: Effective and Efficient Test-Time Scaling on Reasoning Large Language Models

Abstract page for arXiv paper 2505.16122: Plan and Budget: Effective and Efficient Test-Time Scaling on Reasoning Large Language Models

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2505.14042] Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners

Abstract page for arXiv paper 2505.14042: Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.08902] Intention-Conditioned Flow Occupancy Models

Abstract page for arXiv paper 2506.08902: Intention-Conditioned Flow Occupancy Models

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2506.06683] RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

Abstract page for arXiv paper 2506.06683: RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

arXiv - AI · 4 min · 2 months ago

Llms

[2506.03135] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

Abstract page for arXiv paper 2506.03135: OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

arXiv - AI · 4 min · 2 months ago

Previous Page 306 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

What is the basic minimum while you prompt

We open-sourced our AI agent config management tool — 888 stars, nearly 100 forks — requesting community feedback

"Prompt Engineering" certs are a joke. So we built a FREE Agentic AI Practitioner Exam that actually forces you to build working swarms to pass.

All Content

[2506.15872] Hidden Breakthroughs in Language Model Training

[2508.11999] MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding

[2508.06526] PiKV: KV Cache Management System for Mixture of Experts

[2506.15307] SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

[2506.14003] Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs

[2507.15852] Advancing Complex Video Object Segmentation via Progressive Concept Construction

[2507.04219] Model Collapse Is Not a Bug but a Feature in Machine Unlearning for LLMs

[2506.02939] QKV Projections Require a Fraction of Their Memory

[2506.20666] Cognitive models can reveal interpretable value trade-offs in language models

[2506.18841] LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

[2505.19590] Learning to Reason without External Rewards

[2506.13474] Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning

[2506.15498] SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling

[2505.18116] NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

[2506.10085] VITA: Zero-Shot Value Functions via Test-Time Adaptation of Vision-Language Models

[2505.16122] Plan and Budget: Effective and Efficient Test-Time Scaling on Reasoning Large Language Models

[2505.14042] Adversarially Pretrained Transformers May Be Universally Robust In-Context Learners

[2506.08902] Intention-Conditioned Flow Occupancy Models

[2506.06683] RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks

[2506.03135] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

Related Topics

Stay updated with AI News