Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[R] BraiNN: An Experimental Neural Architecture with Working Memory, Relational Reasoning, and Adaptive Learning

BraiNN An Experimental Neural Architecture with Working Memory, Relational Reasoning, and Adaptive Learning BraiNN is a compact research‑...

Reddit - Machine Learning · 1 min · 26 minutes ago

Llms

We hit 150 stars on our AI setup tool!

yo folks, we just hit 150 stars on our open source tool that auto makes AI context files. got 90 PRs merged and 20 issues that ppl are pi...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

Is ai getting dummer?

Over the past month, it feels like GPT and Gemini have been giving wrong answers a lot. Do you feel the same, or am I exaggerating? submi...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

All Content

Llms

[2603.24202] A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

Abstract page for arXiv paper 2603.24202: A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.24126] Likelihood hacking in probabilistic program synthesis

Abstract page for arXiv paper 2603.24126: Likelihood hacking in probabilistic program synthesis

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.24124] The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation

Abstract page for arXiv paper 2603.24124: The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty...

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.24093] Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

Abstract page for arXiv paper 2603.24093: Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23994] Understanding the Challenges in Iterative Generative Optimization with LLMs

Abstract page for arXiv paper 2603.23994: Understanding the Challenges in Iterative Generative Optimization with LLMs

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23987] Can we generate portable representations for clinical time series data using LLMs?

Abstract page for arXiv paper 2603.23987: Can we generate portable representations for clinical time series data using LLMs?

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23985] Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

Abstract page for arXiv paper 2603.23985: Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.23871] HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

Abstract page for arXiv paper 2603.23871: HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.23867] Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

Abstract page for arXiv paper 2603.23867: Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Abstract page for arXiv paper 2603.23831: Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.23783] Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

Abstract page for arXiv paper 2603.23783: Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation ...

arXiv - AI · 4 min · 3 days ago

Llms

[2603.23780] Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

Abstract page for arXiv paper 2603.23780: Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.23629] Steering Code LLMs with Activation Directions for Language and Library Control

Abstract page for arXiv paper 2603.23629: Steering Code LLMs with Activation Directions for Language and Library Control

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.23626] A Theory of LLM Information Susceptibility

Abstract page for arXiv paper 2603.23626: A Theory of LLM Information Susceptibility

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23580] MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

Abstract page for arXiv paper 2603.23580: MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

arXiv - Machine Learning · 3 min · 3 days ago

Llms

[2603.23577] The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

Abstract page for arXiv paper 2603.23577: The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23575] APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

Abstract page for arXiv paper 2603.23575: APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23562] Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG

Abstract page for arXiv paper 2603.23562: Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG

arXiv - Machine Learning · 4 min · 3 days ago

Llms

[2603.23550] Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction

Abstract page for arXiv paper 2603.23550: Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction

arXiv - Machine Learning · 3 min · 3 days ago

Llms

How do you save and organize your Gemini Deep Research outputs? Curious what workflows people use

I've been using Gemini for deep research and architecture planning, and the outputs are genuinely impressive. But I keep running into the...

Reddit - Artificial Intelligence · 1 min · 4 days ago

Previous Page 12 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

[R] BraiNN: An Experimental Neural Architecture with Working Memory, Relational Reasoning, and Adaptive Learning

We hit 150 stars on our AI setup tool!

Is ai getting dummer?

All Content

[2603.24202] A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

[2603.24126] Likelihood hacking in probabilistic program synthesis

[2603.24124] The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation

[2603.24093] Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

[2603.23994] Understanding the Challenges in Iterative Generative Optimization with LLMs

[2603.23987] Can we generate portable representations for clinical time series data using LLMs?

[2603.23985] Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

[2603.23871] HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

[2603.23867] Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

[2603.23783] Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

[2603.23780] Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

[2603.23629] Steering Code LLMs with Activation Directions for Language and Library Control

[2603.23626] A Theory of LLM Information Susceptibility

[2603.23580] MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

[2603.23577] The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

[2603.23575] APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

[2603.23562] Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG

[2603.23550] Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction

How do you save and organize your Gemini Deep Research outputs? Curious what workflows people use

Related Topics

Stay updated with AI News