Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

[R] BraiNN: An Experimental Neural Architecture with Working Memory, Relational Reasoning, and Adaptive Learning

BraiNN An Experimental Neural Architecture with Working Memory, Relational Reasoning, and Adaptive Learning BraiNN is a compact research‑...

Reddit - Machine Learning · 1 min ·
Llms

We hit 150 stars on our AI setup tool!

yo folks, we just hit 150 stars on our open source tool that auto makes AI context files. got 90 PRs merged and 20 issues that ppl are pi...

Reddit - Artificial Intelligence · 1 min ·
Llms

Is ai getting dummer?

Over the past month, it feels like GPT and Gemini have been giving wrong answers a lot. Do you feel the same, or am I exaggerating? submi...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.24202] A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula
Llms

[2603.24202] A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

Abstract page for arXiv paper 2603.24202: A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula

arXiv - Machine Learning · 4 min ·
[2603.24126] Likelihood hacking in probabilistic program synthesis
Llms

[2603.24126] Likelihood hacking in probabilistic program synthesis

Abstract page for arXiv paper 2603.24126: Likelihood hacking in probabilistic program synthesis

arXiv - Machine Learning · 3 min ·
[2603.24124] The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
Llms

[2603.24124] The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation

Abstract page for arXiv paper 2603.24124: The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty...

arXiv - Machine Learning · 4 min ·
[2603.24093] Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization
Llms

[2603.24093] Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

Abstract page for arXiv paper 2603.24093: Towards Effective Experiential Learning: Dual Guidance for Utilization and Internalization

arXiv - Machine Learning · 4 min ·
[2603.23994] Understanding the Challenges in Iterative Generative Optimization with LLMs
Llms

[2603.23994] Understanding the Challenges in Iterative Generative Optimization with LLMs

Abstract page for arXiv paper 2603.23994: Understanding the Challenges in Iterative Generative Optimization with LLMs

arXiv - Machine Learning · 4 min ·
[2603.23987] Can we generate portable representations for clinical time series data using LLMs?
Llms

[2603.23987] Can we generate portable representations for clinical time series data using LLMs?

Abstract page for arXiv paper 2603.23987: Can we generate portable representations for clinical time series data using LLMs?

arXiv - Machine Learning · 4 min ·
[2603.23985] Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score
Llms

[2603.23985] Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

Abstract page for arXiv paper 2603.23985: Diet Your LLM: Dimension-wise Global Pruning of LLMs via Merging Task-specific Importance Score

arXiv - Machine Learning · 3 min ·
[2603.23871] HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation
Llms

[2603.23871] HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

Abstract page for arXiv paper 2603.23871: HDPO: Hybrid Distillation Policy Optimization via Privileged Self-Distillation

arXiv - Machine Learning · 3 min ·
[2603.23867] Can VLMs Reason Robustly? A Neuro-Symbolic Investigation
Llms

[2603.23867] Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

Abstract page for arXiv paper 2603.23867: Can VLMs Reason Robustly? A Neuro-Symbolic Investigation

arXiv - Machine Learning · 4 min ·
[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective
Llms

[2603.23831] Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

Abstract page for arXiv paper 2603.23831: Unveiling Hidden Convexity in Deep Learning: a Sparse Signal Processing Perspective

arXiv - Machine Learning · 3 min ·
[2603.23783] Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models
Llms

[2603.23783] Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models

Abstract page for arXiv paper 2603.23783: Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation ...

arXiv - AI · 4 min ·
[2603.23780] Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters
Llms

[2603.23780] Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

Abstract page for arXiv paper 2603.23780: Lightweight Fairness for LLM-Based Recommendations via Kernelized Projection and Gated Adapters

arXiv - Machine Learning · 3 min ·
[2603.23629] Steering Code LLMs with Activation Directions for Language and Library Control
Llms

[2603.23629] Steering Code LLMs with Activation Directions for Language and Library Control

Abstract page for arXiv paper 2603.23629: Steering Code LLMs with Activation Directions for Language and Library Control

arXiv - Machine Learning · 3 min ·
[2603.23626] A Theory of LLM Information Susceptibility
Llms

[2603.23626] A Theory of LLM Information Susceptibility

Abstract page for arXiv paper 2603.23626: A Theory of LLM Information Susceptibility

arXiv - Machine Learning · 4 min ·
[2603.23580] MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis
Llms

[2603.23580] MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

Abstract page for arXiv paper 2603.23580: MetaKube: An Experience-Aware LLM Framework for Kubernetes Failure Diagnosis

arXiv - Machine Learning · 3 min ·
[2603.23577] The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations
Llms

[2603.23577] The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

Abstract page for arXiv paper 2603.23577: The Geometric Price of Discrete Logic: Context-driven Manifold Dynamics of Number Representations

arXiv - Machine Learning · 4 min ·
[2603.23575] APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs
Llms

[2603.23575] APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

Abstract page for arXiv paper 2603.23575: APreQEL: Adaptive Mixed Precision Quantization For Edge LLMs

arXiv - Machine Learning · 4 min ·
[2603.23562] Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG
Llms

[2603.23562] Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG

Abstract page for arXiv paper 2603.23562: Synthetic Mixed Training: Scaling Parametric Knowledge Acquisition Beyond RAG

arXiv - Machine Learning · 4 min ·
[2603.23550] Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction
Llms

[2603.23550] Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction

Abstract page for arXiv paper 2603.23550: Implicit Turn-Wise Policy Optimization for Proactive User-LLM Interaction

arXiv - Machine Learning · 3 min ·
Llms

How do you save and organize your Gemini Deep Research outputs? Curious what workflows people use

I've been using Gemini for deep research and architecture planning, and the outputs are genuinely impressive. But I keep running into the...

Reddit - Artificial Intelligence · 1 min ·
Previous Page 12 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime