Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

I thought of something while cooking up a simple RL AI. Please Validate it. [R]

So, I was trying to build a simple AI when I thought of, 'How could I give an AI some emotions? ' This led to one thing after another, an...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

Open-source list of GenAI-related incidents

I am sharing this open-source list of cases where the ethics of GenAI use were put in the spotlight, in the hopes of sparking discussion ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

Hey everyone, I've been working on a repo where I implement large language model architectures using the simplest PyTorch code possible. ...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

Llms

[2507.07847] From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems

Abstract page for arXiv paper 2507.07847: From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Au...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.05630] Rewards as Labels: Revisiting RLVR from a Classification Perspective

Abstract page for arXiv paper 2602.05630: Rewards as Labels: Revisiting RLVR from a Classification Perspective

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.17473] LeanTutor: Towards a Verified AI Mathematical Proof Tutor

Abstract page for arXiv paper 2601.17473: LeanTutor: Towards a Verified AI Mathematical Proof Tutor

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2505.23783] Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

Abstract page for arXiv paper 2505.23783: Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.20760] Generalization of RLVR Using Causal Reasoning as a Testbed

Abstract page for arXiv paper 2512.20760: Generalization of RLVR Using Causal Reasoning as a Testbed

arXiv - AI · 4 min · about 1 month ago

Llms

[2504.07109] OSCAR: Online Soft Compression And Reranking

Abstract page for arXiv paper 2504.07109: OSCAR: Online Soft Compression And Reranking

arXiv - AI · 3 min · about 1 month ago

Llms

[2503.07885] Safety Guardrails for LLM-Enabled Robots

Abstract page for arXiv paper 2503.07885: Safety Guardrails for LLM-Enabled Robots

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.22935] EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model

Abstract page for arXiv paper 2511.22935: EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model

arXiv - AI · 4 min · about 1 month ago

Llms

[2412.13091] LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Abstract page for arXiv paper 2412.13091: LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

arXiv - AI · 3 min · about 1 month ago

Llms

[2510.15982] AMiD: Knowledge Distillation for LLMs with $α$-mixture Assistant Distribution

Abstract page for arXiv paper 2510.15982: AMiD: Knowledge Distillation for LLMs with $α$-mixture Assistant Distribution

arXiv - AI · 4 min · about 1 month ago

Llms

[2406.06512] Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset

Abstract page for arXiv paper 2406.06512: Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset

arXiv - AI · 4 min · about 1 month ago

Llms

[2405.15374] Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

Abstract page for arXiv paper 2405.15374: Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.23405] Planner Aware Path Learning in Diffusion Language Models Training

Abstract page for arXiv paper 2509.23405: Planner Aware Path Learning in Diffusion Language Models Training

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.22263] Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning

Abstract page for arXiv paper 2509.22263: Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2509.21465] Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data

Abstract page for arXiv paper 2509.21465: Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.17874] Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models

Abstract page for arXiv paper 2509.17874: Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.09937] Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?

Abstract page for arXiv paper 2602.09937: Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.15963] On the Limits of Sparse Autoencoders: A Theoretical Framework and Reweighted Remedy

Abstract page for arXiv paper 2506.15963: On the Limits of Sparse Autoencoders: A Theoretical Framework and Reweighted Remedy

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2601.16529] SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters for Emergency Care

Abstract page for arXiv paper 2601.16529: SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters fo...

arXiv - AI · 3 min · about 1 month ago

Llms

[2601.15160] Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

Abstract page for arXiv paper 2601.15160: Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

arXiv - AI · 4 min · about 1 month ago

Previous Page 194 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

I thought of something while cooking up a simple RL AI. Please Validate it. [R]

Open-source list of GenAI-related incidents

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

All Content

[2507.07847] From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems

[2602.05630] Rewards as Labels: Revisiting RLVR from a Classification Perspective

[2601.17473] LeanTutor: Towards a Verified AI Mathematical Proof Tutor

[2505.23783] Boosting In-Context Learning in LLMs Through the Lens of Classical Supervised Learning

[2512.20760] Generalization of RLVR Using Causal Reasoning as a Testbed

[2504.07109] OSCAR: Online Soft Compression And Reranking

[2503.07885] Safety Guardrails for LLM-Enabled Robots

[2511.22935] EnECG: Efficient Ensemble Learning for Electrocardiogram Multi-task Foundation Model

[2412.13091] LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

[2510.15982] AMiD: Knowledge Distillation for LLMs with $α$-mixture Assistant Distribution

[2406.06512] Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset

[2405.15374] Leveraging Large Language Models for Semantic Query Processing in a Scholarly Knowledge Graph

[2509.23405] Planner Aware Path Learning in Diffusion Language Models Training

[2509.22263] Erase or Hide? Suppressing Spurious Unlearning Neurons for Robust Unlearning

[2509.21465] Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data

[2509.17874] Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models

[2602.09937] Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?

[2506.15963] On the Limits of Sparse Autoencoders: A Theoretical Framework and Reweighted Remedy

[2601.16529] SycoEval-EM: Sycophancy Evaluation of Large Language Models in Simulated Clinical Encounters for Emergency Care

[2601.15160] Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

Related Topics

Stay updated with AI News