Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

As a beginner, how did you learn about how to use AI?

Most people aren’t going to learn AI by reading about it. They’re going to learn by using it. The problem is AI can be sycophantic and wi...

Reddit - Artificial Intelligence · 1 min ·
Tumbler Ridge families sue OpenAI for not alerting police to the suspect’s ChatGPT activity | The Verge

Seven families of Tumbler Ridge school shooting victims are suing OpenAI and CEO Sam Altman after they failed to alert police to the shoo...

The Verge - AI · 4 min ·
Granite 4.1 LLMs: How They’re Built

A blog post by IBM Granite on Hugging Face

Hugging Face Blog · 14 min ·

All Content

[2601.15160] Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

Abstract page for arXiv paper 2601.15160: Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

arXiv - AI · 4 min ·
[2511.22235] Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation

Abstract page for arXiv paper 2511.22235: Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon ...

arXiv - AI · 4 min ·
[2511.21471] SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition

Abstract page for arXiv paper 2511.21471: SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition

arXiv - AI · 4 min ·
[2511.05854] Can a Small Model Learn to Look Before It Leaps? Dynamic Learning and Proactive Correction for Hallucination Detection

Abstract page for arXiv paper 2511.05854: Can a Small Model Learn to Look Before It Leaps? Dynamic Learning and Proactive Correction for ...

arXiv - AI · 4 min ·
[2510.26905] Cognition Envelopes for Bounded Decision Making in Autonomous UAS Operations

Abstract page for arXiv paper 2510.26905: Cognition Envelopes for Bounded Decision Making in Autonomous UAS Operations

arXiv - AI · 4 min ·
[2505.20065] SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

Abstract page for arXiv paper 2505.20065: SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety

arXiv - AI · 4 min ·
[2510.09782] The Geometry of Reasoning: Flowing Logics in Representation Space

Abstract page for arXiv paper 2510.09782: The Geometry of Reasoning: Flowing Logics in Representation Space

arXiv - Machine Learning · 4 min ·
[2510.07972] SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance

Abstract page for arXiv paper 2510.07972: SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance

arXiv - AI · 4 min ·
[2509.21782] Benchmarking MLLM-based Web Understanding: Reasoning, Robustness and Safety

Abstract page for arXiv paper 2509.21782: Benchmarking MLLM-based Web Understanding: Reasoning, Robustness and Safety

arXiv - AI · 4 min ·
[2508.03284] ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools

Abstract page for arXiv paper 2508.03284: ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools

arXiv - AI · 4 min ·
[2505.02888] When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger

Abstract page for arXiv paper 2505.02888: When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger

arXiv - AI · 3 min ·
[2507.15796] From Privacy to Trust in the Agentic Era: A Taxonomy of Challenges in Trustworthy Federated Learning Through the Lens of Trust Report 2.0

Abstract page for arXiv paper 2507.15796: From Privacy to Trust in the Agentic Era: A Taxonomy of Challenges in Trustworthy Federated Lea...

arXiv - AI · 4 min ·
[2507.15518] HAMLET: A Hierarchical and Adaptive Multi-Agent Framework for Live Embodied Theatrics

Abstract page for arXiv paper 2507.15518: HAMLET: A Hierarchical and Adaptive Multi-Agent Framework for Live Embodied Theatrics

arXiv - AI · 4 min ·
[2502.01534] Preference Leakage: A Contamination Problem in LLM-as-a-judge

Abstract page for arXiv paper 2502.01534: Preference Leakage: A Contamination Problem in LLM-as-a-judge

arXiv - AI · 4 min ·
[2506.08321] LeanTutor: Towards a Verified AI Mathematical Proof Tutor

Abstract page for arXiv paper 2506.08321: LeanTutor: Towards a Verified AI Mathematical Proof Tutor

arXiv - AI · 3 min ·
[2505.21668] R1-Code-Interpreter: LLMs Reason with Code via Supervised and Multi-stage Reinforcement Learning

Abstract page for arXiv paper 2505.21668: R1-Code-Interpreter: LLMs Reason with Code via Supervised and Multi-stage Reinforcement Learning

arXiv - AI · 4 min ·
[2505.21281] RLJP: Legal Judgment Prediction via First-Order Logic Rule-enhanced with Large Language Models

Abstract page for arXiv paper 2505.21281: RLJP: Legal Judgment Prediction via First-Order Logic Rule-enhanced with Large Language Models

arXiv - AI · 4 min ·
[2504.20505] MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living

Abstract page for arXiv paper 2504.20505: MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities o...

arXiv - AI · 4 min ·
[2603.04317] World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurrence Statistics in Static Word Embeddings

Abstract page for arXiv paper 2603.04317: World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurr...

arXiv - AI · 3 min ·
[2603.04257] Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

Abstract page for arXiv paper 2603.04257: Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

arXiv - Machine Learning · 4 min ·