Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Have Companies Began Adopting Claude Co-Work at an Enterprise Level?

Hi Guys, My company is considering purchasing the Claude Enterprise plan. The main two constraints are: - Being able to block usage of Cl...

Reddit - Artificial Intelligence · 1 min · 40 minutes ago

Llms

What I learned about multi-agent coordination running 9 specialized Claude agents

I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

I've been reviewing how various AI memory systems evaluate their performance and noticed a fundamental issue with cross-system comparison...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

Llms

[2509.21861] SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

Abstract page for arXiv paper 2509.21861: SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2508.07117] From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

Abstract page for arXiv paper 2508.07117: From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2505.15340] SSR: Speculative Parallel Scaling Reasoning in Test-time

Abstract page for arXiv paper 2505.15340: SSR: Speculative Parallel Scaling Reasoning in Test-time

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2503.01013] TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

Abstract page for arXiv paper 2503.01013: TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2407.08626] RoboMorph: Evolving Robot Morphology using Large Language Models

Abstract page for arXiv paper 2407.08626: RoboMorph: Evolving Robot Morphology using Large Language Models

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2406.03736] Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

Abstract page for arXiv paper 2406.03736: Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.22278] The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

Abstract page for arXiv paper 2603.22278: The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.22216] Gumbel Distillation for Parallel Text Generation

Abstract page for arXiv paper 2603.22216: Gumbel Distillation for Parallel Text Generation

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.21658] A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures

Abstract page for arXiv paper 2603.21658: A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Comm...

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.21465] DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

Abstract page for arXiv paper 2603.21465: DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.21389] Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

Abstract page for arXiv paper 2603.21389: Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.21335] TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols

Abstract page for arXiv paper 2603.21335: TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Pr...

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.21033] TabPFN Extensions for Interpretable Geotechnical Modelling

Abstract page for arXiv paper 2603.21033: TabPFN Extensions for Interpretable Geotechnical Modelling

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20975] DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

Abstract page for arXiv paper 2603.20975: DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20895] LLM Router: Prefill is All You Need

Abstract page for arXiv paper 2603.20895: LLM Router: Prefill is All You Need

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20808] Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models

Abstract page for arXiv paper 2603.20808: Predictive Regularization Against Visual Representation Degradation in Multimodal Large Languag...

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20799] RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

Abstract page for arXiv paper 2603.20799: RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a...

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20389] A chemical language model for reticular materials design

Abstract page for arXiv paper 2603.20389: A chemical language model for reticular materials design

arXiv - Machine Learning · 4 min · 8 days ago

Llms

[2603.20314] VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs

Abstract page for arXiv paper 2603.20314: VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs

arXiv - Machine Learning · 3 min · 8 days ago

Llms

[2603.20219] Thinking into the Future: Latent Lookahead Training for Transformers

Abstract page for arXiv paper 2603.20219: Thinking into the Future: Latent Lookahead Training for Transformers

arXiv - Machine Learning · 4 min · 8 days ago

Previous Page 42 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Have Companies Began Adopting Claude Co-Work at an Enterprise Level?

What I learned about multi-agent coordination running 9 specialized Claude agents

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

All Content

[2509.21861] SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

[2508.07117] From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

[2505.15340] SSR: Speculative Parallel Scaling Reasoning in Test-time

[2503.01013] TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

[2407.08626] RoboMorph: Evolving Robot Morphology using Large Language Models

[2406.03736] Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

[2603.22278] The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

[2603.22216] Gumbel Distillation for Parallel Text Generation

[2603.21658] A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures

[2603.21465] DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

[2603.21389] Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

[2603.21335] TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols

[2603.21033] TabPFN Extensions for Interpretable Geotechnical Modelling

[2603.20975] DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

[2603.20895] LLM Router: Prefill is All You Need

[2603.20808] Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models

[2603.20799] RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

[2603.20389] A chemical language model for reticular materials design

[2603.20314] VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs

[2603.20219] Thinking into the Future: Latent Lookahead Training for Transformers

Related Topics

Stay updated with AI News