Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

World models will be the next big thing, bye-bye LLMs

Was at Nvidia's GTC conference recently and honestly, it was one of the most eye-opening events I've attended in a while. There was a lot...

Reddit - Artificial Intelligence · 1 min · 44 minutes ago

Llms

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

hey everyone. been lurking here for a while and wanted to share something we been building. the problem: ai coding agents are only as goo...

Reddit - Artificial Intelligence · 1 min · 44 minutes ago

Llms

I Accidentally Discovered a Security Vulnerability in AI Education — Then Submitted It To a $200K Competition

Last night I was testing Maestro University, the first fully AI-taught university. I walked into their enrollment chatbot and asked it to...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

All Content

Llms

[2603.08104] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

Abstract page for arXiv paper 2603.08104: Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2602.03773] Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Abstract page for arXiv paper 2602.03773: Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2601.03385] SIGMA: Scalable Spectral Insights for LLM Model Collapse

Abstract page for arXiv paper 2601.03385: SIGMA: Scalable Spectral Insights for LLM Model Collapse

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2512.19735] Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting

Abstract page for arXiv paper 2512.19735: Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2512.10656] Token Sample Complexity of Attention

Abstract page for arXiv paper 2512.10656: Token Sample Complexity of Attention

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2509.24302] LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces

Abstract page for arXiv paper 2509.24302: LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2509.21861] SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

Abstract page for arXiv paper 2509.21861: SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2508.07117] From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

Abstract page for arXiv paper 2508.07117: From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2505.15340] SSR: Speculative Parallel Scaling Reasoning in Test-time

Abstract page for arXiv paper 2505.15340: SSR: Speculative Parallel Scaling Reasoning in Test-time

arXiv - Machine Learning · 3 min · 7 days ago

Llms

[2503.01013] TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

Abstract page for arXiv paper 2503.01013: TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2407.08626] RoboMorph: Evolving Robot Morphology using Large Language Models

Abstract page for arXiv paper 2407.08626: RoboMorph: Evolving Robot Morphology using Large Language Models

arXiv - Machine Learning · 3 min · 7 days ago

Llms

[2406.03736] Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

Abstract page for arXiv paper 2406.03736: Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.22278] The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

Abstract page for arXiv paper 2603.22278: The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.22216] Gumbel Distillation for Parallel Text Generation

Abstract page for arXiv paper 2603.22216: Gumbel Distillation for Parallel Text Generation

arXiv - Machine Learning · 3 min · 7 days ago

Llms

[2603.21658] A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures

Abstract page for arXiv paper 2603.21658: A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Comm...

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.21465] DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

Abstract page for arXiv paper 2603.21465: DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.21389] Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

Abstract page for arXiv paper 2603.21389: Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

arXiv - Machine Learning · 3 min · 7 days ago

Llms

[2603.21335] TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols

Abstract page for arXiv paper 2603.21335: TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Pr...

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.21033] TabPFN Extensions for Interpretable Geotechnical Modelling

Abstract page for arXiv paper 2603.21033: TabPFN Extensions for Interpretable Geotechnical Modelling

arXiv - Machine Learning · 4 min · 7 days ago

Llms

[2603.20975] DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

Abstract page for arXiv paper 2603.20975: DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

arXiv - Machine Learning · 3 min · 7 days ago

Previous Page 29 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

World models will be the next big thing, bye-bye LLMs

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

I Accidentally Discovered a Security Vulnerability in AI Education — Then Submitted It To a $200K Competition

All Content

[2603.08104] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

[2602.03773] Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

[2601.03385] SIGMA: Scalable Spectral Insights for LLM Model Collapse

[2512.19735] Improving Fairness of Large Language Model-Based ICU Mortality Prediction via Case-Based Prompting

[2512.10656] Token Sample Complexity of Attention

[2509.24302] LEAF: Language-EEG Aligned Foundation Model for Brain-Computer Interfaces

[2509.21861] SpecMol: A Spectroscopy-Grounded Foundation Model for Multi-Task Molecular Learning

[2508.07117] From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context

[2505.15340] SSR: Speculative Parallel Scaling Reasoning in Test-time

[2503.01013] TimeXL: Explainable Multi-modal Time Series Prediction with LLM-in-the-Loop

[2407.08626] RoboMorph: Evolving Robot Morphology using Large Language Models

[2406.03736] Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

[2603.22278] The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

[2603.22216] Gumbel Distillation for Parallel Text Generation

[2603.21658] A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures

[2603.21465] DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation

[2603.21389] Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

[2603.21335] TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols

[2603.21033] TabPFN Extensions for Interpretable Geotechnical Modelling

[2603.20975] DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

Related Topics

Stay updated with AI News