Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

🤖 AI News Digest - March 27, 2026

Today's AI news: 1. My minute-by-minute response to the LiteLLM malware attack The article describes a detailed, minute-by-minute respons...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

[D] Real-time Student Attention Detection: ResNet vs Facial Landmarks - Which approach for resource-constrained deployment?

I have a problem statement where we are supposed to detect the attention level of student in a classroom, basically output whether he is ...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

[D] We audited LoCoMo: 6.4% of the answer key is wrong and the judge accepts up to 63% of intentionally wrong answers

Projects are still submitting new scores on LoCoMo as of March 2026. We audited it and found 6.4% of the answer key is wrong, and the LLM...

Reddit - Machine Learning · 1 min · about 1 hour ago

All Content

Llms

[2603.24986] Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

Abstract page for arXiv paper 2603.24986: Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

arXiv - AI · 3 min · about 12 hours ago

Llms

[2603.24940] Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system

Abstract page for arXiv paper 2603.24940: Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-i...

arXiv - AI · 4 min · about 12 hours ago

Llms

[2603.24617] Multi-LLM Query Optimization

Abstract page for arXiv paper 2603.24617: Multi-LLM Query Optimization

arXiv - Machine Learning · 3 min · about 12 hours ago

Llms

[2603.25687] On Neural Scaling Laws for Weather Emulation through Continual Training

Abstract page for arXiv paper 2603.25687: On Neural Scaling Laws for Weather Emulation through Continual Training

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.24857] AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

Abstract page for arXiv paper 2603.24857: AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.24846] NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Neurological Disorders

Abstract page for arXiv paper 2603.24846: NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Ne...

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.25562] Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

Abstract page for arXiv paper 2603.25562: Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

arXiv - AI · 4 min · about 12 hours ago

Llms

[2603.24804] GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretraining

Abstract page for arXiv paper 2603.24804: GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretra...

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.24774] From Untestable to Testable: Metamorphic Testing in the Age of LLMs

Abstract page for arXiv paper 2603.24774: From Untestable to Testable: Metamorphic Testing in the Age of LLMs

arXiv - AI · 3 min · about 12 hours ago

Llms

[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.25385] GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs

Abstract page for arXiv paper 2603.25385: GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs

arXiv - AI · 4 min · about 12 hours ago

Llms

[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - AI · 4 min · about 12 hours ago

Llms

[2603.24721] Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models

Abstract page for arXiv paper 2603.24721: Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.25186] Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation

Abstract page for arXiv paper 2603.25186: Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserv...

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.24651] When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews

Abstract page for arXiv paper 2603.24651: When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews

arXiv - AI · 3 min · about 12 hours ago

Llms

[2603.25184] Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model

Abstract page for arXiv paper 2603.25184: Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reaso...

arXiv - AI · 4 min · about 12 hours ago

Llms

[2603.25111] SEVerA: Verified Synthesis of Self-Evolving Agents

Abstract page for arXiv paper 2603.25111: SEVerA: Verified Synthesis of Self-Evolving Agents

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2603.24629] Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models

Abstract page for arXiv paper 2603.24629: Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models

arXiv - AI · 4 min · about 12 hours ago

Llms

[2603.25062] SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Autoregressive Contrastive Learning

Abstract page for arXiv paper 2603.25062: SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Auto...

arXiv - Machine Learning · 3 min · about 12 hours ago

Llms

[2603.25040] Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Abstract page for arXiv paper 2603.25040: Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

arXiv - Machine Learning · 5 min · about 12 hours ago

Previous Page 5 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

🤖 AI News Digest - March 27, 2026

[D] Real-time Student Attention Detection: ResNet vs Facial Landmarks - Which approach for resource-constrained deployment?

[D] We audited LoCoMo: 6.4% of the answer key is wrong and the judge accepts up to 63% of intentionally wrong answers

All Content

[2603.24986] Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

[2603.24940] Evaluating adaptive and generative AI-based feedback and recommendations in a knowledge-graph-integrated programming learning system

[2603.24617] Multi-LLM Query Optimization

[2603.25687] On Neural Scaling Laws for Weather Emulation through Continual Training

[2603.24857] AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective

[2603.24846] NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Neurological Disorders

[2603.25562] Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

[2603.24804] GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretraining

[2603.24774] From Untestable to Testable: Metamorphic Testing in the Age of LLMs

[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

[2603.25385] GlowQ: Group-Shared LOw-Rank Approximation for Quantized LLMs

[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

[2603.24721] Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models

[2603.25186] Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation

[2603.24651] When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews

[2603.25184] Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model

[2603.25111] SEVerA: Verified Synthesis of Self-Evolving Agents

[2603.24629] Sketch2Simulation: Automating Flowsheet Generation via Multi Agent Large Language Models

[2603.25062] SIGMA: Structure-Invariant Generative Molecular Alignment for Chemical Language Models via Autoregressive Contrastive Learning

[2603.25040] Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Related Topics

Stay updated with AI News