Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

Claude Mythos and Project Glasswing: why an AI superhacker has the tech world on alert

A new AI model could automate the process of searching for cybersecurity bugs and flaws – for better or worse.

AI Tools & Products · 5 min · 39 minutes ago

Llms

Gemini could take a 'proactive' approach with leaked 'Your Day' feature

This feature could leverage your apps in a way that might feel familiar.

AI Tools & Products · 5 min · 39 minutes ago

Llms

I ditched my paper planner for Gemini Live — and it solved the one professional problem I couldn't fix

Can Gemini Live replace a physical planner? Tom's Guide AI Editor Amanda Caswell ditched her notebook for Google’s voice AI. Here’s how i...

AI Tools & Products · 8 min · 40 minutes ago

All Content

Llms

[2603.03752] Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning

Abstract page for arXiv paper 2603.03752: Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04300] LUMINA: Foundation Models for Topology Transferable ACOPF

Abstract page for arXiv paper 2603.04300: LUMINA: Foundation Models for Topology Transferable ACOPF

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2603.03739] PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent Predictive Representation

Abstract page for arXiv paper 2603.03739: PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03727] Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots through LLM-Generated Probes

Abstract page for arXiv paper 2603.03727: Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots throug...

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04276] Causality Elicitation from Large Language Models

Abstract page for arXiv paper 2603.04276: Causality Elicitation from Large Language Models

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04142] A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series

Abstract page for arXiv paper 2603.04142: A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2603.03681] EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

Abstract page for arXiv paper 2603.03681: EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03677] MIND: Unified Inquiry and Diagnosis RL with Criteria Grounded Clinical Supports for Psychiatric Consultation

Abstract page for arXiv paper 2603.03677: MIND: Unified Inquiry and Diagnosis RL with Criteria Grounded Clinical Supports for Psychiatric...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.04135] Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization

Abstract page for arXiv paper 2603.04135: Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03637] Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions

Abstract page for arXiv paper 2603.03637: Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial I...

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03633] Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study

Abstract page for arXiv paper 2603.03633: Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.04045] Inference-Time Toxicity Mitigation in Protein Language Models

Abstract page for arXiv paper 2603.04045: Inference-Time Toxicity Mitigation in Protein Language Models

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03590] Social Norm Reasoning in Multimodal Language Models: An Evaluation

Abstract page for arXiv paper 2603.03590: Social Norm Reasoning in Multimodal Language Models: An Evaluation

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03585] Belief-Sim: Towards Belief-Driven Simulation of Demographic Misinformation Susceptibility

Abstract page for arXiv paper 2603.03585: Belief-Sim: Towards Belief-Driven Simulation of Demographic Misinformation Susceptibility

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.04028] A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Quality

Abstract page for arXiv paper 2603.04028: A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Qua...

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03555] Molt Dynamics: Emergent Social Phenomena in Autonomous AI Agent Populations

Abstract page for arXiv paper 2603.03555: Molt Dynamics: Emergent Social Phenomena in Autonomous AI Agent Populations

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03543] Tucano 2 Cool: Better Open Source LLMs for Portuguese

Abstract page for arXiv paper 2603.03543: Tucano 2 Cool: Better Open Source LLMs for Portuguese

arXiv - AI · 4 min · about 1 month ago

Llms

[2603.03541] RAG-X: Systematic Diagnosis of Retrieval-Augmented Generation for Medical Question Answering

Abstract page for arXiv paper 2603.03541: RAG-X: Systematic Diagnosis of Retrieval-Augmented Generation for Medical Question Answering

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03536] SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

Abstract page for arXiv paper 2603.03536: SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

arXiv - AI · 3 min · about 1 month ago

Llms

[2603.03946] Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Abstract page for arXiv paper 2603.03946: Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 169 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Claude Mythos and Project Glasswing: why an AI superhacker has the tech world on alert

Gemini could take a 'proactive' approach with leaked 'Your Day' feature

I ditched my paper planner for Gemini Live — and it solved the one professional problem I couldn't fix

All Content

[2603.03752] Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning

[2603.04300] LUMINA: Foundation Models for Topology Transferable ACOPF

[2603.03739] PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent Predictive Representation

[2603.03727] Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots through LLM-Generated Probes

[2603.04276] Causality Elicitation from Large Language Models

[2603.04142] A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series

[2603.03681] EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

[2603.03677] MIND: Unified Inquiry and Diagnosis RL with Criteria Grounded Clinical Supports for Psychiatric Consultation

[2603.04135] Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization

[2603.03637] Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions

[2603.03633] Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study

[2603.04045] Inference-Time Toxicity Mitigation in Protein Language Models

[2603.03590] Social Norm Reasoning in Multimodal Language Models: An Evaluation

[2603.03585] Belief-Sim: Towards Belief-Driven Simulation of Demographic Misinformation Susceptibility

[2603.04028] A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Quality

[2603.03555] Molt Dynamics: Emergent Social Phenomena in Autonomous AI Agent Populations

[2603.03543] Tucano 2 Cool: Better Open Source LLMs for Portuguese

[2603.03541] RAG-X: Systematic Diagnosis of Retrieval-Augmented Generation for Medical Question Answering

[2603.03536] SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

[2603.03946] Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models

Related Topics

Stay updated with AI News