Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

OpenClaw security checklist: practical safeguards for AI agents

Here is one of the better quality guides on the ensuring safety when deploying OpenClaw: https://chatgptguide.ai/openclaw-security-checkl...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge

Gemini in Google Maps is a surprisingly useful way to explore new territory.

The Verge - AI · 11 min · about 3 hours ago

Llms

The person who replaces you probably won't be AI. It'll be someone from the next department over who learned to use it - opinion/discussion

I'm a strategy person by background. Two years ago I'd write a recommendation and hand it to a product team. Now.. I describe what I want...

Reddit - Artificial Intelligence · 1 min · about 11 hours ago

All Content

Llms

[2508.13876] Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

Abstract page for arXiv paper 2508.13876: Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

arXiv - AI · 4 min · 13 days ago

Llms

[2502.14400] HPS: Hard Preference Sampling for Human Preference Alignment

Abstract page for arXiv paper 2502.14400: HPS: Hard Preference Sampling for Human Preference Alignment

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20180] Adaptive Greedy Frame Selection for Long Video Understanding

Abstract page for arXiv paper 2603.20180: Adaptive Greedy Frame Selection for Long Video Understanding

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20179] AI Agents Can Already Autonomously Perform Experimental High Energy Physics

Abstract page for arXiv paper 2603.20179: AI Agents Can Already Autonomously Perform Experimental High Energy Physics

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20172] Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation

Abstract page for arXiv paper 2603.20172: Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thoug...

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20164] The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

Abstract page for arXiv paper 2603.20164: The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20161] Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Abstract page for arXiv paper 2603.20161: Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.20122] Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

Abstract page for arXiv paper 2603.20122: Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20094] LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain

Abstract page for arXiv paper 2603.20094: LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace ...

arXiv - AI · 4 min · 13 days ago

$[2603.20105] The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus$

Llms

[2603.20105] The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus

Abstract page for arXiv paper 2603.20105: The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus

arXiv - AI · 4 min · 13 days ago

Llms

[2603.20100] An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

Abstract page for arXiv paper 2603.20100: An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

arXiv - AI · 3 min · 13 days ago

Llms

[2603.20075] Agentic Harness for Real-World Compilers

Abstract page for arXiv paper 2603.20075: Agentic Harness for Real-World Compilers

arXiv - AI · 3 min · 13 days ago

Llms

[2603.20062] The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

Abstract page for arXiv paper 2603.20062: The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

arXiv - AI · 3 min · 13 days ago

Llms

[2603.20042] LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families

Abstract page for arXiv paper 2603.20042: LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recogniti...

arXiv - AI · 3 min · 13 days ago

Llms

[2603.20020] Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR

Abstract page for arXiv paper 2603.20020: Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for...

arXiv - AI · 4 min · 13 days ago

Llms

[2603.19987] Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Abstract page for arXiv paper 2603.19987: Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

arXiv - AI · 3 min · 13 days ago

Llms

[2603.19957] HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

Abstract page for arXiv paper 2603.19957: HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.19880] What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Abstract page for arXiv paper 2603.19880: What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

arXiv - AI · 3 min · 13 days ago

Llms

[2603.19849] Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue

Abstract page for arXiv paper 2603.19849: Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue

arXiv - AI · 4 min · 13 days ago

Llms

[2603.19677] GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems

Abstract page for arXiv paper 2603.19677: GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems

arXiv - AI · 4 min · 13 days ago

Previous Page 78 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

OpenClaw security checklist: practical safeguards for AI agents

I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge

The person who replaces you probably won't be AI. It'll be someone from the next department over who learned to use it - opinion/discussion

All Content

[2508.13876] Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

[2502.14400] HPS: Hard Preference Sampling for Human Preference Alignment

[2603.20180] Adaptive Greedy Frame Selection for Long Video Understanding

[2603.20179] AI Agents Can Already Autonomously Perform Experimental High Energy Physics

[2603.20172] Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation

[2603.20164] The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

[2603.20161] Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

[2603.20122] Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

[2603.20094] LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain

[2603.20105] The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus

[2603.20100] An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

[2603.20075] Agentic Harness for Real-World Compilers

[2603.20062] The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

[2603.20042] LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families

[2603.20020] Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR

[2603.19987] Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

[2603.19957] HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

[2603.19880] What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

[2603.19849] Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue

[2603.19677] GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems

Related Topics

Stay updated with AI News