Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

OpenClaw security checklist: practical safeguards for AI agents

Here is one of the better quality guides on the ensuring safety when deploying OpenClaw: https://chatgptguide.ai/openclaw-security-checkl...

Reddit - Artificial Intelligence · 1 min ·
I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge
Llms

I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge

Gemini in Google Maps is a surprisingly useful way to explore new territory.

The Verge - AI · 11 min ·
Llms

The person who replaces you probably won't be AI. It'll be someone from the next department over who learned to use it - opinion/discussion

I'm a strategy person by background. Two years ago I'd write a recommendation and hand it to a product team. Now.. I describe what I want...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2508.13876] Improved Generalized Planning with LLMs through Strategy Refinement and Reflection
Llms

[2508.13876] Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

Abstract page for arXiv paper 2508.13876: Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

arXiv - AI · 4 min ·
[2502.14400] HPS: Hard Preference Sampling for Human Preference Alignment
Llms

[2502.14400] HPS: Hard Preference Sampling for Human Preference Alignment

Abstract page for arXiv paper 2502.14400: HPS: Hard Preference Sampling for Human Preference Alignment

arXiv - AI · 4 min ·
[2603.20180] Adaptive Greedy Frame Selection for Long Video Understanding
Llms

[2603.20180] Adaptive Greedy Frame Selection for Long Video Understanding

Abstract page for arXiv paper 2603.20180: Adaptive Greedy Frame Selection for Long Video Understanding

arXiv - AI · 4 min ·
[2603.20179] AI Agents Can Already Autonomously Perform Experimental High Energy Physics
Llms

[2603.20179] AI Agents Can Already Autonomously Perform Experimental High Energy Physics

Abstract page for arXiv paper 2603.20179: AI Agents Can Already Autonomously Perform Experimental High Energy Physics

arXiv - AI · 4 min ·
[2603.20172] Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation
Llms

[2603.20172] Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation

Abstract page for arXiv paper 2603.20172: Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thoug...

arXiv - AI · 4 min ·
[2603.20164] The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning
Llms

[2603.20164] The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

Abstract page for arXiv paper 2603.20164: The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning

arXiv - AI · 4 min ·
[2603.20161] Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models
Llms

[2603.20161] Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Abstract page for arXiv paper 2603.20161: Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.20122] Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models
Llms

[2603.20122] Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

Abstract page for arXiv paper 2603.20122: Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models

arXiv - AI · 4 min ·
[2603.20094] LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain
Llms

[2603.20094] LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace Domain

Abstract page for arXiv paper 2603.20094: LLM-Enhanced Semantic Data Integration of Electronic Component Qualifications in the Aerospace ...

arXiv - AI · 4 min ·
[2603.20105] The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus
Llms

[2603.20105] The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus

Abstract page for arXiv paper 2603.20105: The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus

arXiv - AI · 4 min ·
[2603.20100] An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models
Llms

[2603.20100] An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

Abstract page for arXiv paper 2603.20100: An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models

arXiv - AI · 3 min ·
[2603.20075] Agentic Harness for Real-World Compilers
Llms

[2603.20075] Agentic Harness for Real-World Compilers

Abstract page for arXiv paper 2603.20075: Agentic Harness for Real-World Compilers

arXiv - AI · 3 min ·
[2603.20062] The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries
Llms

[2603.20062] The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

Abstract page for arXiv paper 2603.20062: The End of Rented Discovery: How AI Search Redistributes Power Between Hotels and Intermediaries

arXiv - AI · 3 min ·
[2603.20042] LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families
Llms

[2603.20042] LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families

Abstract page for arXiv paper 2603.20042: LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recogniti...

arXiv - AI · 3 min ·
[2603.20020] Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR
Llms

[2603.20020] Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR

Abstract page for arXiv paper 2603.20020: Detached Skip-Links and $R$-Probe: Decoupling Feature Aggregation from Gradient Propagation for...

arXiv - AI · 4 min ·
[2603.19987] Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States
Llms

[2603.19987] Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

Abstract page for arXiv paper 2603.19987: Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States

arXiv - AI · 3 min ·
[2603.19957] HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction
Llms

[2603.19957] HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

Abstract page for arXiv paper 2603.19957: HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

arXiv - Machine Learning · 3 min ·
[2603.19880] What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time
Llms

[2603.19880] What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Abstract page for arXiv paper 2603.19880: What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

arXiv - AI · 3 min ·
[2603.19849] Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue
Llms

[2603.19849] Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue

Abstract page for arXiv paper 2603.19849: Semantic Delta: An Interpretable Signal Differentiating Human and LLMs Dialogue

arXiv - AI · 4 min ·
[2603.19677] GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
Llms

[2603.19677] GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems

Abstract page for arXiv paper 2603.19677: GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems

arXiv - AI · 4 min ·
Previous Page 78 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime