Large Language Models

GPT, Claude, Gemini, and other LLMs


Top This Week

LLMs

Researchers asked ChatGPT, Gemini and Claude which jobs are most exposed to AI. The chatbots wildly disagree

A study reveals that AI models disagree on which jobs are most vulnerable to automation, highlighting the unreliability of AI-generated e...

AI Tools & Products · 4 min · about 6 hours ago

LLMs

I stopped treating ChatGPT like Google — and everything suddenly clicked

I stopped using ChatGPT like Google and started treating it like a thinking partner — here’s why that simple shift made the AI dramatical...

AI Tools & Products · 8 min · about 6 hours ago

LLMs

Hackers abuse Google ads, Claude.ai chats to push Mac malware

AI Tools & Products · 6 min · about 6 hours ago

All Content

LLMs

[2605.07649] Operating Within the Operational Design Domain: Zero-Shot Perception with Vision-Language Models

Abstract page for arXiv paper 2605.07649: Operating Within the Operational Design Domain: Zero-Shot Perception with Vision-Language Models

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07575] Response-G1: Explicit Scene Graph Modeling for Proactive Streaming Video Understanding

Abstract page for arXiv paper 2605.07575: Response-G1: Explicit Scene Graph Modeling for Proactive Streaming Video Understanding

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07481] Vaporizer: Breaking Watermarking Schemes for Large Language Model Outputs

Abstract page for arXiv paper 2605.07481: Vaporizer: Breaking Watermarking Schemes for Large Language Model Outputs

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07517] LARAG: Link-Aware Retrieval Strategy for RAG Systems in Hyperlinked Technical Documentation

Abstract page for arXiv paper 2605.07517: LARAG: Link-Aware Retrieval Strategy for RAG Systems in Hyperlinked Technical Documentation

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07472] HBEE: Human Behavioral Entropy Engine -- Pre-Registered Multi-Agent LLM Simulation of Peer-Suspicion-Based Detection Inversion

Abstract page for arXiv paper 2605.07472: HBEE: Human Behavioral Entropy Engine -- Pre-Registered Multi-Agent LLM Simulation of Peer-Susp...

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07422] Prompt Engineering Strategies for LLM-based Qualitative Coding of Psychological Safety in Software Engineering Communities: A Controlled Empirical Study

Abstract page for arXiv paper 2605.07422: Prompt Engineering Strategies for LLM-based Qualitative Coding of Psychological Safety in Softw...

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07394] BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning

Abstract page for arXiv paper 2605.07394: BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07355] TTF: Temporal Token Fusion for Efficient Video-Language Model

Abstract page for arXiv paper 2605.07355: TTF: Temporal Token Fusion for Efficient Video-Language Model

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07325] CSR: Infinite-Horizon Real-Time Policies with Massive Cached State Representations

Abstract page for arXiv paper 2605.07325: CSR: Infinite-Horizon Real-Time Policies with Massive Cached State Representations

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07314] DCGL: Dual-Channel Graph Learning with Large Language Models for Knowledge-Aware Recommendation

Abstract page for arXiv paper 2605.07314: DCGL: Dual-Channel Graph Learning with Large Language Models for Knowledge-Aware Recommendation

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07305] MedAction: Towards Active Multi-turn Clinical Diagnostic LLMs

Abstract page for arXiv paper 2605.07305: MedAction: Towards Active Multi-turn Clinical Diagnostic LLMs

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07299] EgoPro-Bench: Benchmarking Personalized Proactive Interaction in Egocentric Video Streams

Abstract page for arXiv paper 2605.07299: EgoPro-Bench: Benchmarking Personalized Proactive Interaction in Egocentric Video Streams

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07271] Understanding Performance Collapse in Layer-Pruned Large Language Models via Decision Representation Transitions

Abstract page for arXiv paper 2605.07271: Understanding Performance Collapse in Layer-Pruned Large Language Models via Decision Represent...

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07250] Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment

Abstract page for arXiv paper 2605.07250: Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07234] Reformulating KV Cache Eviction Problem for Long-Context LLM Inference

Abstract page for arXiv paper 2605.07234: Reformulating KV Cache Eviction Problem for Long-Context LLM Inference

arXiv - AI · 3 min · about 9 hours ago

LLMs

[2605.07186] The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval

Abstract page for arXiv paper 2605.07186: The Text Uncanny Valley: Non-Monotonic Performance Degradation in LLM Information Retrieval

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07141] Qwen3-VL-Seg: Unlocking Open-World Referring Segmentation with Vision-Language Grounding

Abstract page for arXiv paper 2605.07141: Qwen3-VL-Seg: Unlocking Open-World Referring Segmentation with Vision-Language Grounding

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07111] Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

Abstract page for arXiv paper 2605.07111: Beyond LoRA vs. Full Fine-Tuning: Gradient-Guided Optimizer Routing for LLM Adaptation

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07068] WiCER: Wiki-memory Compile, Evaluate, Refine Iterative Knowledge Compilation for LLM Wiki Systems

Abstract page for arXiv paper 2605.07068: WiCER: Wiki-memory Compile, Evaluate, Refine Iterative Knowledge Compilation for LLM Wiki Systems

arXiv - AI · 4 min · about 9 hours ago

LLMs

[2605.07058] MedExAgent: Training LLM Agents to Ask, Examine, and Diagnose in Noisy Clinical Environments

Abstract page for arXiv paper 2605.07058: MedExAgent: Training LLM Agents to Ask, Examine, and Diagnose in Noisy Clinical Environments

arXiv - AI · 4 min · about 9 hours ago
