Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge

Gemini in Google Maps is a surprisingly useful way to explore new territory.

The Verge - AI · 11 min · about 1 hour ago

Llms

The person who replaces you probably won't be AI. It'll be someone from the next department over who learned to use it - opinion/discussion

I'm a strategy person by background. Two years ago I'd write a recommendation and hand it to a product team. Now.. I describe what I want...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

Llms

Block Resets Management With AI As Cash App Adds Installment Transfers

Block (NYSE:XYZ) plans a permanent organizational overhaul that replaces many middle management roles with AI-driven models to create fla...

AI Tools & Products · 5 min · about 11 hours ago

All Content

Llms

[2603.18123] Understanding Task Aggregation for Generalizable Ultrasound Foundation Models

Abstract page for arXiv paper 2603.18123: Understanding Task Aggregation for Generalizable Ultrasound Foundation Models

arXiv - AI · 4 min · 13 days ago

Llms

[2603.18090] MOSS-TTS Technical Report

Abstract page for arXiv paper 2603.18090: MOSS-TTS Technical Report

arXiv - AI · 3 min · 13 days ago

Llms

[2603.16513] FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data

Abstract page for arXiv paper 2603.16513: FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data

arXiv - AI · 4 min · 13 days ago

Llms

[2603.15727] ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

Abstract page for arXiv paper 2603.15727: ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.12277] Prompt Injection as Role Confusion

Abstract page for arXiv paper 2603.12277: Prompt Injection as Role Confusion

arXiv - AI · 3 min · 13 days ago

Llms

[2601.03273] A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness

Abstract page for arXiv paper 2601.03273: A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robust...

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2601.03018] Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis

Abstract page for arXiv paper 2601.03018: Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-Wor...

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2512.01183] TempPerturb-Eval: On the Joint Effects of Internal Temperature and External Perturbations in RAG Robustness

Abstract page for arXiv paper 2512.01183: TempPerturb-Eval: On the Joint Effects of Internal Temperature and External Perturbations in RA...

arXiv - AI · 3 min · 13 days ago

Llms

[2511.21448] The Phish, The Spam, and The Valid: Generating Feature-Rich Emails for Benchmarking LLMs

Abstract page for arXiv paper 2511.21448: The Phish, The Spam, and The Valid: Generating Feature-Rich Emails for Benchmarking LLMs

arXiv - AI · 4 min · 13 days ago

Llms

[2511.16665] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Abstract page for arXiv paper 2511.16665: Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

arXiv - AI · 4 min · 13 days ago

Llms

[2511.06571] Rep2Text: Decoding Full Text from a Single LLM Token Representation

Abstract page for arXiv paper 2511.06571: Rep2Text: Decoding Full Text from a Single LLM Token Representation

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2510.19496] CARES: Context-Aware Resolution Selector for VLMs

Abstract page for arXiv paper 2510.19496: CARES: Context-Aware Resolution Selector for VLMs

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2510.05710] FinReflectKG -- EvalBench: Benchmarking Financial KG with Multi-Dimensional Evaluation

Abstract page for arXiv paper 2510.05710: FinReflectKG -- EvalBench: Benchmarking Financial KG with Multi-Dimensional Evaluation

arXiv - AI · 4 min · 13 days ago

Llms

[2506.15047] Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers

Abstract page for arXiv paper 2506.15047: Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for A...

arXiv - AI · 4 min · 13 days ago

Llms

[2504.09775] Understanding and Optimizing Multi-Stage AI Inference Pipelines

Abstract page for arXiv paper 2504.09775: Understanding and Optimizing Multi-Stage AI Inference Pipelines

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2411.13207] LISAA: A Framework for Large Language Model Information Security Awareness Assessment

Abstract page for arXiv paper 2411.13207: LISAA: A Framework for Large Language Model Information Security Awareness Assessment

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2310.11703] A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge

Abstract page for arXiv paper 2310.11703: A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge

arXiv - AI · 4 min · 13 days ago

Llms

[2603.18048] DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

Abstract page for arXiv paper 2603.18048: DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

arXiv - AI · 4 min · 13 days ago

Llms

[2601.12781] VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension

Abstract page for arXiv paper 2601.12781: VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression ...

arXiv - AI · 4 min · 13 days ago

Llms

[2508.13876] Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

Abstract page for arXiv paper 2508.13876: Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

arXiv - AI · 4 min · 13 days ago

Previous Page 77 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge

The person who replaces you probably won't be AI. It'll be someone from the next department over who learned to use it - opinion/discussion

Block Resets Management With AI As Cash App Adds Installment Transfers

All Content

[2603.18123] Understanding Task Aggregation for Generalizable Ultrasound Foundation Models

[2603.18090] MOSS-TTS Technical Report

[2603.16513] FEAT: A Linear-Complexity Foundation Model for Extremely Large Structured Data

[2603.15727] ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

[2603.12277] Prompt Injection as Role Confusion

[2601.03273] A Multi-Perspective Benchmark and Moderation Model for Evaluating Safety and Adversarial Robustness

[2601.03018] Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis

[2512.01183] TempPerturb-Eval: On the Joint Effects of Internal Temperature and External Perturbations in RAG Robustness

[2511.21448] The Phish, The Spam, and The Valid: Generating Feature-Rich Emails for Benchmarking LLMs

[2511.16665] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

[2511.06571] Rep2Text: Decoding Full Text from a Single LLM Token Representation

[2510.19496] CARES: Context-Aware Resolution Selector for VLMs

[2510.05710] FinReflectKG -- EvalBench: Benchmarking Financial KG with Multi-Dimensional Evaluation

[2506.15047] Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers

[2504.09775] Understanding and Optimizing Multi-Stage AI Inference Pipelines

[2411.13207] LISAA: A Framework for Large Language Model Information Security Awareness Assessment

[2310.11703] A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge

[2603.18048] DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

[2601.12781] VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension

[2508.13876] Improved Generalized Planning with LLMs through Strategy Refinement and Reflection

Related Topics

Stay updated with AI News