Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

I Asked ChatGPT 500 Questions. Here Are the Ads I Saw Most Often | WIRED

Ads are rolling out across the US on ChatGPT’s free tier. I asked OpenAI's bot 500 questions to see what these ads were like and how they...

Wired - AI · 9 min ·

Abacus.Ai Claw LLM consumes an incredible amount of credit without any usage :(

Three days ago, I clicked the "Deploy OpenClaw In Seconds" button to get an overview of the new service, but I didn't build any automatio...

Reddit - Artificial Intelligence · 1 min ·

Google’s Gemini AI app debuts in Hong Kong

Tech giant’s chatbot service tops Apple’s app store chart in the city.

AI Tools & Products · 2 min ·

All Content

[2509.03345] Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive Reasoning

Abstract page for arXiv paper 2509.03345: Do Language Models Follow Occam's Razor? An Evaluation of Parsimony in Inductive and Abductive ...

arXiv - AI · 4 min ·

[2504.15780] TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Abstract page for arXiv paper 2504.15780: TrustGeoGen: Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

arXiv - AI · 4 min ·

[2511.16992] FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models

Abstract page for arXiv paper 2511.16992: FIRM: Federated In-client Regularized Multi-objective Alignment for Large Language Models

arXiv - Machine Learning · 4 min ·

[2510.06790] Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness

Abstract page for arXiv paper 2510.06790: Get RICH or Die Scaling: Profitably Trading Inference Compute for Robustness

arXiv - Machine Learning · 4 min ·

[2603.25697] The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

Abstract page for arXiv paper 2603.25697: The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

arXiv - AI · 3 min ·

[2507.19737] Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

Abstract page for arXiv paper 2507.19737: Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning

arXiv - Machine Learning · 4 min ·

[2505.23004] QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

Abstract page for arXiv paper 2505.23004: QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining

arXiv - Machine Learning · 4 min ·

[2603.25674] Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors

Abstract page for arXiv paper 2603.25674: Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Con...

arXiv - AI · 4 min ·

[2603.25646] A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

Abstract page for arXiv paper 2603.25646: A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots

arXiv - AI · 3 min ·

[2603.25613] Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification

Abstract page for arXiv paper 2603.25613: Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verif...

arXiv - AI · 4 min ·

[2408.05696] SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction

Abstract page for arXiv paper 2408.05696: SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction

arXiv - Machine Learning · 4 min ·

[2603.25568] Are LLMs Overkill for Databases?: A Study on the Finiteness of SQL

Abstract page for arXiv paper 2603.25568: Are LLMs Overkill for Databases?: A Study on the Finiteness of SQL

arXiv - AI · 3 min ·

[2603.25638] Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

Abstract page for arXiv paper 2603.25638: Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers

arXiv - Machine Learning · 3 min ·

[2603.25322] AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study

Abstract page for arXiv paper 2603.25322: AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer's Disease D...

arXiv - AI · 4 min ·

[2603.25268] CRAFT: Grounded Multi-Agent Coordination Under Partial Information

Abstract page for arXiv paper 2603.25268: CRAFT: Grounded Multi-Agent Coordination Under Partial Information

arXiv - AI · 3 min ·

[2603.25403] Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models

Abstract page for arXiv paper 2603.25403: Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models

arXiv - Machine Learning · 3 min ·

[2603.25253] MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Elucidation

Abstract page for arXiv paper 2603.25253: MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Eluci...

arXiv - AI · 4 min ·

[2603.25374] Supercharging Federated Intelligence Retrieval

Abstract page for arXiv paper 2603.25374: Supercharging Federated Intelligence Retrieval

arXiv - Machine Learning · 3 min ·

[2603.25243] FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA

Abstract page for arXiv paper 2603.25243: FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA

arXiv - AI · 3 min ·

[2603.25226] WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing

Abstract page for arXiv paper 2603.25226: WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing

arXiv - AI · 4 min ·