Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”

What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

How to Disable Google's Gemini in Chrome | WIRED

Chrome users were caught off guard by a 4-GB Google AI model baked into Chrome, sparking privacy concerns. The good news: You can easily ...

Wired - AI · 6 min · about 4 hours ago

OpenAI introduces new 'Trusted Contact' safeguard for cases of possible self-harm | TechCrunch

The company is expanding its efforts to protect ChatGPT users in cases where conversations may turn to self-harm.

TechCrunch - AI · 5 min · about 4 hours ago

All Content

[2505.19653] Token-Importance Guided Direct Preference Optimization

arXiv - AI · 3 min · 2 months ago

[2504.18453] Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation

arXiv - AI · 4 min · 2 months ago

[2502.07644] SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models

arXiv - AI · 4 min · 2 months ago

[2503.11832] Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning

arXiv - Machine Learning · 4 min · 2 months ago

[2603.00846] Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

arXiv - Machine Learning · 3 min · 2 months ago

[2412.03772] A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

arXiv - AI · 4 min · 2 months ago

[2410.05669] ACPBench: Reasoning about Action, Change, and Planning

arXiv - AI · 4 min · 2 months ago

[2408.05233] Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Factors: A Statistical-Driven LLM based Agent Approach

arXiv - AI · 4 min · 2 months ago

[2603.00638] RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation

arXiv - Machine Learning · 4 min · 2 months ago

[2603.02156] How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks

arXiv - AI · 4 min · 2 months ago

[2603.02128] LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations

arXiv - AI · 3 min · 2 months ago

[2603.00474] Wireless Power Control Based on Large Language Models

arXiv - Machine Learning · 3 min · 2 months ago

[2603.00359] How Large Language Models Get Stuck: Early structure with persistent errors

arXiv - Machine Learning · 4 min · 2 months ago

[2603.02041] EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post-Training

arXiv - AI · 4 min · 2 months ago

[2603.02024] MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning

arXiv - AI · 4 min · 2 months ago

[2603.01973] CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production

arXiv - AI · 4 min · 2 months ago

[2603.01966] AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations

arXiv - AI · 3 min · 2 months ago

[2603.01942] Ignore All Previous Instructions: Jailbreaking as a de-escalatory peace building practise to resist LLM social media bots

arXiv - AI · 3 min · 2 months ago

[2603.01919] Real Money, Fake Models: Deceptive Model Claims in Shadow APIs

arXiv - AI · 4 min · 2 months ago

[2603.01912] Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

arXiv - AI · 3 min · 2 months ago
