We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”
What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...
GPT, Claude, Gemini, and other LLMs
What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...
Chrome users were caught off guard by a 4-GB Google AI model baked into Chrome, sparking privacy concerns. The good news: You can easily ...
The company is expanding its efforts to protect ChatGPT users in cases where conversations may turn to self-harm.
Abstract page for arXiv paper 2505.19653: Token-Importance Guided Direct Preference Optimization
Abstract page for arXiv paper 2504.18453: Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Ge...
Abstract page for arXiv paper 2502.07644: SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models
Abstract page for arXiv paper 2503.11832: Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated ...
Abstract page for arXiv paper 2603.00846: Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models
Abstract page for arXiv paper 2412.03772: A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices
Abstract page for arXiv paper 2410.05669: ACPBench: Reasoning about Action, Change, and Planning
Abstract page for arXiv paper 2408.05233: Electric Vehicle User Charging Behavior Analysis Integrating Psychological and Environmental Fa...
Abstract page for arXiv paper 2603.00638: RAIE: Region-Aware Incremental Preference Editing with LoRA for LLM-based Recommendation
Abstract page for arXiv paper 2603.02156: How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks
Abstract page for arXiv paper 2603.02128: LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in ...
Abstract page for arXiv paper 2603.00474: Wireless Power Control Based on Large Language Models
Abstract page for arXiv paper 2603.00359: How Large Language Models Get Stuck: Early structure with persistent errors
Abstract page for arXiv paper 2603.02041: EstLLM: Enhancing Estonian Capabilities in Multilingual LLMs via Continued Pretraining and Post...
Abstract page for arXiv paper 2603.02024: MMR-Life: Piecing Together Real-life Scenes for Multimodal Multi-image Reasoning
Abstract page for arXiv paper 2603.01973: CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production
Abstract page for arXiv paper 2603.01966: AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations
Abstract page for arXiv paper 2603.01942: Ignore All Previous Instructions: Jailbreaking as a de-escalatory peace building practise to re...
Abstract page for arXiv paper 2603.01919: Real Money, Fake Models: Deceptive Model Claims in Shadow APIs
Abstract page for arXiv paper 2603.01912: Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime