We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”
What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...
GPT, Claude, Gemini, and other LLMs
What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...
Chrome users were caught off guard by a 4-GB Google AI model baked into Chrome, sparking privacy concerns. The good news: You can easily ...
The company is expanding its efforts to protect ChatGPT users in cases where conversations may turn to self-harm.
Abstract page for arXiv paper 2505.02872: Decoding Open-Ended Information Seeking Goals from Eye Movements in Reading
Abstract page for arXiv paper 2504.02010: When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reason...
Abstract page for arXiv paper 2503.12988: ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM
Abstract page for arXiv paper 2503.21735: GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics
Abstract page for arXiv paper 2503.06749: Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models
Abstract page for arXiv paper 2503.06238: Token-Efficient Item Representation via Images for LLM Recommender Systems
Abstract page for arXiv paper 2404.08480: Using ChatGPT for Data Science Analyses
Abstract page for arXiv paper 2503.03862: Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Mode...
Abstract page for arXiv paper 2503.02879: Wikipedia in the Era of LLMs: Evolution and Risks
Abstract page for arXiv paper 2502.12179: Sparse Shift Autoencoders for Identifying Concepts from Large Language Model Activations
Abstract page for arXiv paper 2502.04326: WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs
Abstract page for arXiv paper 2412.19496: Multi-PA: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models
Abstract page for arXiv paper 2411.03292: Interaction2Code: Benchmarking MLLM-based Interactive Webpage Code Generation from Interactive ...
Abstract page for arXiv paper 2410.13648: SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Abstract page for arXiv paper 2410.05254: GLEE: A Unified Framework and Benchmark for Language-based Economic Environments
Abstract page for arXiv paper 2603.02080: From Pixels to Patches: Pooling Strategies for Earth Embeddings
Abstract page for arXiv paper 2603.02026: Learning to Read Where to Look: Disease-Aware Vision-Language Pretraining for 3D CT
Abstract page for arXiv paper 2603.01834: Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions
Abstract page for arXiv paper 2602.11661: Quark Medical Alignment: A Holistic Multi-Dimensional Alignment and Collaborative Optimization ...
Abstract page for arXiv paper 2602.10625: To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime