Built a multiplayer map where you can see everyone's Claude Code activity as creatures battling it out
Hello r/artificial I built this specifically for Claude Code users - every prompt you run feeds a digital pet called a Prompt Creature. T...
GPT, Claude, Gemini, and other LLMs
Hello r/artificial I built this specifically for Claude Code users - every prompt you run feeds a digital pet called a Prompt Creature. T...
TL;DR - I've written two novel functions that shape the training signal for LLMs. Early tests show people prefer responses from models tr...
TL;DR: I got tired of manually running Shapiro-Wilk tests and copy-pasting p-values at 2 AM. I built an open-source, async Python pipelin...
Abstract page for arXiv paper 2602.06412: Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding
Abstract page for arXiv paper 2601.23157: No More, No Less: Least-Privilege Language Models
Abstract page for arXiv paper 2602.04755: When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?
Abstract page for arXiv paper 2601.19933: NRR-Phi: Text-to-State Mapping for Ambiguity Preservation in LLM Inference
Abstract page for arXiv paper 2512.12594: ceLLMate: Sandboxing Browser AI Agents
Abstract page for arXiv paper 2512.19570: The Epistemological Consequences of Large Language Models: Rethinking collective intelligence a...
Abstract page for arXiv paper 2510.15040: Composition-Grounded Data Synthesis for Visual Reasoning
Abstract page for arXiv paper 2512.18925: Beyond the Prompt: An Empirical Study of Cursor Rules
Abstract page for arXiv paper 2512.15792: A Systematic Analysis of Biases in Large Language Models
Abstract page for arXiv paper 2510.02578: FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D l...
Abstract page for arXiv paper 2511.09396: Multimodal Large Language Models for Low-Resource Languages: A Case Study for Basque
Abstract page for arXiv paper 2511.03441: CareMedEval dataset: Evaluating Critical Appraisal and Reasoning in the Biomedical Field
Abstract page for arXiv paper 2510.24702: Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
Abstract page for arXiv paper 2510.24178: MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations
Abstract page for arXiv paper 2510.10889: Topological Alignment of Shared Vision-Language Embedding Space
Abstract page for arXiv paper 2510.07181: TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Abstract page for arXiv paper 2505.06046: Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime