Gemma 4 actually running usable on an Android phone (not llama.cpp)
I wanted a real local assistant on my phone, not a demo. First tried the usual llama.cpp in Termux — Gemma 4 was 2–3 tok/s and the phone ...
GPT, Claude, Gemini, and other LLMs
I wanted a real local assistant on my phone, not a demo. First tried the usual llama.cpp in Termux — Gemma 4 was 2–3 tok/s and the phone ...
AI Coding contest day 8 The eighth challenge is a weighted variant of the classic knight's tour. The knight must visit every square of a ...
Mid-October, early morning at work. I was hunting for a podcast to throw on while I worked and stumbled into something about what AI coul...
Abstract page for arXiv paper 2603.03336: Prompt-Dependent Ranking of Large Language Models with Uncertainty Quantification
Abstract page for arXiv paper 2603.03310: Entropic-Time Inference: Self-Organizing Large Language Model Decoding Beyond Attention
Abstract page for arXiv paper 2603.03823: SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
Abstract page for arXiv paper 2603.03790: T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Re...
Abstract page for arXiv paper 2603.04378: Robustness of Agentic AI Systems via Adversarially-Aligned Jacobian Regularization
Abstract page for arXiv paper 2603.04355: Efficient Refusal Ablation in LLM through Optimal Transport
Abstract page for arXiv paper 2603.04354: Out-of-distribution transfer of PDE foundation models to material dynamics under extreme loading
Abstract page for arXiv paper 2603.03752: Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning
Abstract page for arXiv paper 2603.04300: LUMINA: Foundation Models for Topology Transferable ACOPF
Abstract page for arXiv paper 2603.03739: PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent ...
Abstract page for arXiv paper 2603.03727: Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots throug...
Abstract page for arXiv paper 2603.04276: Causality Elicitation from Large Language Models
Abstract page for arXiv paper 2603.04142: A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series
Abstract page for arXiv paper 2603.03681: EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs
Abstract page for arXiv paper 2603.03677: MIND: Unified Inquiry and Diagnosis RL with Criteria Grounded Clinical Supports for Psychiatric...
Abstract page for arXiv paper 2603.04135: Unbiased Dynamic Pruning for Efficient Group-Based Policy Optimization
Abstract page for arXiv paper 2603.03637: Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial I...
Abstract page for arXiv paper 2603.03633: Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study
Abstract page for arXiv paper 2603.04045: Inference-Time Toxicity Mitigation in Protein Language Models
Abstract page for arXiv paper 2603.03590: Social Norm Reasoning in Multimodal Language Models: An Evaluation
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime