[2605.07731] Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs
Abstract page for arXiv paper 2605.07731: Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs
Open weights models, datasets, and frameworks
Abstract page for arXiv paper 2605.07731: Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs
Abstract page for arXiv paper 2509.08461: Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics
Abstract page for arXiv paper 2605.07990: Tool Calling is Linearly Readable and Steerable in Language Models
Abstract page for arXiv paper 2605.07731: Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs
Abstract page for arXiv paper 2509.08461: Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics
Abstract page for arXiv paper 2605.07990: Tool Calling is Linearly Readable and Steerable in Language Models
Abstract page for arXiv paper 2605.07984: Where's the Plan? Locating Latent Planning in Language Models with Lightweight Mechanistic Inte...
Abstract page for arXiv paper 2605.07395: Unsolvability Ceiling in Multi-LLM Routing: An Empirical Study of Evaluation Artifacts
A Blog post by Lablab.ai AMD Developer Hackathon on Hugging Face
submitted by /u/Heavy-Factor-1919 [link] [comments]
I built a small website called LLM Win: https://llm-win.com It turns LLM benchmark results into a directed graph: text If model A beats m...
A Blog post by Lablab.ai AMD Developer Hackathon on Hugging Face
Abstract page for arXiv paper 2605.05716: More Is Not Always Better: Cross-Component Interference in LLM Agent Scaffolding
He lives on your desktop as a transparent overlay and does whatever he wants. You can try to talk to him, throw him across the screen, or...
A Blog post by Lablab.ai AMD Developer Hackathon on Hugging Face
A Blog post by Ai2 on Hugging Face
A Blog post by Lablab.ai AMD Developer Hackathon on Hugging Face
Been building this for a while. Sharing now because it's past the point where I'm embarrassed by the code. **The stack:** * Python 3.12, ...
Abstract page for arXiv paper 2605.04177: Are LLMs Ready for Conflict Monitoring? Empirical Evidence from West Africa
Hiring: AI industry-savvy outreach / ecosystem operator (contract or freelance) I run a small AI company building proprietary domain-spec...
Abstract page for arXiv paper 2512.22671: Fragile Knowledge, Robust Instruction-Following: The Width Pruning Dichotomy in Llama-3.2
Abstract page for arXiv paper 2505.18244: Emergent Hierarchical Structure in Large Language Models: An Information-Theoretic Framework fo...
A Blog post by ServiceNow-AI on Hugging Face
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime