Open Source AI

Open weights models, datasets, and frameworks

Top This Week

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
LLMs

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min
[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset
LLMs

Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...

arXiv - Machine Learning · 4 min
[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
LLMs

Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - Machine Learning · 4 min

All Content

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
LLMs

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min
[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset
LLMs

Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...

arXiv - Machine Learning · 4 min
[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
LLMs

Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - Machine Learning · 4 min
[D] Why evaluating only final outputs is misleading for local LLM agents
LLMs

Been running local agents with Ollama + LangChain lately and noticed something kind of uncomfortable — you can get a completely correct f...

Reddit - Machine Learning · 1 min
Mistral releases a new open-source model for speech generation | TechCrunch
LLMs

Mistral's new speech model can run on a smartwatch or a smartphone.

TechCrunch - AI · 4 min
[2410.12164] Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning
LLMs

Abstract page for arXiv paper 2410.12164: Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator...

arXiv - Machine Learning · 4 min
[2603.23308] Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression
LLMs

Abstract page for arXiv paper 2603.23308: Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constraine...

arXiv - AI · 4 min
[2603.22287] Founder effects shape the evolutionary dynamics of multimodality in open LLM families
LLMs

Abstract page for arXiv paper 2603.22287: Founder effects shape the evolutionary dynamics of multimodality in open LLM families

arXiv - AI · 4 min
[2603.22339] Problems with Chinchilla Approach 2: Systematic Biases in IsoFLOP Parabola Fits
LLMs

Abstract page for arXiv paper 2603.22339: Problems with Chinchilla Approach 2: Systematic Biases in IsoFLOP Parabola Fits

arXiv - Machine Learning · 4 min
[2603.17074] PRISM: Demystifying Retention and Interaction in Mid-Training
LLMs

Abstract page for arXiv paper 2603.17074: PRISM: Demystifying Retention and Interaction in Mid-Training

arXiv - Machine Learning · 4 min
[2603.20854] SozKZ: Training Efficient Small Language Models for Kazakh from Scratch
LLMs

Abstract page for arXiv paper 2603.20854: SozKZ: Training Efficient Small Language Models for Kazakh from Scratch

arXiv - AI · 3 min
[2603.20531] Epistemic Observability in Language Models
LLMs

Abstract page for arXiv paper 2603.20531: Epistemic Observability in Language Models

arXiv - Machine Learning · 4 min
[2603.20514] Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study
LLMs

Abstract page for arXiv paper 2603.20514: Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Sett...

arXiv - AI · 3 min
A New Framework for Evaluation of Voice Agents (EVA)
Open Source AI

A blog post by ServiceNow-AI on Hugging Face

Hugging Face Blog · 11 min
Xiaomi's MiMo models are making the AI pricing conversation uncomfortable
LLMs

MiMo-V2-Flash is open source, scores 73.4% on SWE-Bench (#1 among open source models), and costs $0.10 per million input tokens. That's c...

Reddit - Artificial Intelligence · 1 min
[2507.18014] Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models
LLMs

Abstract page for arXiv paper 2507.18014: Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

arXiv - Machine Learning · 3 min
[2603.19265] When the Pure Reasoner Meets the Impossible Object: Analytic vs. Synthetic Fine-Tuning and the Suppression of Genesis in Language Models
LLMs

Abstract page for arXiv paper 2603.19265: When the Pure Reasoner Meets the Impossible Object: Analytic vs. Synthetic Fine-Tuning and the ...

arXiv - AI · 4 min
[2603.19253] A comprehensive study of LLM-based argument classification: from Llama through DeepSeek to GPT-5.2
LLMs

Abstract page for arXiv paper 2603.19253: A comprehensive study of LLM-based argument classification: from Llama through DeepSeek to GPT-5.2

arXiv - AI · 4 min
[P] Inferencing Llama3.2-1B-Instruct on 3xMac Minis M4 with Data Parallelism using allToall architecture! | smolcluster
LLMs

Here's another sneak-peek into inference of Llama3.2-1B-Instruct model, on 3xMac Mini 16 gigs each M4 with smolcluster! Today's the demo ...

Reddit - Machine Learning · 1 min
I am a painter with work at MoMA and the Met. I just published 50 years of my work as an open AI dataset. Here is what I learned.
Open Source AI

I am a painter with work at MoMA and the Met. I just published 50 years of my work as an open AI dataset. Here is what I learned. I have ...

Reddit - Artificial Intelligence · 1 min

