Open Source AI

Open-weight models, datasets, and frameworks

Top This Week

LLMs

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min ·
LLMs

[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

arXiv - Machine Learning · 4 min ·
LLMs

[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - AI · 4 min ·

All Content

Machine Learning

[P] AI Learns to play Street Fighter 6

This article details the process of training an AI to play Street Fighter 6 using imitation learning, showcasing both the gameplay and te...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] I built an AI that teaches itself to play Mario from scratch using Python — it starts knowing absolutely nothing

An AI bot learns to play Mario from scratch using reinforcement learning, starting with no prior knowledge and improving through trial an...

Reddit - Machine Learning · 1 min ·
LLMs

[D] Do we expect any future for home-rolled language models, or will it all be dominated by the big labs?

The discussion explores the future of home-rolled language models versus those developed by large labs, emphasizing the potential for ope...

Reddit - Machine Learning · 1 min ·
LLMs

Ollama 0.17 released with improved OpenClaw onboarding

Ollama 0.17 has been released, featuring enhancements to the OpenClaw onboarding process, aimed at improving user experience and accessib...

Reddit - Artificial Intelligence · 1 min ·
AI Agents

[P] Designing an on-device contextual intelligence engine for Android

The article discusses the potential for developing an on-device contextual intelligence engine for Android, inspired by Apple's intellige...

Reddit - Machine Learning · 1 min ·
LLMs

[D] antaris-suite 3.0 (open source, free) — zero-dependency agent memory, guard, routing, and context management (benchmarks + 3-model code review inside)

Antaris-suite 3.0 is an open-source tool designed for AI agents, offering zero-dependency memory management, routing, and context handlin...

Reddit - Machine Learning · 1 min ·
LLMs

[P] I built an LLM gateway in Rust because I was tired of API failures

The article discusses the creation of Sentinel, an open-source LLM gateway built in Rust to address common issues faced with LLMs in prod...

Reddit - Machine Learning · 1 min ·
AI Startups

optimize_anything: one API to optimize code, prompts, agents, configs — if you can measure it, you can optimize it

The article introduces 'optimize_anything', an open-source API designed to optimize various text artifacts, including code and prompts, b...

Reddit - Artificial Intelligence · 1 min ·
AI Agents

Show HN: Vipune – Simple Memory for Agents

Vipune is a minimal memory layer for AI agents that allows for semantic memory storage and retrieval without requiring network dependenci...

Hacker News - AI · 5 min ·
LLMs

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

GGML and its project llama.cpp join Hugging Face to enhance local AI development, ensuring continued open-source progress and community s...

Hugging Face Blog · 3 min ·
Machine Learning

[D] My Gradio app wouldn’t run… the reason was embarrassingly small

A Reddit user shares their frustrating experience of troubleshooting a Gradio app, only to discover that a missing line in the .env file ...

Reddit - Machine Learning · 1 min ·
LLMs

[P] Open source LLM gateway in Rust looking for feedback and contributors

The article introduces Sentinel, an open-source LLM gateway in Rust designed to streamline interactions with multiple LLM APIs, focusing ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[2602.17543] genriesz: A Python Package for Automatic Debiased Machine Learning with Generalized Riesz Regression

The article presents 'genriesz', an open-source Python package designed for automatic debiased machine learning using generalized Riesz r...

arXiv - Machine Learning · 4 min ·
AI Infrastructure

[2602.17206] SoftDTW-CUDA-Torch: Memory-Efficient GPU-Accelerated Soft Dynamic Time Warping for PyTorch

The paper presents SoftDTW-CUDA-Torch, an open-source PyTorch library that enhances Soft Dynamic Time Warping (SoftDTW) by improving memo...

arXiv - Machine Learning · 3 min ·
Robotics

[2602.11337] MolmoSpaces: A Large-Scale Open Ecosystem for Robot Navigation and Manipulation

MolmoSpaces introduces a large-scale open ecosystem designed for benchmarking robot navigation and manipulation, featuring over 230k dive...

arXiv - AI · 4 min ·
NLP

I built a free local AI image search app — find images by typing what's in them

Makimus-AI is a free, open-source local app that enables users to search their image libraries using natural language queries, functionin...

Reddit - Artificial Intelligence · 1 min ·
LLMs

Knowledge graph of the transformer paper lineage — from Attention Is All You Need to DPO, mapped as an interactive concept graph [generated from a CLI + 12 PDFs]

This article presents an interactive knowledge graph mapping the lineage of transformer papers, illustrating the connections between key ...

Reddit - Artificial Intelligence · 1 min ·
Data Science

[P] V2 of a PaperWithCode alternative - Wizwand

Wizwand, an alternative to Papers with Code, has launched its second version, addressing dataset inconsistencies and improving leaderboard a...

Reddit - Machine Learning · 1 min ·
Open Source AI

Train AI models with Unsloth and Hugging Face Jobs for FREE

This article discusses how to train AI models using Unsloth and Hugging Face Jobs, highlighting the benefits of faster training and lower...

Hugging Face Blog · 5 min ·
Machine Learning

[P] Open Source Fraud Detection System handling 0.17% class imbalance with Random Forest

This article discusses the development of an open-source credit card fraud detection system utilizing Random Forest to address class imba...

Reddit - Machine Learning · 1 min ·