Open Source AI

Open weights models, datasets, and frameworks

Top This Week

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
Llms

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min ·
[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset
Llms

[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...

arXiv - Machine Learning · 4 min ·
[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
Llms

[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - AI · 4 min ·

All Content

[2602.22808] MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks
Llms

[2602.22808] MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks

MiroFlow is an innovative open-source agent framework designed to enhance the performance and robustness of large language models in comp...

arXiv - AI · 3 min ·
Ai Infrastructure

I Made a Auto-complete AI form scratch in python and thought it would be funny to use family guy episodes as a database. It was not a good idea.

The article discusses a humorous attempt to create an auto-complete AI using Family Guy episodes as a database, highlighting the unexpect...

Reddit - Artificial Intelligence · 1 min ·
Ai Infrastructure

NXP posts new Linux accelerator driver for their Neutron NPU

NXP has released a new Linux accelerator driver for their Neutron NPU, enhancing support for machine learning applications and improving ...

Reddit - Artificial Intelligence · 1 min ·
This AI Agent Is Designed to Not Go Rogue | WIRED
Ai Agents

This AI Agent Is Designed to Not Go Rogue | WIRED

IronCurtain is an open-source AI assistant designed to enhance security and control over AI agents, preventing them from executing harmfu...

Wired - AI · 7 min ·
Mistral AI inks a deal with global consulting giant Accenture | TechCrunch
Llms

Mistral AI inks a deal with global consulting giant Accenture | TechCrunch

Mistral AI has partnered with Accenture to enhance enterprise AI adoption by leveraging Mistral's AI models, marking a significant collab...

TechCrunch - AI · 3 min ·
Machine Learning

[P] Implementing Better Pytorch Schedulers

This article discusses the limitations of current PyTorch schedulers and introduces a flexible suite for scheduling various optimizer hyp...

Reddit - Machine Learning · 1 min ·
Llms

[P] FP8 inference on Ampere without native hardware support | TinyLlama running on RTX 3050

This article discusses the emulation of FP8 inference on Ampere GPUs, specifically the RTX 3050, using custom Triton kernels to optimize ...

Reddit - Machine Learning · 1 min ·
Mixture of Experts (MoEs) in Transformers
Open Source Ai

Mixture of Experts (MoEs) in Transformers

The article discusses Mixture of Experts (MoEs) in Transformer models, highlighting their efficiency and scalability compared to traditio...

Hugging Face Blog · 10 min ·
Machine Learning

[P] PerpetualBooster v1.9.0 - GBM with no hyperparameter tuning, now with built-in causal ML, drift detection, and conformal prediction

PerpetualBooster v1.9.0 introduces significant enhancements to its gradient boosting machine, including built-in causal ML, drift detecti...

Reddit - Machine Learning · 1 min ·
[2602.21307] SymTorch: A Framework for Symbolic Distillation of Deep Neural Networks
Machine Learning

[2602.21307] SymTorch: A Framework for Symbolic Distillation of Deep Neural Networks

SymTorch is a new library that automates the symbolic distillation of deep neural networks, converting them into interpretable mathematic...

arXiv - Machine Learning · 3 min ·
[2602.00012] OGD4All: A Framework for Accessible Interaction with Geospatial Open Government Data Based on Large Language Models
Llms

[2602.00012] OGD4All: A Framework for Accessible Interaction with Geospatial Open Government Data Based on Large Language Models

The OGD4All framework enhances citizen interaction with geospatial Open Government Data using Large Language Models, achieving high accur...

arXiv - Machine Learning · 3 min ·
[2508.21112] EO-1: An Open Unified Embodied Foundation Model for General Robot Control
Llms

[2508.21112] EO-1: An Open Unified Embodied Foundation Model for General Robot Control

The EO-1 model is introduced as a unified foundation for general robot control, enhancing multimodal reasoning through a large dataset an...

arXiv - AI · 4 min ·
[2602.21845] xai-cola: A Python library for sparsifying counterfactual explanations
Ai Infrastructure

[2602.21845] xai-cola: A Python library for sparsifying counterfactual explanations

The article introduces xai-cola, an open-source Python library designed to sparsify counterfactual explanations, enhancing interpretabili...

arXiv - Machine Learning · 3 min ·
[2602.21374] Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages
Llms

[2602.21374] Small Language Models for Privacy-Preserving Clinical Information Extraction in Low-Resource Languages

This study explores the use of small language models for extracting clinical information from low-resource languages, focusing on a priva...

arXiv - Machine Learning · 4 min ·
[2602.21233] AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression
Machine Learning

[2602.21233] AngelSlim: A more accessible, comprehensive, and efficient toolkit for large model compression

AngelSlim introduces a versatile toolkit for large model compression, integrating advanced algorithms for efficient deployment and improv...

arXiv - Machine Learning · 4 min ·
Machine Learning

[P] Reproducing Google’s Nested Learning / HOPE in PyTorch (mechanism-faithful implementation + reproducible tooling and library)

This article discusses the reproduction of Google's Nested Learning/HOPE framework in PyTorch, addressing the lack of available code and ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] A lightweight FoundationPose TensorRT implementation

This article discusses a lightweight TensorRT implementation of FoundationPose, aimed at improving robotics research by eliminating the h...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] ML Engineers — How did you actually learn PyTorch? I keep forgetting everything.

A Reddit discussion among ML engineers on effective strategies for learning PyTorch, addressing common challenges like forgetting concept...

Reddit - Machine Learning · 1 min ·
OpenClaw Users Are Allegedly Bypassing Anti-Bot Systems | WIRED
Ai Agents

OpenClaw Users Are Allegedly Bypassing Anti-Bot Systems | WIRED

The article discusses how users of the OpenClaw AI tool are leveraging an open-source project called Scrapling to bypass anti-bot systems...

Wired - AI · 6 min ·
Machine Learning

[Project] Sovereign Mohawk: Formally Verified Federated Learning at 10M-Node Scale (O(n log n) & Byzantine Tolerant)

Sovereign Mohawk is a Go-based runtime for federated learning that addresses scaling and trust issues, achieving empirical validation for...

Reddit - Machine Learning · 1 min ·
Previous Page 4 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime