[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
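The title points at classical signal detection theory, where metacognitive efficiency is commonly quantified as the ratio meta-d′/d′. As illustrative background only (not code from the paper; function names are hypothetical), the Type-1 ingredients are standard: sensitivity d′ = z(H) − z(F) and criterion c = −(z(H) + z(F))/2, with z the inverse normal CDF, H the hit rate, and F the false-alarm rate.

```python
from statistics import NormalDist

def d_prime(hit_rate: float, fa_rate: float) -> float:
    """Type-1 sensitivity: d' = z(hit rate) - z(false-alarm rate)."""
    z = NormalDist().inv_cdf
    return z(hit_rate) - z(fa_rate)

def type1_criterion(hit_rate: float, fa_rate: float) -> float:
    """Decision criterion: c = -(z(H) + z(F)) / 2 (0 means unbiased)."""
    z = NormalDist().inv_cdf
    return -(hit_rate and z(hit_rate) + z(fa_rate)) / 2
```

For example, an unbiased observer with H ≈ 0.841 and F ≈ 0.159 has d′ = 2 and c = 0. Estimating meta-d′ itself requires fitting confidence-rating ROC curves and is substantially more involved than this sketch.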
Open weights models, datasets, and frameworks
Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...
Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
MiroFlow is an innovative open-source agent framework designed to enhance the performance and robustness of large language models in comp...
The article discusses a humorous attempt to create an auto-complete AI using Family Guy episodes as a database, highlighting the unexpect...
NXP has released a new Linux accelerator driver for their Neutron NPU, enhancing support for machine learning applications and improving ...
IronCurtain is an open-source AI assistant designed to enhance security and control over AI agents, preventing them from executing harmfu...
Mistral AI has partnered with Accenture to enhance enterprise AI adoption by leveraging Mistral's AI models, marking a significant collab...
This article discusses the limitations of current PyTorch schedulers and introduces a flexible suite for scheduling various optimizer hyp...
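As a reference point for the kind of hyperparameter curve such a scheduler suite generalizes, a common baseline is cosine decay with linear warmup. This is a stdlib-only sketch, not the article's API; the function name and signature are hypothetical.

```python
import math

def lr_at(step: int, total_steps: int, warmup_steps: int,
          base_lr: float, min_lr: float = 0.0) -> float:
    """Cosine decay with linear warmup: ramp to base_lr over
    warmup_steps, then follow a half-cosine down to min_lr."""
    if step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps
    t = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (base_lr - min_lr) * (1 + math.cos(math.pi * t))
```

The same shape applies to any scheduled hyperparameter (weight decay, momentum, dropout), which is the generalization the article's suite targets.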
This article discusses the emulation of FP8 inference on Ampere GPUs, specifically the RTX 3050, using custom Triton kernels to optimize ...
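At its core, FP8 emulation on hardware without native FP8 support means rounding values onto the FP8 grid before computing in a wider type. This toy round-to-nearest e4m3 quantizer is illustrative only (it is not the article's Triton kernels, and it skips NaN handling); the max-finite value of 448 and minimum normal exponent of −6 follow the common e4m3 convention.

```python
import math

E4M3_MAX = 448.0  # largest finite e4m3 value under the common convention

def quantize_e4m3(x: float) -> float:
    """Round x to the nearest e4m3-representable value (simplified:
    saturates at +/-448, clamps the exponent at -6 so values below
    the normal range round onto the subnormal grid, no NaN handling)."""
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), E4M3_MAX)    # saturate at the format's max
    exp = math.floor(math.log2(mag))
    exp = max(exp, -6)             # smallest normal exponent is -6
    scale = 2.0 ** (exp - 3)       # 3 mantissa bits -> step = 2^(exp-3)
    q = round(mag / scale) * scale
    return sign * min(q, E4M3_MAX)
```

For instance, 0.3 rounds to 0.3125, the nearest point on the e4m3 grid, which is exactly the precision loss FP8 inference trades for bandwidth.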
The article discusses Mixture of Experts (MoEs) in Transformer models, highlighting their efficiency and scalability compared to traditio...
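The efficiency claim for MoE layers rests on sparse routing: a gate scores every expert but only the top-k actually run per token. A minimal stdlib sketch of that routing step, assuming a linear gate and list-based vectors (illustrative only; all names are hypothetical and real implementations batch this on GPU):

```python
import math

def softmax(xs):
    m = max(xs)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(x, experts, gate_weights, k=2):
    """Score experts with a linear gate, run only the top-k,
    and mix their outputs with renormalized gate probabilities."""
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in gate_weights]
    probs = softmax(scores)
    topk = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    z = sum(probs[i] for i in topk)   # renormalize over selected experts
    out = [0.0] * len(x)
    for i in topk:
        y = experts[i](x)             # only k experts execute
        w = probs[i] / z
        out = [o + w * yi for o, yi in zip(out, y)]
    return out
```

Compute scales with k, not with the total expert count, which is why MoE models grow parameters far faster than per-token FLOPs.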
PerpetualBooster v1.9.0 introduces significant enhancements to its gradient boosting machine, including built-in causal ML, drift detecti...
SymTorch is a new library that automates the symbolic distillation of deep neural networks, converting them into interpretable mathematic...
The OGD4All framework enhances citizen interaction with geospatial Open Government Data using Large Language Models, achieving high accur...
The EO-1 model is introduced as a unified foundation for general robot control, enhancing multimodal reasoning through a large dataset an...
The article introduces xai-cola, an open-source Python library designed to sparsify counterfactual explanations, enhancing interpretabili...
This study explores the use of small language models for extracting clinical information from low-resource languages, focusing on a priva...
AngelSlim introduces a versatile toolkit for large model compression, integrating advanced algorithms for efficient deployment and improv...
This article discusses the reproduction of Google's Nested Learning/HOPE framework in PyTorch, addressing the lack of available code and ...
This article discusses a lightweight TensorRT implementation of FoundationPose, aimed at improving robotics research by eliminating the h...
A Reddit discussion among ML engineers on effective strategies for learning PyTorch, addressing common challenges like forgetting concept...
The article discusses how users of the OpenClaw AI tool are leveraging an open-source project called Scrapling to bypass anti-bot systems...
Sovereign Mohawk is a Go-based runtime for federated learning that addresses scaling and trust issues, achieving empirical validation for...