Open Source AI

Open weights models, datasets, and frameworks

Top This Week

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
Llms

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min ·
[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset
Llms

[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...

arXiv - Machine Learning · 4 min ·
[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
Llms

[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - AI · 4 min ·

All Content

Machine Learning

[Project] Sovereign Mohawk: Formally Verified Federated Learning at 10M-Node Scale (O(n log n) & Byzantine Tolerant)

Sovereign Mohawk is a Go-based runtime for federated learning that addresses scaling and trust issues, achieving empirical validation for...

Reddit - Machine Learning · 1 min ·
[2601.10611] Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding
Llms

[2601.10611] Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Molmo2 introduces a new family of open-weight vision-language models that excel in video understanding and grounding, featuring innovativ...

arXiv - AI · 4 min ·
[2602.21033] MIP Candy: A Modular PyTorch Framework for Medical Image Processing
Machine Learning

[2602.21033] MIP Candy: A Modular PyTorch Framework for Medical Image Processing

MIP Candy is a modular framework built on PyTorch for medical image processing, offering a flexible pipeline for data handling, training,...

arXiv - Machine Learning · 4 min ·
[2602.20810] POMDPPlanners: Open-Source Package for POMDP Planning
Ai Startups

[2602.20810] POMDPPlanners: Open-Source Package for POMDP Planning

POMDPPlanners is an open-source Python package designed for the empirical evaluation of POMDP planning algorithms, integrating advanced f...

arXiv - AI · 3 min ·
Meet Buzzdetect: An Open-Source AI Tool for Listening to Pollinators
Open Source Ai

Meet Buzzdetect: An Open-Source AI Tool for Listening to Pollinators

Buzzdetect is an open-source AI tool that uses machine learning and microphones to monitor pollinator activity in real-time, providing a ...

AI News - General · 11 min ·
OpenAI defeats xAI’s trade secrets lawsuit | The Verge
Ai Startups

OpenAI defeats xAI’s trade secrets lawsuit | The Verge

OpenAI successfully dismissed xAI's trade secrets lawsuit, with the court ruling that xAI failed to demonstrate any misconduct by OpenAI ...

The Verge - AI · 4 min ·
Spanish 'soonicorn' Multiverse Computing releases free compressed AI model | TechCrunch
Llms

Spanish 'soonicorn' Multiverse Computing releases free compressed AI model | TechCrunch

Spanish startup Multiverse Computing has launched a free compressed version of its HyperNova 60B AI model, claiming it outperforms Mistra...

TechCrunch - AI · 5 min ·
Machine Learning

[P] mlx-onnx: Run your MLX models in the browser using ONNX / WebGPU

The article discusses mlx-onnx, a tool that converts MLX models into ONNX format for execution in web browsers using WebGPU, targeting de...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Would a p2p distributed AI model be possible?

The article explores the potential for a peer-to-peer (P2P) distributed AI model, emphasizing a decentralized approach that relies on ver...

Reddit - Machine Learning · 1 min ·
Ai Startups

Looking for Coding buddies

A Reddit user is seeking programming buddies to collaborate with on coding projects, inviting all types of programmers to join the initia...

Reddit - ML Jobs · 1 min ·
New Relic launches new AI agent platform and OpenTelemetry tools | TechCrunch
Ai Agents

New Relic launches new AI agent platform and OpenTelemetry tools | TechCrunch

New Relic has launched an AI agent platform and enhanced OpenTelemetry tools to improve data observability for enterprises, allowing bett...

TechCrunch - AI · 5 min ·
Llms

[P] A minimalist implementation for Recursive Language Models

This article introduces a minimalist implementation of Recursive Language Models (RLMs), providing a tutorial and open-source code reposi...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] Whisper Accent — Accent-Aware English Speech Recognition

Whisper-Accent is a project aimed at enhancing Whisper's performance in recognizing accented English speech, providing tools for research...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Papers with no code

The discussion highlights concerns over the prevalence of academic papers in machine learning that lack accompanying code, questioning th...

Reddit - Machine Learning · 1 min ·
[2601.16449] Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding
Llms

[2601.16449] Emotion-LLaMAv2 and MMEVerse: A New Framework and Benchmark for Multimodal Emotion Understanding

The paper introduces Emotion-LLaMAv2 and MMEVerse, a new framework and benchmark aimed at enhancing multimodal emotion understanding thro...

arXiv - AI · 4 min ·
[2512.09730] Interpreto: An Explainability Library for Transformers
Llms

[2512.09730] Interpreto: An Explainability Library for Transformers

Interpreto is an open-source library designed for interpreting HuggingFace transformers, offering both attribution methods and concept-ba...

arXiv - Machine Learning · 3 min ·
[2602.20130] To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering
Llms

[2602.20130] To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering

The paper presents Selective Chain-of-Thought (Selective CoT), a method to enhance medical question answering efficiency using large lang...

arXiv - AI · 4 min ·
[2502.05795] The Curse of Depth in Large Language Models
Llms

[2502.05795] The Curse of Depth in Large Language Models

This paper introduces the 'Curse of Depth' in Large Language Models (LLMs), revealing that many deep layers are ineffective due to Pre-La...

arXiv - AI · 4 min ·
[2602.19818] SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models
Open Source Ai

[2602.19818] SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models

The paper presents SafePickle, a machine-learning-based scanner designed to detect malicious Pickle-based ML models, achieving a high F1-...

arXiv - AI · 4 min ·
[2602.19762] Hexagon-MLIR: An AI Compilation Stack For Qualcomm's Neural Processing Units (NPUs)
Machine Learning

[2602.19762] Hexagon-MLIR: An AI Compilation Stack For Qualcomm's Neural Processing Units (NPUs)

Hexagon-MLIR presents an open-source compilation stack designed for Qualcomm's NPUs, enhancing AI workload performance by optimizing Trit...

arXiv - AI · 4 min ·
Previous Page 5 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime