Open Source AI

Open-weight models, datasets, and frameworks

Top This Week

LLMs

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min ·
LLMs

[2603.24772] Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Validated Dataset

arXiv - Machine Learning · 4 min ·
LLMs

[2603.25325] How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models

arXiv - AI · 4 min ·

All Content

Open Source AI

Breaking Through the "Data Scarcity" Wall: Synthetic Personas Accelerate AI Development in Japan

The article discusses how synthetic personas can help overcome data scarcity in AI development in Japan, showcasing NTT DATA's innovative...

Hugging Face Blog · 2 min ·
Machine Learning

[P] Catalyst N1 & N2: Two open neuromorphic processors with Loihi 1/2 feature parity, 5 neuron models, 85.9% SHD accuracy

The article discusses the development of two open neuromorphic processors, Catalyst N1 and N2, which achieve feature parity with Intel's ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] SoftDTW-CUDA for PyTorch package: fast + memory-efficient Soft Dynamic Time Warping with CUDA support

The SoftDTW-CUDA for PyTorch package offers a fast and memory-efficient implementation of Soft Dynamic Time Warping, optimized for GPU us...

Reddit - Machine Learning · 1 min ·
Open Source AI

For open-source programs, AI coding tools are a mixed blessing | TechCrunch

The article discusses the dual impact of AI coding tools on open-source software, highlighting both the ease of feature development and t...

TechCrunch - AI · 7 min ·
AI Agents

Open-source benchmark EVMbench tests how well AI agents handle smart contract exploits

EVMbench is an open-source benchmark developed by OpenAI and Paradigm to evaluate AI agents' capabilities in handling smart contract secu...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] Which hyperparameters search library to use?

The article discusses various hyperparameter optimization libraries in machine learning, including hyperopt, Optuna, sklearn.GridSearchCV...

Reddit - Machine Learning · 1 min ·
Machine Learning

[p] I Made my first Transformer architecture code

A Reddit user shares their first implementation of a Transformer architecture using PyTorch, detailing the structure and parameters used,...

Reddit - Machine Learning · 1 min ·
Machine Learning

[2509.06085] Software Dependencies 2.0: An Empirical Study of Reuse and Integration of Pre-Trained Models in Open-Source Projects

This article investigates the integration and management of pre-trained models (PTMs) in open-source software projects, introducing the c...

arXiv - AI · 4 min ·
LLMs

[2503.12286] Integrating Chain-of-Thought and Retrieval Augmented Generation Enhances Rare Disease Diagnosis from Clinical Notes

This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...

arXiv - AI · 4 min ·
Generative AI

[2602.05088] VERA-MH: Reliability and Validity of an Open-Source AI Safety Evaluation in Mental Health

The article presents VERA-MH, an open-source evaluation tool designed to assess the safety of AI in mental health contexts, focusing on s...

arXiv - AI · 4 min ·
Machine Learning

[2602.16063] MARLEM: A Multi-Agent Reinforcement Learning Simulation Framework for Implicit Cooperation in Decentralized Local Energy Markets

The paper presents MARLEM, a novel multi-agent reinforcement learning framework designed for studying implicit cooperation in decentraliz...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.16085] Language Statistics and False Belief Reasoning: Evidence from 41 Open-Weight LMs

This article investigates the mental state reasoning of language models (LMs) using 41 open-weight models, revealing insights into their ...

arXiv - AI · 4 min ·
LLMs

[2602.15847] Do Personality Traits Interfere? Geometric Limitations of Steering in Large Language Models

This article explores the geometric limitations of steering personality traits in large language models (LLMs), revealing that traits are...

arXiv - Machine Learning · 3 min ·
LLMs

Mistral CEO: AI could replace more than half of companies’ software

Arthur Mensch sees a major transition under way, with traditional SaaS services being replaced by proprietary AI apps.

AI Tools & Products ·
Open Source AI

[P] Utterance, an open source client-side semantic endpointing SDK for voice apps. We are looking for contributors.

Utterance is an open-source SDK designed to improve voice app interactions by addressing issues with pauses and interruptions, inviting c...

Reddit - Machine Learning · 1 min ·
Open Source AI

One-Shot Any Web App with Gradio's gr.HTML

Gradio's new gr.HTML feature allows users to create interactive web apps using a single Python file, enabling seamless integration of fro...

Hugging Face Blog · 4 min ·
LLMs

[P] I just launched an open-source framework to help researchers *responsibly* and *rigorously* harness frontier LLM coding assistants for rapidly accelerating data analysis. I genuinely think this change the future of science with your help -- it's also kind of terrifying, so let's talk about it!

Brian Heseung Kim introduces an open-source framework designed to help researchers utilize LLM coding assistants for efficient data analy...

Reddit - Machine Learning · 1 min ·
Open Source AI

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

IBM and UC Berkeley explore the failures of enterprise agents in IT automation, utilizing IT-Bench and MAST to diagnose issues and improv...

Hugging Face Blog · 11 min ·
LLMs

Unpopular opinion: OpenAI made OpenClaw viral, then hired its founder, to justify / market their next product

The article presents a speculative theory suggesting that OpenAI engineered the viral success of OpenClaw to promote its own products, ra...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Indian AI lab Sarvam's new models are a major bet on the viability of open-source AI | TechCrunch

Indian AI lab Sarvam launches new large language models, including 30B and 105B parameter models, aiming to challenge foreign AI systems ...

TechCrunch - AI · 5 min ·