[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
Abstract page for arXiv paper 2603.24772: Evaluating Fine-Tuned LLM Model For Medical Transcription With Small Low-Resource Languages Val...
Abstract page for arXiv paper 2603.25325: How Pruning Reshapes Features: Sparse Autoencoder Analysis of Weight-Pruned Language Models
The article discusses how synthetic personas can help overcome data scarcity in AI development in Japan, showcasing NTT DATA's innovative...
The article discusses the development of two open neuromorphic processors, Catalyst N1 and N2, which achieve feature parity with Intel's ...
The SoftDTW-CUDA for PyTorch package offers a fast and memory-efficient implementation of Soft Dynamic Time Warping, optimized for GPU us...
The article discusses the dual impact of AI coding tools on open-source software, highlighting both the ease of feature development and t...
EVMbench is an open-source benchmark developed by OpenAI and Paradigm to evaluate AI agents' capabilities in handling smart contract secu...
The article discusses various hyperparameter optimization libraries in machine learning, including hyperopt, Optuna, sklearn.GridSearchCV...
A Reddit user shares their first implementation of a Transformer architecture using PyTorch, detailing the structure and parameters used,...
This article investigates the integration and management of pre-trained models (PTMs) in open-source software projects, introducing the c...
This article presents a novel approach combining Chain-of-Thought (CoT) and Retrieval Augmented Generation (RAG) to improve rare disease ...
The article presents VERA-MH, an open-source evaluation tool designed to assess the safety of AI in mental health contexts, focusing on s...
The paper presents MARLEM, a novel multi-agent reinforcement learning framework designed for studying implicit cooperation in decentraliz...
This article investigates the mental state reasoning of language models (LMs) using 41 open-weight models, revealing insights into their ...
This article explores the geometric limitations of steering personality traits in large language models (LLMs), revealing that traits are...
Arthur Mensch sees a major transition under way, with traditional SaaS services being replaced by proprietary AI apps.
Utterance is an open-source SDK designed to improve voice app interactions by addressing issues with pauses and interruptions, inviting c...
Gradio's new gr.HTML feature allows users to create interactive web apps using a single Python file, enabling seamless integration of fro...
Brian Heseung Kim introduces an open-source framework designed to help researchers utilize LLM coding assistants for efficient data analy...
IBM and UC Berkeley explore the failures of enterprise agents in IT automation, utilizing IT-Bench and MAST to diagnose issues and improv...
The article presents a speculative theory suggesting that OpenAI engineered the viral success of OpenClaw to promote its own products, ra...
Indian AI lab Sarvam launches new large language models, including 30B and 105B parameter models, aiming to challenge foreign AI systems ...