Top Open Source AI This Month

The most engaging open source AI content from this month, curated by AI News.

  1.

    [2601.17172] Who Gets Which Message? Auditing Demographic Bias in LLM-Generated Targeted Text

    arXiv - Machine Learning · 27 days ago
  2.

    Granite 4.1 LLMs: How They’re Built

    A blog post by IBM Granite on Hugging Face

    Hugging Face Blog · 12 days ago
  3.

    [2605.07731] Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs

    arXiv - AI · about 9 hours ago
  4.

    [P] QLoRA Fine-Tuning of Qwen2.5-1.5B for CEFR English Proficiency Classification (A1–C2)

    I fine-tuned Qwen2.5-1.5B for multi-class CEFR English proficiency classification using QLoRA (4-bit NF4). The goal was to classify English text into one of the 6 CEFR levels (A1 → C2), which can b...

    Reddit - Machine Learning · 7 days ago
  5.

    The Scaling Bandaid is Wearing Thin (And Nobody Wants to Admit It)

    Let me be direct: we’ve hit a wall with scaling, and the entire field is kind of bullshitting about what comes next. I’ve spent enough time in research circles to know this isn’t controversial, peo...

    Reddit - Artificial Intelligence · 9 days ago
  6.

    Locally running Mistral on an i7 from 2017 so I don't waste water or ram

    submitted by /u/Heavy-Factor-1919

    Reddit - Artificial Intelligence · 1 day ago
  7.

    How much are you actually spending on AI APIs? I built an OpenSource router to cut that.

    I've been working on Manifest, an open-source AI cost optimization tool. The idea is simple: instead of sending every request to the same expensive model, it routes each one to the cheapest model t...

    Reddit - Artificial Intelligence · 27 days ago
  8.

    How to Use Transformers.js in a Chrome Extension

    We’re on a journey to advance and democratize artificial intelligence through open source and open science.

    Hugging Face Blog · 19 days ago
  9.

    [2604.12168] Fully Homomorphic Encryption on Llama 3 model for privacy preserving LLM inference

    arXiv - AI · 26 days ago
  10.

    [2605.02069] Pair2Score: Pairwise-to-Absolute Transfer for LLM-Based Essay Scoring

    arXiv - AI · 6 days ago
  11.

    Free LLM security audit

    I built Arc Sentry, a pre-generation guardrail for open source LLMs that blocks prompt injection before the model generates a response. It works on Mistral, Qwen, and Llama by reading the residual ...

    Reddit - Artificial Intelligence · 27 days ago
  12.

    LLM Guard scored 0/8 detecting a Crescendo multi-turn attack. Arc Sentry flagged it at Turn 3.

    Crescendo (Russinovich et al., USENIX Security 2025) is a multi-turn jailbreak that starts with innocent questions and gradually steers a model toward harmful output. It’s specifically designed to ...

    Reddit - Artificial Intelligence · 27 days ago
  13.

    Asserting American Leadership in Open Source AI

    Policymakers should protect and promote American open source development, taking steps to protect open source tools from undue restraints while also promoting open source use and adoption.

    AI Tools & Products · 12 days ago
  14.

    Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

    Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try random demos. Most lists I saw were either outdated, full of affi...

    Reddit - Artificial Intelligence · about 1 month ago
  15.

    [D] Runtime layer on Hugging Face Transformers (no source changes)

    I’ve been experimenting with a runtime-layer approach to augmenting existing ML systems without modifying their source code. As a test case, I took modeling_utils.py from Transformers (v5.5.0), kep...

    Reddit - Machine Learning · about 1 month ago
  16.

    Meet HoloTab by HCompany. Your AI browser companion.

    A blog post by H Company on Hugging Face

    Hugging Face Blog · 26 days ago
  17.

    [2604.15409] The Illusion of Equivalence: Systematic FP16 Divergence in KV-Cached Autoregressive Inference

    arXiv - AI · 21 days ago
  18.

    [2604.18655] Unlocking the Edge deployment and ondevice acceleration of multi-LoRA enabled one-for-all foundational LLM

    arXiv - AI · 19 days ago
  19.

    Built GPT-2, Llama 3, and DeepSeek from scratch in PyTorch - open source code + book [p]

    I spent the past year implementing five LLM architectures from scratch in PyTorch and wrote a book documenting the process. What's covered: Vanilla encoder-decoder transformer (English to Hindi tra...

    Reddit - Machine Learning · 26 days ago
  20.

    [2604.09617] AdaQE-CG: Adaptive Query Expansion for Web-Scale Generative AI Model and Data Card Generation

    arXiv - AI · 27 days ago
  21.

    [2604.08844] Spectral Geometry of LoRA Adapters Encodes Training Objective and Predicts Harmful Compliance

    arXiv - Machine Learning · 28 days ago
  22.

    [2604.09034] The nextAI Solution to the NeurIPS 2023 LLM Efficiency Challenge

    arXiv - Machine Learning · 28 days ago
  23.

    [2604.08561] A Representation-Level Assessment of Bias Mitigation in Foundation Models

    arXiv - Machine Learning · 28 days ago
  24.

    [2509.21623] OjaKV: Context-Aware Online Low-Rank KV Cache Compression

    arXiv - AI · 21 days ago
  25.

    [2604.10508] How Many Tries Does It Take? Iterative Self-Repair in LLM Code Generation Across Model Scales and Benchmarks

    arXiv - AI · 27 days ago
  26.

    Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

    A blog post by IBM Research on Hugging Face

    Hugging Face Blog · 26 days ago
  27.

    Gemma 4 actually running usable on an Android phone (not llama.cpp)

    I wanted a real local assistant on my phone, not a demo. First tried the usual llama.cpp in Termux — Gemma 4 was 2–3 tok/s and the phone was on fire. Then I switched to Google’s LiteRT setup, got G...

    Reddit - Artificial Intelligence · 23 days ago
  28.

    [2604.11050] Shared Emotion Geometry Across Small Language Models: A Cross-Architecture Study of Representation, Behavior, and Methodological Confounds

    arXiv - AI · 27 days ago
  29.

    [2409.02231] SmileyLlama: Modifying Large Language Models for Directed Chemical Space Exploration

    arXiv - Machine Learning · 27 days ago
  30.

    LLM rankings are not a ladder: experimental results from a transitive benchmark graph [D]

    I built a small website called LLM Win: https://llm-win.com. It turns LLM benchmark results into a directed graph: If model A beats model B on benchmark X, add an edge A -> B. Then it search...

    Reddit - Machine Learning · 2 days ago
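
The edge rule quoted in item 30 ("if model A beats model B on benchmark X, add an edge A -> B") can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the LLM Win site's code: the scores, model names, and both function names are hypothetical, and because the summary is cut off before it says what the site searches for, the `two_way_pairs` helper is an invented example of one thing such a graph can reveal.

```python
from collections import defaultdict
from itertools import permutations

# Hypothetical scores (not real results): benchmark -> {model: score}, higher is better.
SCORES = {
    "bench_x": {"model_a": 0.81, "model_b": 0.74, "model_c": 0.69},
    "bench_y": {"model_a": 0.55, "model_b": 0.62, "model_c": 0.58},
}


def build_win_graph(scores):
    """Return {winner: {loser: [benchmarks]}}, with an edge A -> B whenever A outscores B."""
    graph = defaultdict(lambda: defaultdict(list))
    for benchmark, results in scores.items():
        # Compare every ordered pair of models on this benchmark.
        for a, b in permutations(results, 2):
            if results[a] > results[b]:
                graph[a][b].append(benchmark)
    return graph


def two_way_pairs(graph):
    """Pairs of models that each beat the other on some benchmark (illustrative assumption)."""
    pairs = set()
    for a, losers in graph.items():
        for b in losers:
            # .get avoids creating new entries in the defaultdict while iterating.
            if a in graph.get(b, {}):
                pairs.add(tuple(sorted((a, b))))
    return sorted(pairs)


graph = build_win_graph(SCORES)
pairs = two_way_pairs(graph)
print(pairs)
```

With these made-up numbers, `model_a` wins on `bench_x` but loses to both other models on `bench_y`, so edges run in both directions between some pairs. That is one way a set of benchmark results can fail to form a single ordered ladder.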
