Transformer Math Explorer [P]
This is an interactive math reference for transformer models, presented via dataflow graphs, all the way down to elementary math. Covers ...
ML algorithms, training, and inference
For most people I've talked to, it's embarrassingly high. New machine? Set up CUDA again. New team member? Good luck reproducing the ...
Hi! I am trying to sanity-check an assumption for diffusion video generation reproducibility. Suppose I run the same video diffusion mode...
Hi, rebuttals recently finished, and I wanted to share my paper's scores to ask for thoughts on this, and whether this situation is borde...
Been working on this for a bit and figured it was ready to share. KIV (K-Indexed V Materialization) is a middleware layer that replaces t...
I put together a small educational repo that implements distributed training parallelism from scratch in PyTorch: https://github.com/shre...
AMD’s AI director just analyzed 6,852 Claude Code sessions, 234,760 tool calls, and 17,871 thinking blocks. Her conclusion: “Claude canno...
Code of Project: https://github.com/paulo101977/notebooks-rl/tree/main/re_requiem I’ve been working on training an agent to play a segmen...
Google's TurboQuant claims to compress the KV cache by up to 6x with 'little apparent loss in accuracy' by reconstructing it on the fly. ...
Saw this on X. I too am struggling with the term "post-agentic AI"; just posting here for further discussion.
Hello, I’ve been seeing discussions about “Mythos AI” showing behaviors that seem far beyond simple text prediction—like accessing inform...
Right now people are experiencing shallow depth, token limits and diluted intelligence from frontier models. I'm inviting people to exper...
If a term can mean anything from "passed a Turing test" to "achieved consciousness", it's not a spectrum - it's a category error. Current...
I have an average of 3.5. One of the reviewers gave us a 2 by bringing up a new issue he hadn't mentioned in his initial review, taking th...
Asking from a technical standpoint because I feel like the term is doing a lot of work in coverage of this space right now. Genuine real-...
I’ve been experimenting with a runtime-layer approach to augmenting existing ML systems without modifying their source code. As a test ca...
I am aware of an organization that evaluates proposals by feeding them into a public version of AI. Is there a way to make that AI rate m...
Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try rando...