Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

Transformer Math Explorer [P]

This is an interactive math reference for transformer models, presented via dataflow graphs, all the way down to elementary math. Covers ...

Reddit - Machine Learning · 1 min ·
Machine Learning

how much of your time goes into environment setup vs actual model work?

For most people I've talked to, it's embarrassingly high. New machine? Set up CUDA again. New team member? Good luck for reproducing the ...

Reddit - ML Jobs · 1 min ·
Machine Learning

How much can a video generated by the same diffusion model differ across GPU architectures if the initial noise latent is fixed? [D]

Hi! I am trying to sanity-check an assumption for diffusion video generation reproducibility. Suppose I run the same video diffusion mode...

Reddit - Machine Learning · 1 min ·

All Content

Machine Learning

So Confused about Polarizing ICML Reviews [D]

Hi, rebuttals recently finished, and I wanted to share my paper's scores to ask for thoughts on this, and whether this situation is borde...

Reddit - Machine Learning · 1 min ·
Machine Learning

KIV: 1M token context window on a RTX 4070 (12GB VRAM), no retraining, drop-in HuggingFace cache replacement - Works with any model that uses DynamicCache [P]

Been working on this for a bit and figured it was ready to share. KIV (K-Indexed V Materialization) is a middleware layer that replaces t...

Reddit - Machine Learning · 1 min ·
Machine Learning

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP

I put together a small educational repo that implements distributed training parallelism from scratch in PyTorch: https://github.com/shre...

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude cannot be trusted to perform complex engineering tasks

AMD’s AI director just analyzed 6,852 Claude Code sessions, 234,760 tool calls, and 17,871 thinking blocks. Her conclusion: “Claude canno...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Training an AI to play Resident Evil Requiem using Behavior Cloning + HG-DAgge [P]

Code of Project: https://github.com/paulo101977/notebooks-rl/tree/main/re_requiem I’ve been working on training an agent to play a segmen...

Reddit - Machine Learning · 1 min ·
Machine Learning

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP [P]

I put together a small educational repo that implements distributed training parallelism from scratch in PyTorch: https://github.com/shre...

Reddit - Machine Learning · 1 min ·
Generalist AI unveils GEN-1 model, claiming breakthrough in real-world robotic task performance
Machine Learning

Generalist AI unveils GEN-1 model, claiming breakthrough in real-world robotic task performance

AI News - General · 6 min ·
Machine Learning

Can someone review my resume for ML Engineer / Data Scientist / GenAI roles and give blunt feedback?

submitted by /u/avn1sh [link] [comments]

Reddit - ML Jobs · 1 min ·
New AI model sparks alarm as governments brace for AI-driven cyberattacks
Machine Learning

New AI model sparks alarm as governments brace for AI-driven cyberattacks

AI Tools & Products · 6 min ·
Machine Learning

[D] Will Google’s TurboQuant algorithm hurt AI demand for memory chips? [D]

Google's TurboQuant claims to compress the KV cache by up to 6x with 'little apparent loss in accuracy' by reconstructing it on the fly. ...

Reddit - Machine Learning · 1 min ·
Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term post agentic ai just posting here for further discussion. submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min ·
Llms

How is mythos mythos ? [D]

Hello, I’ve been seeing discussions about “Mythos AI” showing behaviors that seem far beyond simple text prediction—like accessing inform...

Reddit - Machine Learning · 1 min ·
Machine Learning

What happens when intelligent systems move beyond simple utility?

Right now people are experiencing shallow depth, token limits and diluted intelligence from frontier models. I'm inviting people to exper...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

AGI is the wrong term, how do we define progress?

If a term can mean anything from "passed a Turing test" to "achieved consciousness", it's not a spectrum - it's a category error. Current...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Post Rebuttal ICML Average Scores? [D]

I have an average of 3.5. One of the reviewer gave us a 2 by bringing up a new issue he hadn't mentioned in his initial review, taking th...

Reddit - Machine Learning · 1 min ·
Machine Learning

Is "live AI video generation" a meaningful technical category or just a marketing term? [R]

Asking from a technical standpoint because I feel like the term is doing a lot of work in coverage of this space right now. Genuine real-...

Reddit - Machine Learning · 1 min ·
Open Source Ai

[D] Runtime layer on Hugging Face Transformers (no source changes) [D]

I’ve been experimenting with a runtime-layer approach to augmenting existing ML systems without modifying their source code. As a test ca...

Reddit - Machine Learning · 1 min ·
Machine Learning

Can I trick a public AI to spit out an outcome I prefer?

I am aware of an organization that evaluates proposals by feeding them into a public version of AI. Is there a way to make that AI rate m...

Reddit - Artificial Intelligence · 1 min ·
Llms

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try rando...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Fed Chair Jerome Powell, Treasury's Bessent and top bank CEOs met over Anthropic's Mythos model

submitted by /u/esporx [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Previous Page 343 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime