Machine Learning

ML algorithms, training, and inference

Top This Week

LLMs

Transformer Math Explorer [P]

This is an interactive math reference for transformer models, presented via dataflow graphs, all the way down to elementary math. Covers ...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

how much of your time goes into environment setup vs actual model work?

For most people I've talked to, it's embarrassingly high. New machine? Set up CUDA again. New team member? Good luck reproducing the ...

Reddit - ML Jobs · 1 min · about 2 hours ago

Machine Learning

How much can a video generated by the same diffusion model differ across GPU architectures if the initial noise latent is fixed? [D]

Hi! I am trying to sanity-check an assumption for diffusion video generation reproducibility. Suppose I run the same video diffusion mode...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Machine Learning

So Confused about Polarizing ICML Reviews [D]

Hi, rebuttals recently finished, and I wanted to share my paper's scores to ask for thoughts on this, and whether this situation is borde...

Reddit - Machine Learning · 1 min · 25 days ago

Machine Learning

KIV: 1M token context window on an RTX 4070 (12GB VRAM), no retraining, drop-in HuggingFace cache replacement - Works with any model that uses DynamicCache [P]

Been working on this for a bit and figured it was ready to share. KIV (K-Indexed V Materialization) is a middleware layer that replaces t...

Reddit - Machine Learning · 1 min · 25 days ago

Machine Learning

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP

I put together a small educational repo that implements distributed training parallelism from scratch in PyTorch: https://github.com/shre...

Reddit - Artificial Intelligence · 1 min · 25 days ago

LLMs

Claude cannot be trusted to perform complex engineering tasks

AMD’s AI director just analyzed 6,852 Claude Code sessions, 234,760 tool calls, and 17,871 thinking blocks. Her conclusion: “Claude canno...

Reddit - Artificial Intelligence · 1 min · 25 days ago

Machine Learning

Training an AI to play Resident Evil Requiem using Behavior Cloning + HG-DAgger [P]

Code of Project: https://github.com/paulo101977/notebooks-rl/tree/main/re_requiem I’ve been working on training an agent to play a segmen...

Reddit - Machine Learning · 1 min · 25 days ago

Machine Learning

Educational PyTorch repo for distributed training from scratch: DP, FSDP, TP, FSDP+TP, and PP [P]

I put together a small educational repo that implements distributed training parallelism from scratch in PyTorch: https://github.com/shre...

Reddit - Machine Learning · 1 min · 25 days ago

Machine Learning

Generalist AI unveils GEN-1 model, claiming breakthrough in real-world robotic task performance

AI News - General · 6 min · 25 days ago

Machine Learning

Can someone review my resume for ML Engineer / Data Scientist / GenAI roles and give blunt feedback?

submitted by /u/avn1sh

Reddit - ML Jobs · 1 min · 25 days ago

Machine Learning

New AI model sparks alarm as governments brace for AI-driven cyberattacks

AI Tools & Products · 6 min · 25 days ago

Machine Learning

Will Google’s TurboQuant algorithm hurt AI demand for memory chips? [D]

Google's TurboQuant claims to compress the KV cache by up to 6x with 'little apparent loss in accuracy' by reconstructing it on the fly. ...

Reddit - Machine Learning · 1 min · 25 days ago

Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term post agentic ai just posting here for further discussion. submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min · 25 days ago

LLMs

How is mythos mythos? [D]

Hello, I’ve been seeing discussions about “Mythos AI” showing behaviors that seem far beyond simple text prediction—like accessing inform...

Reddit - Machine Learning · 1 min · 25 days ago

Machine Learning

What happens when intelligent systems move beyond simple utility?

Right now people are experiencing shallow depth, token limits and diluted intelligence from frontier models. I'm inviting people to exper...

Reddit - Artificial Intelligence · 1 min · 26 days ago

Machine Learning

AGI is the wrong term, how do we define progress?

If a term can mean anything from "passed a Turing test" to "achieved consciousness", it's not a spectrum - it's a category error. Current...

Reddit - Artificial Intelligence · 1 min · 26 days ago

Machine Learning

Post Rebuttal ICML Average Scores? [D]

I have an average of 3.5. One of the reviewers gave us a 2 by bringing up a new issue he hadn't mentioned in his initial review, taking th...

Reddit - Machine Learning · 1 min · 26 days ago

Machine Learning

Is "live AI video generation" a meaningful technical category or just a marketing term? [R]

Asking from a technical standpoint because I feel like the term is doing a lot of work in coverage of this space right now. Genuine real-...

Reddit - Machine Learning · 1 min · 26 days ago

Open Source AI

Runtime layer on Hugging Face Transformers (no source changes) [D]

I’ve been experimenting with a runtime-layer approach to augmenting existing ML systems without modifying their source code. As a test ca...

Reddit - Machine Learning · 1 min · 26 days ago

Machine Learning

Can I trick a public AI to spit out an outcome I prefer?

I am aware of an organization that evaluates proposals by feeding them into a public version of AI. Is there a way to make that AI rate m...

Reddit - Artificial Intelligence · 1 min · 26 days ago

LLMs

Curated 550+ free AI tools useful for building projects (LLMs, APIs, local models, RAG, agents)

Over the last few days I was collecting free or low cost AI tools that are actually useful if you want to build stuff, not just try rando...

Reddit - Artificial Intelligence · 1 min · 26 days ago

Machine Learning

Fed Chair Jerome Powell, Treasury's Bessent and top bank CEOs met over Anthropic's Mythos model

submitted by /u/esporx

Reddit - Artificial Intelligence · 1 min · 26 days ago
