Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

Hey everyone, I've been working on a repo where I implement large language model architectures using the simplest PyTorch code possible. ...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

Hey everyone, I've been working on a repo where I implement large language model architectures using the simplest PyTorch code possible. ...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

White House and Anthropic hold 'productive' meeting amid fears over Mythos model

The discussion is a sign the AI firm's technology may be too critical for even the US government to do without.

AI Tools & Products · 4 min · about 2 hours ago

All Content

Llms

[For Hire] Junior AI/ML Engineer | RAG · LLMs · FastAPI · Vector DBs | Remote

Posting this for a friend who isn't on Reddit. A recent graduate, entry level, no commercial production experience but spent the past yea...

Reddit - ML Jobs · 1 min · 17 days ago

Machine Learning

The end of AI

I am a computer science student graduating this year, as far as ai is concerned my knowledge is fairly limited and fairly high level i kn...

Reddit - Artificial Intelligence · 1 min · 17 days ago

Machine Learning

The gig workers who are training humanoid robots at home | MIT Technology Review

People in Nigeria and India are strapping iPhones onto their heads and recording themselves doing chores.

MIT Technology Review - AI · 9 min · 17 days ago

Machine Learning

Biggest Opportunity for Builders to monetise their agents

We’re working on something where AI agent builders can publish their agents and earn from day one. This model is profitable from day 1 so...

Reddit - Artificial Intelligence · 1 min · 17 days ago

Machine Learning

[2603.14841] Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling

Abstract page for arXiv paper 2603.14841: Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling

arXiv - AI · 4 min · 17 days ago

Llms

[2603.17839] How do LLMs Compute Verbal Confidence

Abstract page for arXiv paper 2603.17839: How do LLMs Compute Verbal Confidence

arXiv - AI · 4 min · 17 days ago

Llms

[2603.15970] 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Abstract page for arXiv paper 2603.15970: 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight...

arXiv - AI · 4 min · 17 days ago

Llms

[2603.09085] Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting

Abstract page for arXiv paper 2603.09085: Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum ...

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2603.05659] When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On

Abstract page for arXiv paper 2603.05659: When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual T...

arXiv - AI · 4 min · 17 days ago

Llms

[2602.03584] $V_0$: A Generalist Value Model for Any Policy at State Zero

Abstract page for arXiv paper 2602.03584: $V_0$: A Generalist Value Model for Any Policy at State Zero

arXiv - AI · 4 min · 17 days ago

Llms

[2601.17094] The Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

Abstract page for arXiv paper 2601.17094: The Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

arXiv - AI · 4 min · 17 days ago

Llms

[2601.04448] Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models

Abstract page for arXiv paper 2601.04448: Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2512.02902] VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling

Abstract page for arXiv paper 2512.02902: VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2512.19576] LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

Abstract page for arXiv paper 2512.19576: LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2512.16081] Evaluation of Generative Models for Emotional 3D Animation Generation in VR

Abstract page for arXiv paper 2512.16081: Evaluation of Generative Models for Emotional 3D Animation Generation in VR

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2512.15987] Provably Extracting the Features from a General Superposition

Abstract page for arXiv paper 2512.15987: Provably Extracting the Features from a General Superposition

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2512.10938] Stronger Normalization-Free Transformers

Abstract page for arXiv paper 2512.10938: Stronger Normalization-Free Transformers

arXiv - AI · 4 min · 17 days ago

Llms

[2512.08829] InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Abstract page for arXiv paper 2512.08829: InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Visio...

arXiv - AI · 4 min · 17 days ago

Llms

[2512.05411] A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems

Abstract page for arXiv paper 2512.05411: A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to...

arXiv - AI · 4 min · 17 days ago

Llms

[2511.22715] ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

Abstract page for arXiv paper 2511.22715: ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

arXiv - AI · 4 min · 17 days ago

Previous Page 204 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

White House and Anthropic hold 'productive' meeting amid fears over Mythos model

All Content

[For Hire] Junior AI/ML Engineer | RAG · LLMs · FastAPI · Vector DBs | Remote

The end of AI

The gig workers who are training humanoid robots at home | MIT Technology Review

Biggest Opportunity for Builders to monetise their agents

[2603.14841] Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling

[2603.17839] How do LLMs Compute Verbal Confidence

[2603.15970] 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

[2603.09085] Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting

[2603.05659] When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On

[2602.03584] $V_0$: A Generalist Value Model for Any Policy at State Zero

[2601.17094] The Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

[2601.04448] Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models

[2512.02902] VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling

[2512.19576] LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

[2512.16081] Evaluation of Generative Models for Emotional 3D Animation Generation in VR

[2512.15987] Provably Extracting the Features from a General Superposition

[2512.10938] Stronger Normalization-Free Transformers

[2512.08829] InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

[2512.05411] A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems

[2511.22715] ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

Related Topics

Stay updated with AI News