Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

Hey everyone, I've been working on a repo where I implement large language model architectures using the simplest PyTorch code possible. ...

Reddit - Machine Learning · 1 min ·
Llms

I built a repo for implementing and training LLM architectures from scratch in minimal PyTorch — contributions welcome! [P]

Hey everyone, I've been working on a repo where I implement large language model architectures using the simplest PyTorch code possible. ...

Reddit - Machine Learning · 1 min ·
White House and Anthropic hold 'productive' meeting amid fears over Mythos model
Machine Learning

White House and Anthropic hold 'productive' meeting amid fears over Mythos model

The discussion is a sign the AI firm's technology may be too critical for even the US government to do without.

AI Tools & Products · 4 min ·

All Content

Llms

[For Hire] Junior AI/ML Engineer | RAG · LLMs · FastAPI · Vector DBs | Remote

Posting this for a friend who isn't on Reddit. A recent graduate, entry level, no commercial production experience but spent the past yea...

Reddit - ML Jobs · 1 min ·
Machine Learning

The end of AI

I am a computer science student graduating this year, as far as ai is concerned my knowledge is fairly limited and fairly high level i kn...

Reddit - Artificial Intelligence · 1 min ·
The gig workers who are training humanoid robots at home | MIT Technology Review
Machine Learning

The gig workers who are training humanoid robots at home | MIT Technology Review

People in Nigeria and India are strapping iPhones onto their heads and recording themselves doing chores.

MIT Technology Review - AI · 9 min ·
Machine Learning

Biggest Opportunity for Builders to monetise their agents

We’re working on something where AI agent builders can publish their agents and earn from day one. This model is profitable from day 1 so...

Reddit - Artificial Intelligence · 1 min ·
[2603.14841] Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling
Machine Learning

[2603.14841] Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling

Abstract page for arXiv paper 2603.14841: Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling

arXiv - AI · 4 min ·
[2603.17839] How do LLMs Compute Verbal Confidence
Llms

[2603.17839] How do LLMs Compute Verbal Confidence

Abstract page for arXiv paper 2603.17839: How do LLMs Compute Verbal Confidence

arXiv - AI · 4 min ·
[2603.15970] 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models
Llms

[2603.15970] 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Abstract page for arXiv paper 2603.15970: 100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight...

arXiv - AI · 4 min ·
[2603.09085] Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting
Llms

[2603.09085] Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum Price Forecasting

Abstract page for arXiv paper 2603.09085: Not All News Is Equal: Topic- and Event-Conditional Sentiment from Finetuned LLMs for Aluminum ...

arXiv - AI · 4 min ·
[2603.05659] When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On
Machine Learning

[2603.05659] When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual Try-On

Abstract page for arXiv paper 2603.05659: When Rubrics Fail: Error Enumeration as Reward in Reference-Free RL Post-Training for Virtual T...

arXiv - AI · 4 min ·
[2602.03584] $V_0$: A Generalist Value Model for Any Policy at State Zero
Llms

[2602.03584] $V_0$: A Generalist Value Model for Any Policy at State Zero

Abstract page for arXiv paper 2602.03584: $V_0$: A Generalist Value Model for Any Policy at State Zero

arXiv - AI · 4 min ·
[2601.17094] The Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation
Llms

[2601.17094] The Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

Abstract page for arXiv paper 2601.17094: The Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

arXiv - AI · 4 min ·
[2601.04448] Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models
Llms

[2601.04448] Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models

Abstract page for arXiv paper 2601.04448: Merging Triggers, Breaking Backdoors: Defensive Poisoning for Instruction-Tuned Language Models

arXiv - AI · 3 min ·
[2512.02902] VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling
Machine Learning

[2512.02902] VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling

Abstract page for arXiv paper 2512.02902: VLA Models Are More Generalizable Than You Think: Revisiting Physical and Spatial Modeling

arXiv - AI · 3 min ·
[2512.19576] LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller
Machine Learning

[2512.19576] LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

Abstract page for arXiv paper 2512.19576: LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

arXiv - AI · 4 min ·
[2512.16081] Evaluation of Generative Models for Emotional 3D Animation Generation in VR
Machine Learning

[2512.16081] Evaluation of Generative Models for Emotional 3D Animation Generation in VR

Abstract page for arXiv paper 2512.16081: Evaluation of Generative Models for Emotional 3D Animation Generation in VR

arXiv - AI · 4 min ·
[2512.15987] Provably Extracting the Features from a General Superposition
Machine Learning

[2512.15987] Provably Extracting the Features from a General Superposition

Abstract page for arXiv paper 2512.15987: Provably Extracting the Features from a General Superposition

arXiv - AI · 4 min ·
[2512.10938] Stronger Normalization-Free Transformers
Machine Learning

[2512.10938] Stronger Normalization-Free Transformers

Abstract page for arXiv paper 2512.10938: Stronger Normalization-Free Transformers

arXiv - AI · 4 min ·
[2512.08829] InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
Llms

[2512.08829] InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Abstract page for arXiv paper 2512.08829: InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Visio...

arXiv - AI · 4 min ·
[2512.05411] A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems
Llms

[2512.05411] A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to Enhance RAG Systems

Abstract page for arXiv paper 2512.05411: A Systematic Framework for Enterprise Knowledge Retrieval: Leveraging LLM-Generated Metadata to...

arXiv - AI · 4 min ·
[2511.22715] ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering
Llms

[2511.22715] ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

Abstract page for arXiv paper 2511.22715: ReAG: Reasoning-Augmented Generation for Knowledge-based Visual Question Answering

arXiv - AI · 4 min ·
Previous Page 204 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime