AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Llms

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

https://arxiv.org/abs/2604.05091 Abstract: "We present MegaTrain, a memory-centric system that efficiently trains 100B+ parameter large l...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

"There's a green field." Five words, no system prompt, pure autocomplete. It figured out what it was.

No chat interface. No identity. No instructions. Just the API in raw autocomplete mode. The model receives text, predicts the next tokens...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Why would Anthropic keep a cyber model like Project Glasswing invite-only?

Anthropic’s Project Glasswing caught my attention less as a cybersecurity headline than as a signal about how frontier AI may be commerci...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2510.13582] ArtNet: Hierarchical Clustering-Based Artificial Netlist Generator for ML and DTCO Application
Machine Learning

[2510.13582] ArtNet: Hierarchical Clustering-Based Artificial Netlist Generator for ML and DTCO Application

ArtNet introduces a novel artificial netlist generator that enhances machine learning model generalization and design-technology co-optim...

arXiv - Machine Learning · 4 min ·
[2507.05411] AXLearn: Modular, Hardware-Agnostic Large Model Training
Machine Learning

[2507.05411] AXLearn: Modular, Hardware-Agnostic Large Model Training

AXLearn presents a modular and hardware-agnostic approach to training large deep learning models, enhancing scalability and performance w...

arXiv - Machine Learning · 4 min ·
[2502.14762] Unlocking [CLS] Features for Continual Post-Training
Llms

[2502.14762] Unlocking [CLS] Features for Continual Post-Training

The paper presents a novel approach to continual learning in machine learning models, introducing a parameter-efficient fine-tuning modul...

arXiv - Machine Learning · 4 min ·
[2602.17314] Open Datasets in Learning Analytics: Trends, Challenges, and Best PRACTICE
Data Science

[2602.17314] Open Datasets in Learning Analytics: Trends, Challenges, and Best PRACTICE

This article surveys open datasets in learning analytics, identifying trends, challenges, and best practices to enhance research reproduc...

arXiv - Machine Learning · 4 min ·
[2602.17223] Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs
Llms

[2602.17223] Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs

The paper presents new privacy-preserving protocols for verifiable inference of large language models (LLMs), addressing the challenges o...

arXiv - Machine Learning · 4 min ·
[2602.16979] Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling
Llms

[2602.16979] Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling

The paper presents PRIMO, a supervised latent-variable model that addresses the challenges of incomplete multimodal data by quantifying t...

arXiv - Machine Learning · 4 min ·
[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling
Machine Learning

[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling

This paper presents Greedy Multi-Path Block Verification (GBV), a method that enhances the efficiency of speculative decoding in machine ...

arXiv - Machine Learning · 4 min ·
[2602.16908] Multi-objective optimization and quantum hybridization of equivariant deep learning interatomic potentials on organic and inorganic compounds
Machine Learning

[2602.16908] Multi-objective optimization and quantum hybridization of equivariant deep learning interatomic potentials on organic and inorganic compounds

This article presents a study on the multi-objective optimization of deep learning interatomic potentials, focusing on the trade-off betw...

arXiv - Machine Learning · 3 min ·
[2602.16835] NeST: Neuron Selective Tuning for LLM Safety
Llms

[2602.16835] NeST: Neuron Selective Tuning for LLM Safety

The paper introduces NeST, a novel framework for enhancing safety in large language models (LLMs) by selectively tuning a small subset of...

arXiv - Machine Learning · 4 min ·
[2602.16749] U-FedTomAtt: Ultra-lightweight Federated Learning with Attention for Tomato Disease Recognition
Machine Learning

[2602.16749] U-FedTomAtt: Ultra-lightweight Federated Learning with Attention for Tomato Disease Recognition

The paper presents U-FedTomAtt, an ultra-lightweight federated learning framework designed for tomato disease recognition, optimizing per...

arXiv - Machine Learning · 4 min ·
[2602.16738] Self-Evolving Multi-Agent Network for Industrial IoT Predictive Maintenance
Llms

[2602.16738] Self-Evolving Multi-Agent Network for Industrial IoT Predictive Maintenance

The paper presents SEMAS, a self-evolving multi-agent network designed for predictive maintenance in Industrial IoT, enhancing real-time ...

arXiv - Machine Learning · 4 min ·
[2602.16075] DARTH-PUM: A Hybrid Processing-Using-Memory Architecture
Ai Infrastructure

[2602.16075] DARTH-PUM: A Hybrid Processing-Using-Memory Architecture

DARTH-PUM proposes a hybrid Processing-Using-Memory architecture that integrates analog and digital PUM to enhance computational efficien...

arXiv - Machine Learning · 4 min ·
[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs
Llms

[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs

The paper presents KVFetcher, a novel solution for efficient remote key-value (KV) cache reuse using GPU-native video codecs, significant...

arXiv - Machine Learning · 4 min ·
[2602.17642] A.R.I.S.: Automated Recycling Identification System for E-Waste Classification Using Deep Learning
Machine Learning

[2602.17642] A.R.I.S.: Automated Recycling Identification System for E-Waste Classification Using Deep Learning

The A.R.I.S. system utilizes deep learning to enhance e-waste recycling by accurately classifying materials in real-time, improving recov...

arXiv - Machine Learning · 3 min ·
[2602.17614] Guarding the Middle: Protecting Intermediate Representations in Federated Split Learning
Machine Learning

[2602.17614] Guarding the Middle: Protecting Intermediate Representations in Federated Split Learning

This paper presents KD-UFSL, a method to enhance privacy in federated split learning by minimizing data leakage through intermediate repr...

arXiv - Machine Learning · 4 min ·
[2602.17596] Asymptotic Smoothing of the Lipschitz Loss Landscape in Overparameterized One-Hidden-Layer ReLU Networks
Machine Learning

[2602.17596] Asymptotic Smoothing of the Lipschitz Loss Landscape in Overparameterized One-Hidden-Layer ReLU Networks

This paper explores the loss landscape of one-hidden-layer ReLU networks, demonstrating that overparameterization leads to smoother lands...

arXiv - Machine Learning · 3 min ·
[2602.17525] Variational inference via radial transport
Machine Learning

[2602.17525] Variational inference via radial transport

The paper introduces a novel approach to variational inference (VI) by optimizing radial profiles, enhancing the approximation of high-di...

arXiv - Machine Learning · 3 min ·
[2602.17375] MDP Planning as Policy Inference
Machine Learning

[2602.17375] MDP Planning as Policy Inference

This article presents a novel approach to episodic Markov decision process (MDP) planning by framing it as Bayesian inference over polici...

arXiv - Machine Learning · 3 min ·
[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
Machine Learning

[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

The paper presents 2Mamba, a linear attention transformer variant that achieves competitive accuracy compared to softmax attention while ...

arXiv - Machine Learning · 3 min ·
[2602.17270] Unified Latents (UL): How to train your latents
Machine Learning

[2602.17270] Unified Latents (UL): How to train your latents

The paper introduces Unified Latents (UL), a framework for training latent representations using a diffusion prior, achieving competitive...

arXiv - Machine Learning · 3 min ·
Previous Page 118 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime