AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

https://arxiv.org/abs/2604.05091 Abstract: "We present MegaTrain, a memory-centric system that efficiently trains 100B+ parameter large l...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

"There's a green field." Five words, no system prompt, pure autocomplete. It figured out what it was.

No chat interface. No identity. No instructions. Just the API in raw autocomplete mode. The model receives text, predicts the next tokens...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Why would Anthropic keep a cyber model like Project Glasswing invite-only?

Anthropic’s Project Glasswing caught my attention less as a cybersecurity headline than as a signal about how frontier AI may be commerci...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

All Content

Machine Learning

[2510.13582] ArtNet: Hierarchical Clustering-Based Artificial Netlist Generator for ML and DTCO Application

ArtNet introduces a novel artificial netlist generator that enhances machine learning model generalization and design-technology co-optim...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2507.05411] AXLearn: Modular, Hardware-Agnostic Large Model Training

AXLearn presents a modular and hardware-agnostic approach to training large deep learning models, enhancing scalability and performance w...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2502.14762] Unlocking [CLS] Features for Continual Post-Training

The paper presents a novel approach to continual learning in machine learning models, introducing a parameter-efficient fine-tuning modul...

arXiv - Machine Learning · 4 min · about 2 months ago

Data Science

[2602.17314] Open Datasets in Learning Analytics: Trends, Challenges, and Best PRACTICE

This article surveys open datasets in learning analytics, identifying trends, challenges, and best practices to enhance research reproduc...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.17223] Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs

The paper presents new privacy-preserving protocols for verifiable inference of large language models (LLMs), addressing the challenges o...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16979] Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling

The paper presents PRIMO, a supervised latent-variable model that addresses the challenges of incomplete multimodal data by quantifying t...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling

This paper presents Greedy Multi-Path Block Verification (GBV), a method that enhances the efficiency of speculative decoding in machine ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16908] Multi-objective optimization and quantum hybridization of equivariant deep learning interatomic potentials on organic and inorganic compounds

This article presents a study on the multi-objective optimization of deep learning interatomic potentials, focusing on the trade-off betw...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.16835] NeST: Neuron Selective Tuning for LLM Safety

The paper introduces NeST, a novel framework for enhancing safety in large language models (LLMs) by selectively tuning a small subset of...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16749] U-FedTomAtt: Ultra-lightweight Federated Learning with Attention for Tomato Disease Recognition

The paper presents U-FedTomAtt, an ultra-lightweight federated learning framework designed for tomato disease recognition, optimizing per...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16738] Self-Evolving Multi-Agent Network for Industrial IoT Predictive Maintenance

The paper presents SEMAS, a self-evolving multi-agent network designed for predictive maintenance in Industrial IoT, enhancing real-time ...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Infrastructure

[2602.16075] DARTH-PUM: A Hybrid Processing-Using-Memory Architecture

DARTH-PUM proposes a hybrid Processing-Using-Memory architecture that integrates analog and digital PUM to enhance computational efficien...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs

The paper presents KVFetcher, a novel solution for efficient remote key-value (KV) cache reuse using GPU-native video codecs, significant...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.17642] A.R.I.S.: Automated Recycling Identification System for E-Waste Classification Using Deep Learning

The A.R.I.S. system utilizes deep learning to enhance e-waste recycling by accurately classifying materials in real-time, improving recov...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.17614] Guarding the Middle: Protecting Intermediate Representations in Federated Split Learning

This paper presents KD-UFSL, a method to enhance privacy in federated split learning by minimizing data leakage through intermediate repr...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.17596] Asymptotic Smoothing of the Lipschitz Loss Landscape in Overparameterized One-Hidden-Layer ReLU Networks

This paper explores the loss landscape of one-hidden-layer ReLU networks, demonstrating that overparameterization leads to smoother lands...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.17525] Variational inference via radial transport

The paper introduces a novel approach to variational inference (VI) by optimizing radial profiles, enhancing the approximation of high-di...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.17375] MDP Planning as Policy Inference

This article presents a novel approach to episodic Markov decision process (MDP) planning by framing it as Bayesian inference over polici...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

The paper presents 2Mamba, a linear attention transformer variant that achieves competitive accuracy compared to softmax attention while ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.17270] Unified Latents (UL): How to train your latents

The paper introduces Unified Latents (UL), a framework for training latent representations using a diffusion prior, achieving competitive...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 118 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

"There's a green field." Five words, no system prompt, pure autocomplete. It figured out what it was.

Why would Anthropic keep a cyber model like Project Glasswing invite-only?

All Content

[2510.13582] ArtNet: Hierarchical Clustering-Based Artificial Netlist Generator for ML and DTCO Application

[2507.05411] AXLearn: Modular, Hardware-Agnostic Large Model Training

[2502.14762] Unlocking [CLS] Features for Continual Post-Training

[2602.17314] Open Datasets in Learning Analytics: Trends, Challenges, and Best PRACTICE

[2602.17223] Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs

[2602.16979] Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling

[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling

[2602.16908] Multi-objective optimization and quantum hybridization of equivariant deep learning interatomic potentials on organic and inorganic compounds

[2602.16835] NeST: Neuron Selective Tuning for LLM Safety

[2602.16749] U-FedTomAtt: Ultra-lightweight Federated Learning with Attention for Tomato Disease Recognition

[2602.16738] Self-Evolving Multi-Agent Network for Industrial IoT Predictive Maintenance

[2602.16075] DARTH-PUM: A Hybrid Processing-Using-Memory Architecture

[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs

[2602.17642] A.R.I.S.: Automated Recycling Identification System for E-Waste Classification Using Deep Learning

[2602.17614] Guarding the Middle: Protecting Intermediate Representations in Federated Split Learning

[2602.17596] Asymptotic Smoothing of the Lipschitz Loss Landscape in Overparameterized One-Hidden-Layer ReLU Networks

[2602.17525] Variational inference via radial transport

[2602.17375] MDP Planning as Policy Inference

[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

[2602.17270] Unified Latents (UL): How to train your latents

Related Topics

Stay updated with AI News