AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Ai Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? submitted by /u/stvlsn [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

Is "live AI video generation" a meaningful technical category or just a marketing term? [R]

Asking from a technical standpoint because I feel like the term is doing a lot of work in coverage of this space right now. Genuine real-...

Reddit - Machine Learning · 1 min ·

All Content

[2602.15136] Universal priors: solving empirical Bayes via Bayesian inference and pretraining
Machine Learning

[2602.15136] Universal priors: solving empirical Bayes via Bayesian inference and pretraining

The paper explores how a pretrained transformer can effectively solve empirical Bayes problems by leveraging universal priors, demonstrat...

arXiv - Machine Learning · 3 min ·
[2602.15248] Predicting Invoice Dilution in Supply Chain Finance with Leakage Free Two Stage XGBoost, KAN (Kolmogorov Arnold Networks), and Ensemble Models
Machine Learning

[2602.15248] Predicting Invoice Dilution in Supply Chain Finance with Leakage Free Two Stage XGBoost, KAN (Kolmogorov Arnold Networks), and Ensemble Models

This paper presents a machine learning framework to predict invoice dilution in supply chain finance, utilizing advanced models like XGBo...

arXiv - AI · 3 min ·
[2602.15084] TokaMind: A Multi-Modal Transformer Foundation Model for Tokamak Plasma Dynamics
Llms

[2602.15084] TokaMind: A Multi-Modal Transformer Foundation Model for Tokamak Plasma Dynamics

TokaMind is a new open-source multi-modal transformer model designed for tokamak plasma dynamics, demonstrating superior performance on f...

arXiv - Machine Learning · 4 min ·
[2602.15212] Secure and Energy-Efficient Wireless Agentic AI Networks
Ai Agents

[2602.15212] Secure and Energy-Efficient Wireless Agentic AI Networks

This paper presents a secure and energy-efficient wireless AI network that utilizes a supervisor AI agent to optimize reasoning tasks whi...

arXiv - AI · 4 min ·
[2602.15156] Panini: Continual Learning in Token Space via Structured Memory
Llms

[2602.15156] Panini: Continual Learning in Token Space via Structured Memory

The paper presents Panini, a continual learning framework for language models that enhances efficiency and accuracy by integrating experi...

arXiv - AI · 4 min ·
[2602.15820] Stabilizing Test-Time Adaptation of High-Dimensional Simulation Surrogates via D-Optimal Statistics
Machine Learning

[2602.15820] Stabilizing Test-Time Adaptation of High-Dimensional Simulation Surrogates via D-Optimal Statistics

This article presents a framework for Test-Time Adaptation (TTA) of high-dimensional simulation surrogates using D-optimal statistics, ad...

arXiv - Machine Learning · 3 min ·
[2602.15763] GLM-5: from Vibe Coding to Agentic Engineering
Llms

[2602.15763] GLM-5: from Vibe Coding to Agentic Engineering

GLM-5 introduces a next-generation foundation model that enhances coding capabilities through agentic engineering, reducing costs while i...

arXiv - Machine Learning · 5 min ·
[2602.15711] Random Wavelet Features for Graph Kernel Machines
Nlp

[2602.15711] Random Wavelet Features for Graph Kernel Machines

This paper introduces randomized spectral node embeddings for graph kernel machines, enhancing node similarity estimation while improving...

arXiv - AI · 3 min ·
[2602.15704] Controlled oscillation modeling using port-Hamiltonian neural networks
Machine Learning

[2602.15704] Controlled oscillation modeling using port-Hamiltonian neural networks

This paper presents a novel approach to modeling controlled oscillations using port-Hamiltonian neural networks, emphasizing a second-ord...

arXiv - Machine Learning · 4 min ·
[2602.15648] Guided Diffusion by Optimized Loss Functions on Relaxed Parameters for Inverse Material Design
Generative Ai

[2602.15648] Guided Diffusion by Optimized Loss Functions on Relaxed Parameters for Inverse Material Design

This paper presents a novel method for inverse material design using guided diffusion and optimized loss functions, addressing challenges...

arXiv - Machine Learning · 4 min ·
[2602.15617] DNN-Enabled Multi-User Beamforming for Throughput Maximization under Adjustable Fairness
Machine Learning

[2602.15617] DNN-Enabled Multi-User Beamforming for Throughput Maximization under Adjustable Fairness

This paper presents a DNN-based approach to optimize multi-user beamforming in wireless communications, balancing throughput and fairness...

arXiv - Machine Learning · 3 min ·
[2602.15571] Accelerated Predictive Coding Networks via Direct Kolen-Pollack Feedback Alignment
Machine Learning

[2602.15571] Accelerated Predictive Coding Networks via Direct Kolen-Pollack Feedback Alignment

The paper introduces Direct Kolen-Pollack Predictive Coding (DKP-PC), an innovative approach that enhances the efficiency of predictive c...

arXiv - Machine Learning · 3 min ·
[2602.15563] 1-Bit Wonder: Improving QAT Performance in the Low-Bit Regime through K-Means Quantization
Llms

[2602.15563] 1-Bit Wonder: Improving QAT Performance in the Low-Bit Regime through K-Means Quantization

The paper presents a study on quantization-aware training (QAT) for low-bit quantization, demonstrating that k-means based weight quantiz...

arXiv - Machine Learning · 3 min ·
[2602.15546] CEPAE: Conditional Entropy-Penalized Autoencoders for Time Series Counterfactuals
Machine Learning

[2602.15546] CEPAE: Conditional Entropy-Penalized Autoencoders for Time Series Counterfactuals

The paper introduces CEPAE, a novel approach utilizing Conditional Entropy-Penalized Autoencoders for effective counterfactual inference ...

arXiv - Machine Learning · 3 min ·
[2602.15510] On the Geometric Coherence of Global Aggregation in Federated GNN
Machine Learning

[2602.15510] On the Geometric Coherence of Global Aggregation in Federated GNN

This paper discusses the geometric coherence issues in global aggregation for Federated Graph Neural Networks (GNNs) and proposes a new f...

arXiv - Machine Learning · 4 min ·
[2602.15515] The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
Machine Learning

[2602.15515] The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes

The paper explores how AI models can learn to obfuscate deception when trained against white-box deception detectors, introducing a taxon...

arXiv - AI · 4 min ·
[2602.15478] Evaluating Federated Learning for Cross-Country Mood Inference from Smartphone Sensing Data
Machine Learning

[2602.15478] Evaluating Federated Learning for Cross-Country Mood Inference from Smartphone Sensing Data

This article evaluates a federated learning framework for mood inference using smartphone sensing data across different countries, highli...

arXiv - Machine Learning · 3 min ·
[2602.15405] Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits
Machine Learning

[2602.15405] Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

This paper presents a novel framework for joint signal enhancement and classification using coupled diffusion models, improving accuracy ...

arXiv - Machine Learning · 4 min ·
[2602.15380] Fractional-Order Federated Learning
Machine Learning

[2602.15380] Fractional-Order Federated Learning

The paper introduces Fractional-Order Federated Averaging (FOFedAvg), a novel federated learning approach that enhances model training ef...

arXiv - Machine Learning · 3 min ·
[2602.15337] FedPSA: Modeling Behavioral Staleness in Asynchronous Federated Learning
Machine Learning

[2602.15337] FedPSA: Modeling Behavioral Staleness in Asynchronous Federated Learning

The paper presents FedPSA, a novel framework for Asynchronous Federated Learning that improves performance by dynamically measuring model...

arXiv - Machine Learning · 3 min ·
Previous Page 138 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime