AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Llms

[P] I trained a language model from scratch for a low resource language and got it running fully on-device on Android (no GPU, demo)

Hi Everybody! I just wanted to share an update on a project I’ve been working on called BULaMU, a family of language models trained (20M,...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Ai Infrastructure

The traditional "app" might be a transitional form. What actually replaces it when AI becomes the primary interface?

Something I keep coming back to after 30 years in engineering: if AI becomes a primary way we interact with our data, the "app" as an org...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.04224] Nearest-Neighbor Density Estimation for Dependency Suppression
Ai Infrastructure

[2603.04224] Nearest-Neighbor Density Estimation for Dependency Suppression

Abstract page for arXiv paper 2603.04224: Nearest-Neighbor Density Estimation for Dependency Suppression

arXiv - Machine Learning · 3 min ·
[2603.04142] A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series
Llms

[2603.04142] A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series

Abstract page for arXiv paper 2603.04142: A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series

arXiv - Machine Learning · 4 min ·
[2603.03681] EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs
Llms

[2603.03681] EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

Abstract page for arXiv paper 2603.03681: EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs

arXiv - AI · 3 min ·
[2603.04134] InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs
Machine Learning

[2603.04134] InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs

Abstract page for arXiv paper 2603.04134: InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs

arXiv - Machine Learning · 4 min ·
[2603.04045] Inference-Time Toxicity Mitigation in Protein Language Models
Llms

[2603.04045] Inference-Time Toxicity Mitigation in Protein Language Models

Abstract page for arXiv paper 2603.04045: Inference-Time Toxicity Mitigation in Protein Language Models

arXiv - AI · 3 min ·
[2603.04035] mlx-vis: GPU-Accelerated Dimensionality Reduction and Visualization on Apple Silicon
Machine Learning

[2603.04035] mlx-vis: GPU-Accelerated Dimensionality Reduction and Visualization on Apple Silicon

Abstract page for arXiv paper 2603.04035: mlx-vis: GPU-Accelerated Dimensionality Reduction and Visualization on Apple Silicon

arXiv - Machine Learning · 3 min ·
[2603.04028] A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Quality
Llms

[2603.04028] A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Quality

Abstract page for arXiv paper 2603.04028: A Multi-Dimensional Quality Scoring Framework for Decentralized LLM Inference with Proof of Qua...

arXiv - AI · 4 min ·
[2603.03536] SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems
Llms

[2603.03536] SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

Abstract page for arXiv paper 2603.03536: SafeCRS: Personalized Safety Alignment for LLM-Based Conversational Recommender Systems

arXiv - AI · 3 min ·
[2603.03973] Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction
Machine Learning

[2603.03973] Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction

Abstract page for arXiv paper 2603.03973: Dual-Solver: A Generalized ODE Solver for Diffusion Models with Dual Prediction

arXiv - Machine Learning · 3 min ·
[2603.03922] Hierarchical Inference and Closure Learning via Adaptive Surrogates for ODEs and PDEs
Machine Learning

[2603.03922] Hierarchical Inference and Closure Learning via Adaptive Surrogates for ODEs and PDEs

Abstract page for arXiv paper 2603.03922: Hierarchical Inference and Closure Learning via Adaptive Surrogates for ODEs and PDEs

arXiv - Machine Learning · 4 min ·
[2603.03830] Large-Margin Hyperdimensional Computing: A Learning-Theoretical Perspective
Machine Learning

[2603.03830] Large-Margin Hyperdimensional Computing: A Learning-Theoretical Perspective

Abstract page for arXiv paper 2603.03830: Large-Margin Hyperdimensional Computing: A Learning-Theoretical Perspective

arXiv - Machine Learning · 3 min ·
[2603.03417] Parallel Test-Time Scaling with Multi-Sequence Verifiers
Llms

[2603.03417] Parallel Test-Time Scaling with Multi-Sequence Verifiers

Abstract page for arXiv paper 2603.03417: Parallel Test-Time Scaling with Multi-Sequence Verifiers

arXiv - AI · 4 min ·
[2603.03777] LEA: Label Enumeration Attack in Vertical Federated Learning
Machine Learning

[2603.03777] LEA: Label Enumeration Attack in Vertical Federated Learning

Abstract page for arXiv paper 2603.03777: LEA: Label Enumeration Attack in Vertical Federated Learning

arXiv - Machine Learning · 4 min ·
[2603.03756] MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier
Llms

[2603.03756] MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

Abstract page for arXiv paper 2603.03756: MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Ba...

arXiv - Machine Learning · 3 min ·
[2603.03412] PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image Editing
Machine Learning

[2603.03412] PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image Editing

Abstract page for arXiv paper 2603.03412: PRIVATEEDIT: A Privacy-Preserving Pipeline for Face-Centric Generative Image Editing

arXiv - AI · 4 min ·
[2603.03380] LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics
Machine Learning

[2603.03380] LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics

Abstract page for arXiv paper 2603.03380: LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics

arXiv - AI · 3 min ·
[2603.03621] Extending Neural Operators: Robust Handling of Functions Beyond the Training Set
Machine Learning

[2603.03621] Extending Neural Operators: Robust Handling of Functions Beyond the Training Set

Abstract page for arXiv paper 2603.03621: Extending Neural Operators: Robust Handling of Functions Beyond the Training Set

arXiv - Machine Learning · 3 min ·
[2603.03597] NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training
Llms

[2603.03597] NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

Abstract page for arXiv paper 2603.03597: NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

arXiv - Machine Learning · 3 min ·
[2603.03538] Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs
Llms

[2603.03538] Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs

Abstract page for arXiv paper 2603.03538: Online Learnability of Chain-of-Thought Verifiers: Soundness and Completeness Trade-offs

arXiv - Machine Learning · 4 min ·
[2603.03535] Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts
Llms

[2603.03535] Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

Abstract page for arXiv paper 2603.03535: Trade-offs in Ensembling, Merging and Routing Among Parameter-Efficient Experts

arXiv - Machine Learning · 3 min ·
Previous Page 33 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime