AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
LLMs

What is the current landscape on AI agents knowledge

Recently used "free" rates codex to give me a quick FastAPI project sample. It gave me the deprecated @app.on_event("startup"). What are you...

Reddit - Artificial Intelligence · 1 min ·
NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots
Open Source AI

A blog post by NVIDIA on Hugging Face

Hugging Face Blog · 4 min ·

All Content

[2601.21812] A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting
Machine Learning

This paper presents a novel forward diffusion process for time-series forecasting that effectively decomposes signals into spectral compo...

arXiv - Machine Learning · 3 min ·
[2510.12764] AnyUp: Universal Feature Upsampling
Machine Learning

The paper presents AnyUp, a novel method for universal feature upsampling applicable to various vision features at any resolution, enhanc...

arXiv - Machine Learning · 3 min ·
[2510.11418] Forward-Forward Autoencoder Architectures for Energy-Efficient Wireless Communications
Machine Learning

This article presents Forward-Forward Autoencoder architectures aimed at enhancing energy efficiency in wireless communications, demonstr...

arXiv - Machine Learning · 3 min ·
[2512.22420] Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving
LLMs

The paper presents Nightjar, a novel algorithm for dynamic adaptive speculative decoding in large language models, enhancing throughput a...

arXiv - AI · 3 min ·
[2509.26335] TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs
Machine Learning

The paper discusses TrackCore-F, a methodology for deploying Transformer-based models for subatomic particle tracking on FPGAs, highlight...

arXiv - Machine Learning · 3 min ·
[2509.18129] Pareto-optimal Trade-offs Between Communication and Computation with Flexible Gradient Tracking
NLP

This paper presents FlexGT, a method for optimizing distributed stochastic problems by balancing communication and computation, achieving...

arXiv - Machine Learning · 4 min ·
[2511.07293] Formal Reasoning About Confidence and Automated Verification of Neural Networks
Machine Learning

This paper presents a framework for formal reasoning about the confidence and robustness of neural networks, proposing a unified techniqu...

arXiv - AI · 3 min ·
[2510.22876] Batch Speculative Decoding Done Right
NLP

The paper presents a novel framework for batch speculative decoding, addressing critical failures in existing methods and achieving signi...

arXiv - AI · 4 min ·
[2506.08749] Superposed parameterised quantum circuits
Machine Learning

The paper introduces superposed parameterised quantum circuits, enhancing quantum machine learning by embedding multiple parameter sets i...

arXiv - Machine Learning · 4 min ·
[2506.05402] Lorica: A Synergistic Fine-Tuning Framework for Advancing Personalized Adversarial Robustness
Machine Learning

The paper presents Lorica, a novel framework aimed at enhancing personalized adversarial robustness in machine learning models, particula...

arXiv - Machine Learning · 4 min ·
[2505.21723] Are Statistical Methods Obsolete in the Era of Deep Learning? A Study of ODE Inverse Problems
Machine Learning

This article examines the relevance of statistical methods in the age of deep learning, using ordinary differential equation (ODE) invers...

arXiv - Machine Learning · 4 min ·
[2510.02356] Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark
LLMs

This article presents EAPrivacy, a benchmark for evaluating the physical-world privacy awareness of large language models (LLMs), reveali...

arXiv - AI · 4 min ·
[2509.25275] VoiceBridge: General Speech Restoration with One-step Latent Bridge Models
Machine Learning

VoiceBridge introduces a novel one-step latent bridge model for general speech restoration, enhancing audio quality from various distorti...

arXiv - AI · 4 min ·
[2509.23519] ReliabilityRAG: Effective and Provably Robust Defense for RAG-based Web-Search
LLMs

The paper introduces ReliabilityRAG, a framework designed to enhance the robustness of Retrieval-Augmented Generation (RAG) systems again...

arXiv - AI · 4 min ·
[2403.15605] Efficiently Assemble Normalization Layers and Regularization for Federated Domain Generalization
Machine Learning

The paper presents a novel method, gPerXAN, for Federated Domain Generalization (FedDG) that enhances model performance by effectively as...

arXiv - Machine Learning · 4 min ·
[2507.19234] Virne: A Comprehensive Benchmark for RL-based Network Resource Allocation in NFV
AI Infrastructure

The paper introduces Virne, a benchmarking framework designed for Reinforcement Learning-based resource allocation in Network Function Vi...

arXiv - AI · 4 min ·
[2306.14297] Inference for relative sparsity
Machine Learning

The paper discusses a novel approach to inference for relative sparsity in healthcare decision-making, addressing the need for uncertaint...

arXiv - Machine Learning · 4 min ·
[2507.14186] A Disentangled Representation Learning Framework for Low-altitude Network Coverage Prediction
NLP

This paper presents a novel framework for predicting low-altitude network coverage using disentangled representation learning, addressing...

arXiv - Machine Learning · 4 min ·
[2602.12247] ExtractBench: A Benchmark and Evaluation Methodology for Complex Structured Extraction
LLMs

ExtractBench introduces a benchmark and evaluation framework for extracting structured data from unstructured documents like PDFs, addres...

arXiv - AI · 4 min ·
[2602.06801] On the Non-Identifiability of Steering Vectors in Large Language Models
LLMs

This paper explores the non-identifiability of steering vectors in large language models (LLMs), revealing that these vectors cannot be u...

arXiv - AI · 3 min ·