AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[D] Will Google’s TurboQuant algorithm hurt AI demand for memory chips? [D]

Google's TurboQuant claims to compress the KV cache by up to 6x with 'little apparent loss in accuracy' by reconstructing it on the fly. ...

Reddit - Machine Learning · 1 min ·
Emails show Bank of America's struggles with Nvidia AI: 'You have to help us as local car mechanics drive the race car!'
Ai Infrastructure

Emails show Bank of America's struggles with Nvidia AI: 'You have to help us as local car mechanics drive the race car!'

Internal emails show Bank of America having difficulties with Nvidia's AI Factory, showing the challenges of integrating AI in regulated ...

AI Events · 5 min ·

All Content

[2602.11858] Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
Llms

[2602.11858] Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

The paper presents Region-to-Image Distillation, a novel approach to enhance fine-grained multimodal perception in MLLMs by internalizing...

arXiv - AI · 4 min ·
[2602.11575] ReaDy-Go: Real-to-Sim Dynamic 3D Gaussian Splatting Simulation for Environment-Specific Visual Navigation with Moving Obstacles
Machine Learning

[2602.11575] ReaDy-Go: Real-to-Sim Dynamic 3D Gaussian Splatting Simulation for Environment-Specific Visual Navigation with Moving Obstacles

The paper presents ReaDy-Go, a novel simulation pipeline that enhances visual navigation in dynamic environments by integrating 3D Gaussi...

arXiv - AI · 4 min ·
[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference
Machine Learning

[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference

The paper explores religious perennialism through the lens of generative inference, using mathematical models to analyze distinct religio...

arXiv - AI · 3 min ·
[2602.10016] Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design
Llms

[2602.10016] Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design

The paper 'Kunlun' presents a unified architecture for massive-scale recommendation systems, addressing scaling laws and resource allocat...

arXiv - AI · 4 min ·
[2602.09572] Predictive Query Language: A Domain-Specific Language for Predictive Modeling on Relational Databases
Machine Learning

[2602.09572] Predictive Query Language: A Domain-Specific Language for Predictive Modeling on Relational Databases

The paper introduces Predictive Query Language (PQL), a domain-specific language designed to streamline predictive modeling on relational...

arXiv - AI · 4 min ·
[2602.07506] VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots
Machine Learning

[2602.07506] VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots

VividFace presents a real-time system for humanoid robots to mimic human facial expressions, enhancing emotional interaction through adva...

arXiv - AI · 4 min ·
[2602.07294] Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings
Llms

[2602.07294] Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings

The paper introduces Fin-RATE, a benchmark for evaluating Large Language Models (LLMs) on SEC filings, addressing the limitations of exis...

arXiv - AI · 4 min ·
[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models
Llms

[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models

The paper introduces ShallowJail, a novel attack method targeting large language models (LLMs) by exploiting shallow alignment to manipul...

arXiv - AI · 3 min ·
[2601.02241] From Mice to Trains: Amortized Bayesian Inference on Graph Data
Machine Learning

[2601.02241] From Mice to Trains: Amortized Bayesian Inference on Graph Data

This article presents a novel approach to Amortized Bayesian Inference (ABI) tailored for graph data, addressing challenges in posterior ...

arXiv - Machine Learning · 4 min ·
[2512.16051] Graph Neural Networks for Interferometer Simulations
Machine Learning

[2512.16051] Graph Neural Networks for Interferometer Simulations

This article presents a novel application of Graph Neural Networks (GNNs) for simulating interferometer designs, specifically for the LIG...

arXiv - Machine Learning · 3 min ·
[2601.21812] A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting
Machine Learning

[2601.21812] A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting

This paper presents a novel forward diffusion process for time-series forecasting that effectively decomposes signals into spectral compo...

arXiv - Machine Learning · 3 min ·
[2510.12764] AnyUp: Universal Feature Upsampling
Machine Learning

[2510.12764] AnyUp: Universal Feature Upsampling

The paper presents AnyUp, a novel method for universal feature upsampling applicable to various vision features at any resolution, enhanc...

arXiv - Machine Learning · 3 min ·
[2510.11418] Forward-Forward Autoencoder Architectures for Energy-Efficient Wireless Communications
Machine Learning

[2510.11418] Forward-Forward Autoencoder Architectures for Energy-Efficient Wireless Communications

This article presents Forward-Forward Autoencoder architectures aimed at enhancing energy efficiency in wireless communications, demonstr...

arXiv - Machine Learning · 3 min ·
[2512.22420] Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving
Llms

[2512.22420] Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving

The paper presents Nightjar, a novel algorithm for dynamic adaptive speculative decoding in large language models, enhancing throughput a...

arXiv - AI · 3 min ·
[2509.26335] TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs
Machine Learning

[2509.26335] TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs

The paper discusses TrackCore-F, a methodology for deploying Transformer-based models for subatomic particle tracking on FPGAs, highlight...

arXiv - Machine Learning · 3 min ·
[2509.18129] Pareto-optimal Trade-offs Between Communication and Computation with Flexible Gradient Tracking
Nlp

[2509.18129] Pareto-optimal Trade-offs Between Communication and Computation with Flexible Gradient Tracking

This paper presents FlexGT, a method for optimizing distributed stochastic problems by balancing communication and computation, achieving...

arXiv - Machine Learning · 4 min ·
[2511.07293] Formal Reasoning About Confidence and Automated Verification of Neural Networks
Machine Learning

[2511.07293] Formal Reasoning About Confidence and Automated Verification of Neural Networks

This paper presents a framework for formal reasoning about the confidence and robustness of neural networks, proposing a unified techniqu...

arXiv - AI · 3 min ·
[2510.22876] Batch Speculative Decoding Done Right
Nlp

[2510.22876] Batch Speculative Decoding Done Right

The paper presents a novel framework for batch speculative decoding, addressing critical failures in existing methods and achieving signi...

arXiv - AI · 4 min ·
[2506.08749] Superposed parameterised quantum circuits
Machine Learning

[2506.08749] Superposed parameterised quantum circuits

The paper introduces superposed parameterised quantum circuits, enhancing quantum machine learning by embedding multiple parameter sets i...

arXiv - Machine Learning · 4 min ·
[2506.05402] Lorica: A Synergistic Fine-Tuning Framework for Advancing Personalized Adversarial Robustness
Machine Learning

[2506.05402] Lorica: A Synergistic Fine-Tuning Framework for Advancing Personalized Adversarial Robustness

The paper presents Lorica, a novel framework aimed at enhancing personalized adversarial robustness in machine learning models, particula...

arXiv - Machine Learning · 4 min ·
Previous Page 141 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime