AI Infrastructure

GPUs, training clusters, MLOps, and deployment


Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 18 minutes ago

Machine Learning

[D] Will Google’s TurboQuant algorithm hurt AI demand for memory chips? [D]

Google's TurboQuant claims to compress the KV cache by up to 6x with 'little apparent loss in accuracy' by reconstructing it on the fly. ...

Reddit - Machine Learning · 1 min · about 1 hour ago

AI Infrastructure

Emails show Bank of America's struggles with Nvidia AI: 'You have to help us as local car mechanics drive the race car!'

Internal emails show Bank of America having difficulties with Nvidia's AI Factory, showing the challenges of integrating AI in regulated ...

AI Events · 5 min · about 1 hour ago

All Content

LLMs

[2602.11858] Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

The paper presents Region-to-Image Distillation, a novel approach to enhance fine-grained multimodal perception in MLLMs by internalizing...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.11575] ReaDy-Go: Real-to-Sim Dynamic 3D Gaussian Splatting Simulation for Environment-Specific Visual Navigation with Moving Obstacles

The paper presents ReaDy-Go, a novel simulation pipeline that enhances visual navigation in dynamic environments by integrating 3D Gaussi...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference

The paper explores religious perennialism through the lens of generative inference, using mathematical models to analyze distinct religio...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.10016] Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design

The paper 'Kunlun' presents a unified architecture for massive-scale recommendation systems, addressing scaling laws and resource allocat...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.09572] Predictive Query Language: A Domain-Specific Language for Predictive Modeling on Relational Databases

The paper introduces Predictive Query Language (PQL), a domain-specific language designed to streamline predictive modeling on relational...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.07506] VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots

VividFace presents a real-time system for humanoid robots to mimic human facial expressions, enhancing emotional interaction through adva...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.07294] Fin-RATE: A Real-world Financial Analytics and Tracking Evaluation Benchmark for LLMs on SEC Filings

The paper introduces Fin-RATE, a benchmark for evaluating Large Language Models (LLMs) on SEC filings, addressing the limitations of exis...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models

The paper introduces ShallowJail, a novel attack method targeting large language models (LLMs) by exploiting shallow alignment to manipul...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2601.02241] From Mice to Trains: Amortized Bayesian Inference on Graph Data

This article presents a novel approach to Amortized Bayesian Inference (ABI) tailored for graph data, addressing challenges in posterior ...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2512.16051] Graph Neural Networks for Interferometer Simulations

This article presents a novel application of Graph Neural Networks (GNNs) for simulating interferometer designs, specifically for the LIG...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2601.21812] A Decomposable Forward Process in Diffusion Models for Time-Series Forecasting

This paper presents a novel forward diffusion process for time-series forecasting that effectively decomposes signals into spectral compo...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2510.12764] AnyUp: Universal Feature Upsampling

The paper presents AnyUp, a novel method for universal feature upsampling applicable to various vision features at any resolution, enhanc...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2510.11418] Forward-Forward Autoencoder Architectures for Energy-Efficient Wireless Communications

This article presents Forward-Forward Autoencoder architectures aimed at enhancing energy efficiency in wireless communications, demonstr...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2512.22420] Nightjar: Dynamic Adaptive Speculative Decoding for Large Language Models Serving

The paper presents Nightjar, a novel algorithm for dynamic adaptive speculative decoding in large language models, enhancing throughput a...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2509.26335] TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs

The paper discusses TrackCore-F, a methodology for deploying Transformer-based models for subatomic particle tracking on FPGAs, highlight...

arXiv - Machine Learning · 3 min · about 2 months ago

NLP

[2509.18129] Pareto-optimal Trade-offs Between Communication and Computation with Flexible Gradient Tracking

This paper presents FlexGT, a method for optimizing distributed stochastic problems by balancing communication and computation, achieving...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2511.07293] Formal Reasoning About Confidence and Automated Verification of Neural Networks

This paper presents a framework for formal reasoning about the confidence and robustness of neural networks, proposing a unified techniqu...

arXiv - AI · 3 min · about 2 months ago

NLP

[2510.22876] Batch Speculative Decoding Done Right

The paper presents a novel framework for batch speculative decoding, addressing critical failures in existing methods and achieving signi...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2506.08749] Superposed parameterised quantum circuits

The paper introduces superposed parameterised quantum circuits, enhancing quantum machine learning by embedding multiple parameter sets i...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2506.05402] Lorica: A Synergistic Fine-Tuning Framework for Advancing Personalized Adversarial Robustness

The paper presents Lorica, a novel framework aimed at enhancing personalized adversarial robustness in machine learning models, particula...

arXiv - Machine Learning · 4 min · about 2 months ago

