AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 15 minutes ago

Ai Infrastructure

NVIDIA GTC 2026: Live Updates on What’s Next in AI

Rolling coverage from San Jose, including NVIDIA CEO Jensen Huang’s keynote, news highlights, live demos and on‑the‑ground color through ...

AI Events · 84 min · about 2 hours ago

Llms

Seeking Critique on Research Approach to Open Set Recognition (Novelty Detection) [R]

Hey guys, I'm an independent researcher working on a project that tries to address a very specific failure mode in LLMs and embedding bas...

Reddit - Machine Learning · 1 min · about 7 hours ago

All Content

Machine Learning

[2602.13977] WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL

The paper presents WoVR, a novel reinforcement learning framework that enhances the reliability of world models for Vision-Language-Actio...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.13515] SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

The paper presents SpargeAttention2, a novel trainable sparse attention method that enhances the efficiency of diffusion models by combin...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.13476] AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge

AsyncVLA introduces an asynchronous control framework for robotic navigation, enhancing real-time performance by decoupling semantic reas...

arXiv - Machine Learning · 3 min · about 2 months ago

Ai Infrastructure

[2602.13817] What happens when reviewers receive AI feedback in their reviews?

This article examines the impact of AI feedback on peer reviews, revealing both benefits and challenges faced by reviewers when using an ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13334] Ask the Expert: Collaborative Inference for Vision Transformers with Near-Edge Accelerators

This article presents a collaborative inference framework for deploying Vision Transformers on edge devices, addressing computational cha...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13718] HybridFlow: A Two-Step Generative Policy for Robotic Manipulation

The paper presents HybridFlow, a two-step generative policy designed to improve robotic manipulation by enhancing real-time interaction c...

arXiv - AI · 3 min · about 2 months ago

Nlp

[2602.13704] Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search

The paper presents Pailitao-VL, a multi-modal retrieval system designed for real-time industrial search, addressing key challenges in ret...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13671] MAS-on-the-Fly: Dynamic Adaptation of LLM-based Multi-Agent Systems at Test Time

The paper presents MASFly, a novel framework for dynamic adaptation of LLM-based multi-agent systems at test time, enhancing task perform...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13606] Multi-Modal Sensing and Fusion in mmWave Beamforming for Connected Vehicles: A Transformer Based Framework

This article presents a novel multi-modal sensing and fusion framework for mmWave beamforming in connected vehicles, enhancing communicat...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.13547] AISA: Awakening Intrinsic Safety Awareness in Large Language Models against Jailbreak Attacks

The paper presents AISA, a novel defense mechanism for large language models (LLMs) that enhances safety against jailbreak attacks by act...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13540] On Calibration of Large Language Models: From Response To Capability

This paper introduces the concept of capability calibration for large language models (LLMs), emphasizing the importance of accurate conf...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.14972] Use What You Know: Causal Foundation Models with Partial Graphs

This paper introduces a method for enhancing Causal Foundation Models (CFMs) by incorporating partial causal graph information, improving...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13496] Future of Edge AI in biodiversity monitoring

This article explores the role of Edge AI in biodiversity monitoring, analyzing 82 studies to assess system types, architectural trade-of...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.14896] Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs

This paper explores the algorithmic simplification of neural networks through a method called Mosaic-of-Motifs, demonstrating how structu...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13446] End-to-End NOMA with Perfect and Quantized CSI Over Rayleigh Fading Channels

This paper presents an end-to-end autoencoder framework for downlink non-orthogonal multiple access (NOMA) over Rayleigh fading channels,...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13452] LLM-Powered Automatic Translation and Urgency in Crisis Scenarios

This article examines the effectiveness of large language models (LLMs) in crisis communication, particularly focusing on multilingual tr...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13376] An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

This article presents a novel reference-free evaluation framework for assessing the quality of flowchart image-to-code generation, utiliz...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.14759] Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

The paper presents 'Inner Loop Inference,' a method for enhancing pretrained Transformers by iteratively refining outputs during inferenc...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14728] D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation

D2-LoRA introduces a novel method for efficient fine-tuning in machine learning, achieving significant accuracy improvements while minimi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13357] AdaCorrection: Adaptive Offset Cache Correction for Accurate Diffusion Transformers

The paper introduces AdaCorrection, a framework that enhances the efficiency of Diffusion Transformers by correcting cache misalignment, ...

arXiv - AI · 3 min · about 2 months ago

Previous Page 162 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

NVIDIA GTC 2026: Live Updates on What’s Next in AI

Seeking Critique on Research Approach to Open Set Recognition (Novelty Detection) [R]

All Content

[2602.13977] WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL

[2602.13515] SpargeAttention2: Trainable Sparse Attention via Hybrid Top-k+Top-p Masking and Distillation Fine-Tuning

[2602.13476] AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge

[2602.13817] What happens when reviewers receive AI feedback in their reviews?

[2602.13334] Ask the Expert: Collaborative Inference for Vision Transformers with Near-Edge Accelerators

[2602.13718] HybridFlow: A Two-Step Generative Policy for Robotic Manipulation

[2602.13704] Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search

[2602.13671] MAS-on-the-Fly: Dynamic Adaptation of LLM-based Multi-Agent Systems at Test Time

[2602.13606] Multi-Modal Sensing and Fusion in mmWave Beamforming for Connected Vehicles: A Transformer Based Framework

[2602.13547] AISA: Awakening Intrinsic Safety Awareness in Large Language Models against Jailbreak Attacks

[2602.13540] On Calibration of Large Language Models: From Response To Capability

[2602.14972] Use What You Know: Causal Foundation Models with Partial Graphs

[2602.13496] Future of Edge AI in biodiversity monitoring

[2602.14896] Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs

[2602.13446] End-to-End NOMA with Perfect and Quantized CSI Over Rayleigh Fading Channels

[2602.13452] LLM-Powered Automatic Translation and Urgency in Crisis Scenarios

[2602.13376] An Online Reference-Free Evaluation Framework for Flowchart Image-to-Code Generation

[2602.14759] Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

[2602.14728] D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation

[2602.13357] AdaCorrection: Adaptive Offset Cache Correction for Accurate Diffusion Transformers

Related Topics

Stay updated with AI News