AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min ·
Llms

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

submitted by /u/PatienceHistorical70 [link] [comments]

Reddit - Machine Learning · 1 min ·
Llms

Lemonade 10.1 released for latest improvements for local LLMs on AMD GPUs & NPUs

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min ·

All Content

All the important news from the ongoing India AI Impact Summit | TechCrunch
Ai Infrastructure

All the important news from the ongoing India AI Impact Summit | TechCrunch

India's AI Impact Summit gathers major tech leaders and heads of state, highlighting significant investments and developments in the AI s...

TechCrunch - AI · 7 min ·
[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model
Llms

[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

AstroMLab 4 introduces a 70B-parameter AI model specialized for astronomy, achieving benchmark-topping performance in Q&A tasks, surpassi...

arXiv - Machine Learning · 4 min ·
[2112.05128] Fair Community Detection and Structure Learning in Heterogeneous Graphical Models
Machine Learning

[2112.05128] Fair Community Detection and Structure Learning in Heterogeneous Graphical Models

This paper presents a novel approach for fair community detection in heterogeneous graphical models, ensuring demographic representation ...

arXiv - Machine Learning · 3 min ·
[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion
Machine Learning

[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion

The paper introduces HARPOON, a novel method for generating tabular data using generalized manifold guidance, addressing limitations in e...

arXiv - Machine Learning · 3 min ·
[2601.20198] DeRaDiff: Denoising Time Realignment of Diffusion Models
Machine Learning

[2601.20198] DeRaDiff: Denoising Time Realignment of Diffusion Models

The paper presents DeRaDiff, a novel method for denoising time realignment in diffusion models, enabling efficient adjustment of regulari...

arXiv - Machine Learning · 4 min ·
[2601.01944] The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities
Open Source Ai

[2601.01944] The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities

This article examines the impact of AI libraries on open source software (OSS) projects, analyzing their adoption in Python and Java to u...

arXiv - AI · 4 min ·
[2512.14873] How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal
Machine Learning

[2512.14873] How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal

This article analyzes the Fourier Analysis Network (FAN) and introduces a new Dual-Activation Layer (DAL) that enhances neural network pe...

arXiv - Machine Learning · 4 min ·
[2512.04954] Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows
Machine Learning

[2512.04954] Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows

This paper introduces a novel technique for amortized posterior estimation using Normalizing Flows, enhancing inference in high-dimension...

arXiv - Machine Learning · 3 min ·
[2511.19269] CDLM: Consistency Diffusion Language Models For Faster Sampling
Llms

[2511.19269] CDLM: Consistency Diffusion Language Models For Faster Sampling

The paper introduces Consistency Diffusion Language Models (CDLM), a method that accelerates inference in diffusion language models by re...

arXiv - Machine Learning · 3 min ·
[2511.10855] ExPairT-LLM: Exact Learning for LLM Code Selection by Pairwise Queries
Llms

[2511.10855] ExPairT-LLM: Exact Learning for LLM Code Selection by Pairwise Queries

The paper presents ExPairT-LLM, an innovative algorithm for code selection in LLMs that improves accuracy by using pairwise queries, outp...

arXiv - Machine Learning · 4 min ·
[2510.19675] Study of Training Dynamics for Memory-Constrained Fine-Tuning
Machine Learning

[2510.19675] Study of Training Dynamics for Memory-Constrained Fine-Tuning

This study presents TraDy, a novel transfer learning scheme for memory-constrained fine-tuning of deep neural networks, achieving state-o...

arXiv - Machine Learning · 3 min ·
[2508.04581] Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning
Llms

[2508.04581] Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

The paper presents a novel framework called MASA for weight sharing in transformers, reducing parameters by 66.7% while maintaining perfo...

arXiv - Machine Learning · 4 min ·
[2507.18031] ViGText: Deepfake Image Detection with Vision-Language Model Explanations and Graph Neural Networks
Llms

[2507.18031] ViGText: Deepfake Image Detection with Vision-Language Model Explanations and Graph Neural Networks

ViGText introduces a novel approach to deepfake detection by integrating Vision-Language Model explanations with Graph Neural Networks, e...

arXiv - Machine Learning · 4 min ·
[2510.02228] xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity
Llms

[2510.02228] xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity

The paper explores xLSTM scaling laws, demonstrating its competitive performance against Transformers with linear time complexity, offeri...

arXiv - Machine Learning · 4 min ·
[2507.11551] Landmark Detection for Medical Images using a General-purpose Segmentation Model
Machine Learning

[2507.11551] Landmark Detection for Medical Images using a General-purpose Segmentation Model

The paper presents a novel approach to anatomical landmark detection in medical images by combining YOLO and SAM models, enhancing segmen...

arXiv - AI · 4 min ·
[2509.22458] Physics-informed GNN for medium-high voltage AC power flow with edge-aware attention and line search correction operator
Machine Learning

[2509.22458] Physics-informed GNN for medium-high voltage AC power flow with edge-aware attention and line search correction operator

This article presents a novel Physics-informed Graph Neural Network (PIGNN) designed to enhance AC power flow analysis, achieving signifi...

arXiv - Machine Learning · 4 min ·
[2507.10587] Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing
Llms

[2507.10587] Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing

The paper discusses the concept of anthropomimetic uncertainty in language models, emphasizing the need for these models to express confi...

arXiv - AI · 4 min ·
[2506.15316] J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor
Machine Learning

[2506.15316] J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor

The paper presents J3DAI, a compact DNN-based hardware accelerator designed for 3D-stacked CMOS image sensors, emphasizing its efficiency...

arXiv - AI · 4 min ·
[2501.00755] An AI-powered Bayesian generative modeling approach for causal inference in observational studies
Machine Learning

[2501.00755] An AI-powered Bayesian generative modeling approach for causal inference in observational studies

The paper presents CausalBGM, an AI-driven Bayesian generative modeling approach designed for causal inference in observational studies, ...

arXiv - Machine Learning · 4 min ·
[2506.03725] Sign-SGD via Parameter-Free Optimization
Llms

[2506.03725] Sign-SGD via Parameter-Free Optimization

This paper introduces a parameter-free optimization method for Sign-SGD, enhancing efficiency in training large language models by elimin...

arXiv - Machine Learning · 4 min ·
Previous Page 107 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime