AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min · 37 minutes ago

Llms

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

submitted by /u/PatienceHistorical70 [link] [comments]

Reddit - Machine Learning · 1 min · about 3 hours ago

Llms

Lemonade 10.1 released for latest improvements for local LLMs on AMD GPUs & NPUs

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

Ai Infrastructure

All the important news from the ongoing India AI Impact Summit | TechCrunch

India's AI Impact Summit gathers major tech leaders and heads of state, highlighting significant investments and developments in the AI s...

TechCrunch - AI · 7 min · about 1 month ago

Llms

[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

AstroMLab 4 introduces a 70B-parameter AI model specialized for astronomy, achieving benchmark-topping performance in Q&A tasks, surpassi...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2112.05128] Fair Community Detection and Structure Learning in Heterogeneous Graphical Models

This paper presents a novel approach for fair community detection in heterogeneous graphical models, ensuring demographic representation ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion

The paper introduces HARPOON, a novel method for generating tabular data using generalized manifold guidance, addressing limitations in e...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2601.20198] DeRaDiff: Denoising Time Realignment of Diffusion Models

The paper presents DeRaDiff, a novel method for denoising time realignment in diffusion models, enabling efficient adjustment of regulari...

arXiv - Machine Learning · 4 min · about 1 month ago

Open Source Ai

[2601.01944] The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities

This article examines the impact of AI libraries on open source software (OSS) projects, analyzing their adoption in Python and Java to u...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2512.14873] How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal

This article analyzes the Fourier Analysis Network (FAN) and introduces a new Dual-Activation Layer (DAL) that enhances neural network pe...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2512.04954] Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows

This paper introduces a novel technique for amortized posterior estimation using Normalizing Flows, enhancing inference in high-dimension...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2511.19269] CDLM: Consistency Diffusion Language Models For Faster Sampling

The paper introduces Consistency Diffusion Language Models (CDLM), a method that accelerates inference in diffusion language models by re...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2511.10855] ExPairT-LLM: Exact Learning for LLM Code Selection by Pairwise Queries

The paper presents ExPairT-LLM, an innovative algorithm for code selection in LLMs that improves accuracy by using pairwise queries, outp...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2510.19675] Study of Training Dynamics for Memory-Constrained Fine-Tuning

This study presents TraDy, a novel transfer learning scheme for memory-constrained fine-tuning of deep neural networks, achieving state-o...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2508.04581] Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

The paper presents a novel framework called MASA for weight sharing in transformers, reducing parameters by 66.7% while maintaining perfo...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2507.18031] ViGText: Deepfake Image Detection with Vision-Language Model Explanations and Graph Neural Networks

ViGText introduces a novel approach to deepfake detection by integrating Vision-Language Model explanations with Graph Neural Networks, e...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.02228] xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity

The paper explores xLSTM scaling laws, demonstrating its competitive performance against Transformers with linear time complexity, offeri...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.11551] Landmark Detection for Medical Images using a General-purpose Segmentation Model

The paper presents a novel approach to anatomical landmark detection in medical images by combining YOLO and SAM models, enhancing segmen...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2509.22458] Physics-informed GNN for medium-high voltage AC power flow with edge-aware attention and line search correction operator

This article presents a novel Physics-informed Graph Neural Network (PIGNN) designed to enhance AC power flow analysis, achieving signifi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2507.10587] Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing

The paper discusses the concept of anthropomimetic uncertainty in language models, emphasizing the need for these models to express confi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2506.15316] J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor

The paper presents J3DAI, a compact DNN-based hardware accelerator designed for 3D-stacked CMOS image sensors, emphasizing its efficiency...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2501.00755] An AI-powered Bayesian generative modeling approach for causal inference in observational studies

The paper presents CausalBGM, an AI-driven Bayesian generative modeling approach designed for causal inference in observational studies, ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2506.03725] Sign-SGD via Parameter-Free Optimization

This paper introduces a parameter-free optimization method for Sign-SGD, enhancing efficiency in training large language models by elimin...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 107 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

Has anyone here switched to TeraBox recently? Is it actually worth it?

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

Lemonade 10.1 released for latest improvements for local LLMs on AMD GPUs & NPUs

All Content

All the important news from the ongoing India AI Impact Summit | TechCrunch

[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

[2112.05128] Fair Community Detection and Structure Learning in Heterogeneous Graphical Models

[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion

[2601.20198] DeRaDiff: Denoising Time Realignment of Diffusion Models

[2601.01944] The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities

[2512.14873] How Does Fourier Analysis Network Work? A Mechanism Analysis and a New Dual-Activation Layer Proposal

[2512.04954] Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows

[2511.19269] CDLM: Consistency Diffusion Language Models For Faster Sampling

[2511.10855] ExPairT-LLM: Exact Learning for LLM Code Selection by Pairwise Queries

[2510.19675] Study of Training Dynamics for Memory-Constrained Fine-Tuning

[2508.04581] Share Your Attention: Transformer Weight Sharing via Matrix-based Dictionary Learning

[2507.18031] ViGText: Deepfake Image Detection with Vision-Language Model Explanations and Graph Neural Networks

[2510.02228] xLSTM Scaling Laws: Competitive Performance with Linear Time-Complexity

[2507.11551] Landmark Detection for Medical Images using a General-purpose Segmentation Model

[2509.22458] Physics-informed GNN for medium-high voltage AC power flow with edge-aware attention and line search correction operator

[2507.10587] Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing

[2506.15316] J3DAI: A tiny DNN-Based Edge AI Accelerator for 3D-Stacked CMOS Image Sensor

[2501.00755] An AI-powered Bayesian generative modeling approach for causal inference in observational studies

[2506.03725] Sign-SGD via Parameter-Free Optimization

Related Topics

Stay updated with AI News