AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Infrastructure

Firmus, the 'Southgate' AI datacenter builder backed by Nvidia, hits $5.5B valuation | TechCrunch

Nvidia-backed Asia AI data center provider Firmus has now raised $1.35 billion in six months.

TechCrunch - AI · 3 min · about 5 hours ago

Machine Learning

Anthropic debuts ‘Project Glasswing’ and new AI model for cybersecurity | The Verge

Anthropic launched Project Glasswing, a cybersecurity initiative in which it’s partnering with Nvidia, Apple, and others, and debuted a n...

The Verge - AI · 5 min · about 5 hours ago

Nlp

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min · about 6 hours ago

All Content

Llms

[2602.17676] Epistemic Traps: Rational Misalignment Driven by Model Misspecification

This paper explores how model misspecification leads to rational misalignments in AI behavior, presenting a new framework for understandi...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17952] Hardware-Friendly Input Expansion for Accelerating Function Approximation

This paper presents a hardware-friendly method for accelerating function approximation through input-space expansion, enhancing convergen...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17861] JAX-Privacy: A library for differentially private machine learning

JAX-Privacy is a new library aimed at simplifying the implementation of differentially private machine learning, offering both customizat...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.17849] Dual Length Codes for Lossless Compression of BFloat16

This paper presents Dual Length Codes, a novel hybrid approach for lossless compression of BFloat16 data, improving decoding speed while ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17835] Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

The paper presents Iprox, a two-stage framework for gradient-based data selection in LLM fine-tuning, which constructs influence-preservi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17829] Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

This paper introduces ruleXplain, a framework utilizing Large Language Models to extract causal rules from multivariate timeseries data, ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17809] Calibrated Adaptation: Bayesian Stiefel Manifold Priors for Reliable Parameter-Efficient Fine-Tuning

This paper introduces Stiefel-Bayes Adapters (SBA), a Bayesian framework for parameter-efficient fine-tuning of large language models, en...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17751] Investigating Target Class Influence on Neural Network Compressibility for Energy-Autonomous Avian Monitoring

This paper explores the impact of target class selection on the compressibility of neural networks for avian monitoring using energy-auto...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17700] MIDAS: Mosaic Input-Specific Differentiable Architecture Search

MIDAS introduces a novel approach to differentiable neural architecture search by utilizing input-specific parameters and self-attention ...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.17698] ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs

The paper presents ScaleBITS, a mixed-precision quantization framework designed to optimize bitwidth allocation in large language models,...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.17697] Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters

This article introduces a novel approach to optimizing inference hyperparameters in Large Language Models (LLMs) using variability modeli...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17694] AsynDBT: Asynchronous Distributed Bilevel Tuning for efficient In-Context Learning with Large Language Models

The paper presents AsynDBT, an innovative algorithm for asynchronous distributed bilevel tuning aimed at improving in-context learning wi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17693] A Case Study of Selected PTQ Baselines for Reasoning LLMs on Ascend NPU

This article presents a case study on the effectiveness of Post-Training Quantization (PTQ) methods for reasoning-oriented large language...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.17691] Tethered Reasoning: Decoupling Entropy from Hallucination in Quantized LLMs via Manifold Steering

This paper introduces HELIX, a framework to improve quantized language models by decoupling output entropy from hallucination, enhancing ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17684] CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

The paper presents CodeScaler, an execution-free reward model that enhances the scalability of code LLM training and test-time inference,...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17682] Duality Models: An Embarrassingly Simple One-step Generation Paradigm

The paper presents Duality Models (DuMo), a novel approach in generative modeling that enhances stability and efficiency by using a share...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17681] LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

The paper presents LATMiX, a method for enhancing quantization in large language models (LLMs) through learnable affine transformations, ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17679] Joint Parameter and State-Space Bayesian Optimization: Using Process Expertise to Accelerate Manufacturing Optimization

This article presents a novel Bayesian optimization framework, POGPN-JPSS, that integrates process expertise to enhance the efficiency of...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

'Thermodynamic computer' can mimic AI neural networks — using orders of magnitude less energy to generate images

The article discusses a new type of thermodynamic computer that can replicate the functions of AI neural networks while consuming signifi...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Generative Ai

Interested in AI workflow for filmmaking

The discussion focuses on leveraging AI in filmmaking, emphasizing the need for professionals to adapt and prepare for AI's growing role ...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Previous Page 110 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

Firmus, the 'Southgate' AI datacenter builder backed by Nvidia, hits $5.5B valuation | TechCrunch

Anthropic debuts ‘Project Glasswing’ and new AI model for cybersecurity | The Verge

Has anyone here switched to TeraBox recently? Is it actually worth it?

All Content

[2602.17676] Epistemic Traps: Rational Misalignment Driven by Model Misspecification

[2602.17952] Hardware-Friendly Input Expansion for Accelerating Function Approximation

[2602.17861] JAX-Privacy: A library for differentially private machine learning

[2602.17849] Dual Length Codes for Lossless Compression of BFloat16

[2602.17835] Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

[2602.17829] Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

[2602.17809] Calibrated Adaptation: Bayesian Stiefel Manifold Priors for Reliable Parameter-Efficient Fine-Tuning

[2602.17751] Investigating Target Class Influence on Neural Network Compressibility for Energy-Autonomous Avian Monitoring

[2602.17700] MIDAS: Mosaic Input-Specific Differentiable Architecture Search

[2602.17698] ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs

[2602.17697] Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters

[2602.17694] AsynDBT: Asynchronous Distributed Bilevel Tuning for efficient In-Context Learning with Large Language Models

[2602.17693] A Case Study of Selected PTQ Baselines for Reasoning LLMs on Ascend NPU

[2602.17691] Tethered Reasoning: Decoupling Entropy from Hallucination in Quantized LLMs via Manifold Steering

[2602.17684] CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

[2602.17682] Duality Models: An Embarrassingly Simple One-step Generation Paradigm

[2602.17681] LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

[2602.17679] Joint Parameter and State-Space Bayesian Optimization: Using Process Expertise to Accelerate Manufacturing Optimization

'Thermodynamic computer' can mimic AI neural networks — using orders of magnitude less energy to generate images

Interested in AI workflow for filmmaking

Related Topics

Stay updated with AI News