AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

Firmus, the 'Southgate' AI datacenter builder backed by Nvidia, hits $5.5B valuation | TechCrunch

Nvidia-backed Asia AI data center provider Firmus has now raised $1.35 billion in six months.

TechCrunch - AI · 3 min ·
Machine Learning

Anthropic debuts ‘Project Glasswing’ and new AI model for cybersecurity | The Verge

Anthropic launched Project Glasswing, a cybersecurity initiative in which it’s partnering with Nvidia, Apple, and others, and debuted a n...

The Verge - AI · 5 min ·
NLP

Has anyone here switched to TeraBox recently? Is it actually worth it?

I’ve been seeing more people talk about TeraBox lately, especially around storage for AI-related workflows. Curious if anyone here has us...

Reddit - Artificial Intelligence · 1 min ·

All Content

LLMs

[2602.17676] Epistemic Traps: Rational Misalignment Driven by Model Misspecification

This paper explores how model misspecification leads to rational misalignments in AI behavior, presenting a new framework for understandi...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.17952] Hardware-Friendly Input Expansion for Accelerating Function Approximation

This paper presents a hardware-friendly method for accelerating function approximation through input-space expansion, enhancing convergen...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.17861] JAX-Privacy: A library for differentially private machine learning

JAX-Privacy is a new library aimed at simplifying the implementation of differentially private machine learning, offering both customizat...

arXiv - Machine Learning · 3 min ·
LLMs

[2602.17849] Dual Length Codes for Lossless Compression of BFloat16

This paper presents Dual Length Codes, a novel hybrid approach for lossless compression of BFloat16 data, improving decoding speed while ...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17835] Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

The paper presents Iprox, a two-stage framework for gradient-based data selection in LLM fine-tuning, which constructs influence-preservi...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17829] Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

This paper introduces ruleXplain, a framework utilizing Large Language Models to extract causal rules from multivariate timeseries data, ...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17809] Calibrated Adaptation: Bayesian Stiefel Manifold Priors for Reliable Parameter-Efficient Fine-Tuning

This paper introduces Stiefel-Bayes Adapters (SBA), a Bayesian framework for parameter-efficient fine-tuning of large language models, en...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.17751] Investigating Target Class Influence on Neural Network Compressibility for Energy-Autonomous Avian Monitoring

This paper explores the impact of target class selection on the compressibility of neural networks for avian monitoring using energy-auto...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.17700] MIDAS: Mosaic Input-Specific Differentiable Architecture Search

MIDAS introduces a novel approach to differentiable neural architecture search by utilizing input-specific parameters and self-attention ...

arXiv - Machine Learning · 3 min ·
LLMs

[2602.17698] ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs

The paper presents ScaleBITS, a mixed-precision quantization framework designed to optimize bitwidth allocation in large language models,...

arXiv - Machine Learning · 3 min ·
LLMs

[2602.17697] Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters

This article introduces a novel approach to optimizing inference hyperparameters in Large Language Models (LLMs) using variability modeli...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17694] AsynDBT: Asynchronous Distributed Bilevel Tuning for efficient In-Context Learning with Large Language Models

The paper presents AsynDBT, an innovative algorithm for asynchronous distributed bilevel tuning aimed at improving in-context learning wi...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17693] A Case Study of Selected PTQ Baselines for Reasoning LLMs on Ascend NPU

This article presents a case study on the effectiveness of Post-Training Quantization (PTQ) methods for reasoning-oriented large language...

arXiv - Machine Learning · 3 min ·
LLMs

[2602.17691] Tethered Reasoning: Decoupling Entropy from Hallucination in Quantized LLMs via Manifold Steering

This paper introduces HELIX, a framework to improve quantized language models by decoupling output entropy from hallucination, enhancing ...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17684] CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

The paper presents CodeScaler, an execution-free reward model that enhances the scalability of code LLM training and test-time inference,...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.17682] Duality Models: An Embarrassingly Simple One-step Generation Paradigm

The paper presents Duality Models (DuMo), a novel approach in generative modeling that enhances stability and efficiency by using a share...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.17681] LATMiX: Learnable Affine Transformations for Microscaling Quantization of LLMs

The paper presents LATMiX, a method for enhancing quantization in large language models (LLMs) through learnable affine transformations, ...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.17679] Joint Parameter and State-Space Bayesian Optimization: Using Process Expertise to Accelerate Manufacturing Optimization

This article presents a novel Bayesian optimization framework, POGPN-JPSS, that integrates process expertise to enhance the efficiency of...

arXiv - Machine Learning · 4 min ·
Machine Learning

'Thermodynamic computer' can mimic AI neural networks — using orders of magnitude less energy to generate images

The article discusses a new type of thermodynamic computer that can replicate the functions of AI neural networks while consuming signifi...

Reddit - Artificial Intelligence · 1 min ·
Generative AI

Interested in AI workflow for filmmaking

The discussion focuses on leveraging AI in filmmaking, emphasizing the need for professionals to adapt and prepare for AI's growing role ...

Reddit - Artificial Intelligence · 1 min ·