AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raise | TechCrunch

OpenAI's latest funding round, led by Amazon, Nvidia, and SoftBank, values the AI lab at $852 billion as it nears an IPO.

TechCrunch - AI · 4 min · about 2 hours ago

Machine Learning

[R] Fine-tuning services report

If you have some data and want to train or run a small custom model but don't have powerful enough hardware for training, fine-tuning ser...

Reddit - Machine Learning · 1 min · about 6 hours ago

Machine Learning

The AI Chip War is Just Getting Started

Everyone talks about AI models, but the real bottleneck might be hardware. According to a recent study by Roots Analysis: AI chip market ...

Reddit - Artificial Intelligence · 1 min · about 9 hours ago

All Content

Llms

[2510.18871] How Do LLMs Use Their Depth?

Abstract page for arXiv paper 2510.18871: How Do LLMs Use Their Depth?

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2510.16028] TAO: Tolerance-Aware Optimistic Verification for Floating-Point Neural Networks

Abstract page for arXiv paper 2510.16028: TAO: Tolerance-Aware Optimistic Verification for Floating-Point Neural Networks

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2510.21910] Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

Abstract page for arXiv paper 2510.21910: Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2510.20264] Optimistic Task Inference for Behavior Foundation Models

Abstract page for arXiv paper 2510.20264: Optimistic Task Inference for Behavior Foundation Models

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2510.15301] Latent Diffusion Model without Variational Autoencoder

Abstract page for arXiv paper 2510.15301: Latent Diffusion Model without Variational Autoencoder

arXiv - AI · 4 min · 29 days ago

Llms

[2510.18245] Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

Abstract page for arXiv paper 2510.18245: Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2510.09462] Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Abstract page for arXiv paper 2510.09462: Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Abstract page for arXiv paper 2510.07940: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2510.07959] DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

Abstract page for arXiv paper 2510.07959: DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

arXiv - Machine Learning · 4 min · 29 days ago

Nlp

[2510.07746] t-SNE Exaggerates Clusters, Provably

Abstract page for arXiv paper 2510.07746: t-SNE Exaggerates Clusters, Provably

arXiv - Machine Learning · 3 min · 29 days ago

Llms

[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

Abstract page for arXiv paper 2510.05109: Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on B...

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2510.03638] Expressive Power of Implicit Models: Rich Equilibria and Test-Time Scaling

Abstract page for arXiv paper 2510.03638: Expressive Power of Implicit Models: Rich Equilibria and Test-Time Scaling

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2510.02999] Untargeted Jailbreak Attack

Abstract page for arXiv paper 2510.02999: Untargeted Jailbreak Attack

arXiv - AI · 4 min · 29 days ago

Llms

[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

Abstract page for arXiv paper 2509.26432: AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2509.25837] Distillation of Large Language Models via Concrete Score Matching

Abstract page for arXiv paper 2509.25837: Distillation of Large Language Models via Concrete Score Matching

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2509.25532] Calibrating Verbalized Confidence with Self-Generated Distractors

Abstract page for arXiv paper 2509.25532: Calibrating Verbalized Confidence with Self-Generated Distractors

arXiv - AI · 4 min · 29 days ago

Llms

[2509.22957] Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

Abstract page for arXiv paper 2509.22957: Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2509.25175] EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Abstract page for arXiv paper 2509.25175: EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

arXiv - AI · 3 min · 29 days ago

Machine Learning

[2509.21835] On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

Abstract page for arXiv paper 2509.21835: On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2509.20323] A Recovery Guarantee for Sparse Neural Networks

Abstract page for arXiv paper 2509.20323: A Recovery Guarantee for Sparse Neural Networks

arXiv - Machine Learning · 3 min · 29 days ago

Previous Page 47 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raise | TechCrunch

[R] Fine-tuning services report

The AI Chip War is Just Getting Started

All Content

[2510.18871] How Do LLMs Use Their Depth?

[2510.16028] TAO: Tolerance-Aware Optimistic Verification for Floating-Point Neural Networks

[2510.21910] Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks

[2510.20264] Optimistic Task Inference for Behavior Foundation Models

[2510.15301] Latent Diffusion Model without Variational Autoencoder

[2510.18245] Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

[2510.09462] Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

[2510.07959] DISCO: Diversifying Sample Condensation for Efficient Model Evaluation

[2510.07746] t-SNE Exaggerates Clusters, Provably

[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

[2510.03638] Expressive Power of Implicit Models: Rich Equilibria and Test-Time Scaling

[2510.02999] Untargeted Jailbreak Attack

[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

[2509.25837] Distillation of Large Language Models via Concrete Score Matching

[2509.25532] Calibrating Verbalized Confidence with Self-Generated Distractors

[2509.22957] Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

[2509.25175] EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

[2509.21835] On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

[2509.20323] A Recovery Guarantee for Sparse Neural Networks

Related Topics

Stay updated with AI News