AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[D] How's MLX and jax/ pytorch on MacBooks these days?

So I'm looking at buying a new 14 inch MacBook pro with m5 pro and 64 gb of memory vs m4 max with same specs. My priorities are pro sof...

Reddit - Machine Learning · 1 min · 37 minutes ago

Ai Infrastructure

Who needs fancy stuff, When you can program, build, train and run 2 completely different ai agents on an i3 4GB RAM and onboard gpu chip? looool

And I know some of yall doubt - so I’ll follow up. submitted by /u/Snoo-76697 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

94.42% on BANKING77 Official Test Split — New Strong 2nd Place with Lightweight Embedding + Rerank (no 7B LLM)

94.42% Accuracy on Banking77 Official Test Split BANKING77-77 is deceptively hard: 77 fine-grained banking intents, noisy real-world quer...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

All Content

Machine Learning

[2602.19271] Taming Preconditioner Drift: Unlocking the Potential of Second-Order Optimizers for Federated Learning on Non-IID Data

This paper presents FedPAC, a framework to enhance the stability and accuracy of second-order optimizers in federated learning on non-IID...

arXiv - AI · 4 min · about 1 month ago

Ai Infrastructure

[2602.18471] Charting the Future of AI-supported Science Education: A Human-Centered Vision

This article discusses the transformative potential of AI in science education, proposing a human-centered framework for its ethical inte...

arXiv - AI · 4 min · about 1 month ago

Generative Ai

[2602.18470] Transforming Science Learning Materials in the Era of Artificial Intelligence

This article explores how AI is reshaping science learning materials, enhancing personalization, accessibility, and interactivity while a...

arXiv - AI · 4 min · about 1 month ago

Ai Infrastructure

[2602.18469] The Landscape of AI in Science Education: What is Changing and How to Respond

This article explores the transformative impact of AI on science education, highlighting changes in educational practices and the need fo...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18464] How Well Can LLM Agents Simulate End-User Security and Privacy Attitudes and Behaviors?

This paper investigates the effectiveness of large language model (LLM) agents in simulating user attitudes and behaviors towards securit...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19207] HybridFL: A Federated Learning Approach for Financial Crime Detection

The paper presents HybridFL, a federated learning approach designed for financial crime detection, which integrates horizontal and vertic...

arXiv - AI · 3 min · about 1 month ago

Robotics

[2602.18460] The Doctor Will (Still) See You Now: On the Structural Limits of Agentic AI in Healthcare

This article examines the limitations of agentic AI in healthcare, highlighting the gap between commercial promises and operational reali...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19169] Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

The paper introduces Virtual Parameter Sharpening (VPS), a novel technique for enhancing inference-time reasoning in transformer models t...

arXiv - AI · 3 min · about 1 month ago

Robotics

[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research

The article presents a novel evaluation framework for mechanistic interpretability research, utilizing AI agents to enhance research rigo...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

This paper explores how transformers learn through incremental acquisition of sparse attention patterns, revealing shifts in learning dyn...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19142] Celo2: Towards Learned Optimization Free Lunch

The paper 'Celo2: Towards Learned Optimization Free Lunch' presents a novel learned optimizer that significantly reduces the computationa...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

The paper presents TICL, a novel method for causal structure learning from interventional data, enhancing generalization across diverse s...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19126] Robust Predictive Uncertainty and Double Descent in Contaminated Bayesian Random Features

This paper presents a robust Bayesian approach to random feature regression, addressing prior and likelihood misspecification through Hub...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

The paper presents ConfSpec, a novel framework for efficient step-level speculative reasoning in large language models, achieving signifi...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

This study evaluates the effectiveness of large language models (LLMs) in generating subject lines for mental health counseling emails, h...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

This paper explores iterative feedback loops in image generative models, introducing the concept of neural resonance and its implications...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

This paper introduces the Active Data Reconstruction Attack (ADRA), a novel approach to detect language model training data by leveraging...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

This paper investigates the complexity of training deep neural networks under a realistic bit-level model, contrasting it with traditiona...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

The paper introduces CausalFlip, a benchmark for evaluating large language models' (LLMs) causal reasoning capabilities, emphasizing the ...

arXiv - AI · 4 min · about 1 month ago

Previous Page 97 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

[D] How's MLX and jax/ pytorch on MacBooks these days?

Who needs fancy stuff, When you can program, build, train and run 2 completely different ai agents on an i3 4GB RAM and onboard gpu chip? looool

94.42% on BANKING77 Official Test Split — New Strong 2nd Place with Lightweight Embedding + Rerank (no 7B LLM)

All Content

[2602.19271] Taming Preconditioner Drift: Unlocking the Potential of Second-Order Optimizers for Federated Learning on Non-IID Data

[2602.18471] Charting the Future of AI-supported Science Education: A Human-Centered Vision

[2602.18470] Transforming Science Learning Materials in the Era of Artificial Intelligence

[2602.18469] The Landscape of AI in Science Education: What is Changing and How to Respond

[2602.18464] How Well Can LLM Agents Simulate End-User Security and Privacy Attitudes and Behaviors?

[2602.19207] HybridFL: A Federated Learning Approach for Financial Crime Detection

[2602.18460] The Doctor Will (Still) See You Now: On the Structural Limits of Agentic AI in Healthcare

[2602.19169] Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

[2602.19142] Celo2: Towards Learned Optimization Free Lunch

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

[2602.19126] Robust Predictive Uncertainty and Double Descent in Contaminated Bayesian Random Features

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

Related Topics

Stay updated with AI News