Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

Is it actually possible to build a model-agnostic persistent text layer that keeps AI behavior stable?

Is it actually possible to define a persistent, model-agnostic text-based layer (loaded with the model each time) that keeps an AI system...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Are gamers being used as free labeling labor? The rise of "Simulators" that look like AI training grounds [D]

Hey everyone, I’m an AI news curator and editor currently working on a piece about a weird trend I’ve been spotting: technical simulators...

Reddit - Machine Learning · 1 min ·
Machine Learning

Coherence Without Convergence: A New Protocol for Multi-Agent AI

Opening For the past year, most progress in multi-agent AI has followed a familiar pattern: Add more agents. Add more coordination. Watch...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting
Machine Learning

[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

Abstract page for arXiv paper 2508.13773: PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series F...

arXiv - Machine Learning · 3 min ·
[2508.04329] Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning
Llms

[2508.04329] Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

Abstract page for arXiv paper 2508.04329: Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

arXiv - Machine Learning · 4 min ·
[2508.02343] MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
Llms

[2508.02343] MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

Abstract page for arXiv paper 2508.02343: MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language M...

arXiv - Machine Learning · 4 min ·
[2507.15162] Designing User-Centric Metrics for Evaluation of Counterfactual Explanations
Machine Learning

[2507.15162] Designing User-Centric Metrics for Evaluation of Counterfactual Explanations

Abstract page for arXiv paper 2507.15162: Designing User-Centric Metrics for Evaluation of Counterfactual Explanations

arXiv - Machine Learning · 4 min ·
[2507.03119] Improving ideal MHD equilibrium accuracy with physics-informed neural networks
Machine Learning

[2507.03119] Improving ideal MHD equilibrium accuracy with physics-informed neural networks

Abstract page for arXiv paper 2507.03119: Improving ideal MHD equilibrium accuracy with physics-informed neural networks

arXiv - Machine Learning · 3 min ·
[2506.10127] Meet Me at the Arm: The Cooperative Multi-Armed Bandits Problem with Shareable Arms
Machine Learning

[2506.10127] Meet Me at the Arm: The Cooperative Multi-Armed Bandits Problem with Shareable Arms

Abstract page for arXiv paper 2506.10127: Meet Me at the Arm: The Cooperative Multi-Armed Bandits Problem with Shareable Arms

arXiv - Machine Learning · 3 min ·
[2505.13820] Structured Agent Distillation for Large Language Model
Llms

[2505.13820] Structured Agent Distillation for Large Language Model

Abstract page for arXiv paper 2505.13820: Structured Agent Distillation for Large Language Model

arXiv - Machine Learning · 4 min ·
[2505.13280] FlowPure: Continuous Normalizing Flows for Adversarial Purification
Machine Learning

[2505.13280] FlowPure: Continuous Normalizing Flows for Adversarial Purification

Abstract page for arXiv paper 2505.13280: FlowPure: Continuous Normalizing Flows for Adversarial Purification

arXiv - Machine Learning · 4 min ·
[2505.11349] Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning
Llms

[2505.11349] Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning

Abstract page for arXiv paper 2505.11349: Context parroting: A simple but tough-to-beat baseline for foundation models in scientific mach...

arXiv - Machine Learning · 4 min ·
[2505.11035] Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeling Scenarios
Machine Learning

[2505.11035] Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeling Scenarios

Abstract page for arXiv paper 2505.11035: Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeli...

arXiv - Machine Learning · 4 min ·
[2505.08137] Large Language Models for Computer-Aided Design: A Survey
Llms

[2505.08137] Large Language Models for Computer-Aided Design: A Survey

Abstract page for arXiv paper 2505.08137: Large Language Models for Computer-Aided Design: A Survey

arXiv - Machine Learning · 4 min ·
[2505.01448] OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models
Machine Learning

[2505.01448] OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

Abstract page for arXiv paper 2505.01448: OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

arXiv - Machine Learning · 4 min ·
[2504.10833] Measuring the (Un)Faithfulness of Concept-Based Explanations
Machine Learning

[2504.10833] Measuring the (Un)Faithfulness of Concept-Based Explanations

Abstract page for arXiv paper 2504.10833: Measuring the (Un)Faithfulness of Concept-Based Explanations

arXiv - Machine Learning · 4 min ·
[2503.09008] Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a Measurement
Machine Learning

[2503.09008] Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a Measurement

Abstract page for arXiv paper 2503.09008: Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset an...

arXiv - Machine Learning · 4 min ·
[2503.05371] Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs
Llms

[2503.05371] Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs

Abstract page for arXiv paper 2503.05371: Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs

arXiv - Machine Learning · 4 min ·
[2502.07297] MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials
Machine Learning

[2502.07297] MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials

Abstract page for arXiv paper 2502.07297: MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials

arXiv - Machine Learning · 4 min ·
[2502.00472] Binned Spectral Power Loss for Improved Prediction of Chaotic Systems
Machine Learning

[2502.00472] Binned Spectral Power Loss for Improved Prediction of Chaotic Systems

Abstract page for arXiv paper 2502.00472: Binned Spectral Power Loss for Improved Prediction of Chaotic Systems

arXiv - Machine Learning · 4 min ·
[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring
Machine Learning

[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring

Abstract page for arXiv paper 2501.10677: Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Sc...

arXiv - Machine Learning · 4 min ·
[2501.07237] Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States
Llms

[2501.07237] Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States

Abstract page for arXiv paper 2501.07237: Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States

arXiv - Machine Learning · 4 min ·
[2501.00200] Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes
Machine Learning

[2501.00200] Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

Abstract page for arXiv paper 2501.00200: Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

arXiv - Machine Learning · 4 min ·
Previous Page 198 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime