Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Is it actually possible to build a model-agnostic persistent text layer that keeps AI behavior stable?

Is it actually possible to define a persistent, model-agnostic text-based layer (loaded with the model each time) that keeps an AI system...

Reddit - Artificial Intelligence · 1 min · 18 minutes ago

Machine Learning

Are gamers being used as free labeling labor? The rise of "Simulators" that look like AI training grounds [D]

Hey everyone, I’m an AI news curator and editor currently working on a piece about a weird trend I’ve been spotting: technical simulators...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

Coherence Without Convergence: A New Protocol for Multi-Agent AI

Opening For the past year, most progress in multi-agent AI has followed a familiar pattern: Add more agents. Add more coordination. Watch...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

All Content

Machine Learning

[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

Abstract page for arXiv paper 2508.13773: PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series F...

arXiv - Machine Learning · 3 min · 16 days ago

Llms

[2508.04329] Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

Abstract page for arXiv paper 2508.04329: Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2508.02343] MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

Abstract page for arXiv paper 2508.02343: MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language M...

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2507.15162] Designing User-Centric Metrics for Evaluation of Counterfactual Explanations

Abstract page for arXiv paper 2507.15162: Designing User-Centric Metrics for Evaluation of Counterfactual Explanations

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2507.03119] Improving ideal MHD equilibrium accuracy with physics-informed neural networks

Abstract page for arXiv paper 2507.03119: Improving ideal MHD equilibrium accuracy with physics-informed neural networks

arXiv - Machine Learning · 3 min · 16 days ago

Machine Learning

[2506.10127] Meet Me at the Arm: The Cooperative Multi-Armed Bandits Problem with Shareable Arms

Abstract page for arXiv paper 2506.10127: Meet Me at the Arm: The Cooperative Multi-Armed Bandits Problem with Shareable Arms

arXiv - Machine Learning · 3 min · 16 days ago

Llms

[2505.13820] Structured Agent Distillation for Large Language Model

Abstract page for arXiv paper 2505.13820: Structured Agent Distillation for Large Language Model

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2505.13280] FlowPure: Continuous Normalizing Flows for Adversarial Purification

Abstract page for arXiv paper 2505.13280: FlowPure: Continuous Normalizing Flows for Adversarial Purification

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2505.11349] Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning

Abstract page for arXiv paper 2505.11349: Context parroting: A simple but tough-to-beat baseline for foundation models in scientific mach...

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2505.11035] Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeling Scenarios

Abstract page for arXiv paper 2505.11035: Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeli...

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2505.08137] Large Language Models for Computer-Aided Design: A Survey

Abstract page for arXiv paper 2505.08137: Large Language Models for Computer-Aided Design: A Survey

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2505.01448] OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

Abstract page for arXiv paper 2505.01448: OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2504.10833] Measuring the (Un)Faithfulness of Concept-Based Explanations

Abstract page for arXiv paper 2504.10833: Measuring the (Un)Faithfulness of Concept-Based Explanations

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2503.09008] Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a Measurement

Abstract page for arXiv paper 2503.09008: Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset an...

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2503.05371] Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs

Abstract page for arXiv paper 2503.05371: Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2502.07297] MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials

Abstract page for arXiv paper 2502.07297: MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2502.00472] Binned Spectral Power Loss for Improved Prediction of Chaotic Systems

Abstract page for arXiv paper 2502.00472: Binned Spectral Power Loss for Improved Prediction of Chaotic Systems

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring

Abstract page for arXiv paper 2501.10677: Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Sc...

arXiv - Machine Learning · 4 min · 16 days ago

Llms

[2501.07237] Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States

Abstract page for arXiv paper 2501.07237: Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States

arXiv - Machine Learning · 4 min · 16 days ago

Machine Learning

[2501.00200] Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

Abstract page for arXiv paper 2501.00200: Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

arXiv - Machine Learning · 4 min · 16 days ago

Previous Page 198 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Is it actually possible to build a model-agnostic persistent text layer that keeps AI behavior stable?

Are gamers being used as free labeling labor? The rise of "Simulators" that look like AI training grounds [D]

Coherence Without Convergence: A New Protocol for Multi-Agent AI

All Content

[2508.13773] PENGUIN: Enhancing Transformer with Periodic-Nested Group Attention for Long-term Time Series Forecasting

[2508.04329] Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

[2508.02343] MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models

[2507.15162] Designing User-Centric Metrics for Evaluation of Counterfactual Explanations

[2507.03119] Improving ideal MHD equilibrium accuracy with physics-informed neural networks

[2506.10127] Meet Me at the Arm: The Cooperative Multi-Armed Bandits Problem with Shareable Arms

[2505.13820] Structured Agent Distillation for Large Language Model

[2505.13280] FlowPure: Continuous Normalizing Flows for Adversarial Purification

[2505.11349] Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning

[2505.11035] Deep Latent Variable Model based Vertical Federated Learning with Flexible Alignment and Labeling Scenarios

[2505.08137] Large Language Models for Computer-Aided Design: A Survey

[2505.01448] OpenAVS: Training-Free Open-Vocabulary Audio Visual Segmentation with Foundational Models

[2504.10833] Measuring the (Un)Faithfulness of Concept-Based Explanations

[2503.09008] Towards Quantifying Long-Range Interactions in Graph Machine Learning: a Large Graph Dataset and a Measurement

[2503.05371] Shifting Perspectives: Steering Vectors for Robust Bias Mitigation in LLMs

[2502.07297] MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials

[2502.00472] Binned Spectral Power Loss for Improved Prediction of Chaotic Systems

[2501.10677] Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring

[2501.07237] Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States

[2501.00200] Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

Related Topics

Stay updated with AI News