Machine Learning

ML algorithms, training, and inference

Top This Week

Improving AI models’ ability to explain their predictions
Machine Learning

AI News - General · 9 min ·
New technique makes AI models leaner and faster while they’re still learning
Machine Learning

AI News - General · 9 min ·
Machine Learning

Question regarding Transformer's pipeline module [D]

from transformers import pipeline, DistilBertTokenizer, DistilBertModel
model = DistilBertModel.from_pretrained('distilbert-base-cas...

Reddit - Machine Learning · 1 min ·

All Content

[2604.04681] Batch Loss Score for Dynamic Data Pruning
Machine Learning

arXiv - Machine Learning · 4 min ·
[2604.04655] Grokking as Dimensional Phase Transition in Neural Networks
Machine Learning

arXiv - AI · 3 min ·
[2604.04648] From Curiosity to Caution: Mitigating Reward Hacking for Best-of-N with Pessimism
LLMs

arXiv - Machine Learning · 4 min ·
[2604.04614] A Clinical Point Cloud Paradigm for In-Hospital Mortality Prediction from Multi-Level Incomplete Multimodal EHRs
Machine Learning

arXiv - AI · 4 min ·
[2604.04611] Dynamic Free-Rider Detection in Federated Learning via Simulated Attack Patterns
Machine Learning

arXiv - Machine Learning · 4 min ·
[2604.04535] Learning from Equivalence Queries, Revisited
Machine Learning

arXiv - Machine Learning · 4 min ·
[2604.04518] Reproducibility study on how to find Spurious Correlations, Shortcut Learning, Clever Hans or Group-Distributional non-robustness and how to fix them
Machine Learning

arXiv - AI · 4 min ·
[2604.04516] GAIN: Multiplicative Modulation for Domain Adaptation
LLMs

arXiv - AI · 3 min ·
[2604.04497] One Model for All: Multi-Objective Controllable Language Models
LLMs

arXiv - AI · 4 min ·
[2604.04493] SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models
LLMs

arXiv - AI · 3 min ·
[2604.04485] ECG Biometrics with ArcFace-Inception: External Validation on MIMIC and HEEDB
Machine Learning

arXiv - AI · 3 min ·
[2604.04475] Discrete Prototypical Memories for Federated Time Series Foundation Models
LLMs

arXiv - AI · 3 min ·
[2604.04474] MAVEN: A Mesh-Aware Volumetric Encoding Network for Simulating 3D Flexible Deformation
Machine Learning

arXiv - AI · 4 min ·
[2604.04461] DP-OPD: Differentially Private On-Policy Distillation for Language Models
LLMs

arXiv - AI · 4 min ·
[2604.04410] Relative Density Ratio Optimization for Stable and Statistically Consistent Model Alignment
LLMs

arXiv - AI · 4 min ·
[2604.04394] Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games
Machine Learning

arXiv - Machine Learning · 3 min ·
[2604.04380] CPT: Controllable and Editable Design Variations with Language Models
LLMs

arXiv - Machine Learning · 3 min ·
[2604.04364] Context is All You Need
Machine Learning

arXiv - AI · 3 min ·
[2604.04343] Deep Kuratowski Embedding Neural Networks for Wasserstein Metric Learning
Machine Learning

arXiv - Machine Learning · 3 min ·
[2604.04342] Generative models for decision-making under distributional shift
Machine Learning

arXiv - Machine Learning · 3 min ·