Machine Learning

ML algorithms, training, and inference

Top This Week

[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment
Llms

[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment

Abstract page for arXiv paper 2602.06869: Uncovering Cross-Objective Interference in Multi-Objective Alignment

arXiv - Machine Learning · 3 min ·
[2604.07401] Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory
Machine Learning

[2604.07401] Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory

Abstract page for arXiv paper 2604.07401: Geometric Entropy and Retrieval Phase Transitions in Continuous Thermal Dense Associative Memory

arXiv - Machine Learning · 4 min ·
[2512.14954] Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation
Llms

[2512.14954] Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation

Abstract page for arXiv paper 2512.14954: Cross-Tokenizer Likelihood Scoring Algorithms for Language Model Distillation

arXiv - Machine Learning · 4 min ·

All Content

[2510.17640] RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation
Machine Learning

[2510.17640] RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation

Abstract page for arXiv paper 2510.17640: RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation

arXiv - AI · 4 min ·
[2510.06499] Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels
Llms

[2510.06499] Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Abstract page for arXiv paper 2510.06499: Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

arXiv - AI · 4 min ·
[2509.26435] Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search
Llms

[2509.26435] Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search

Abstract page for arXiv paper 2509.26435: Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search

arXiv - AI · 3 min ·
[2509.25214] On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs
Llms

[2509.25214] On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Quantized LLMs

Abstract page for arXiv paper 2509.25214: On-the-Fly Adaptation to Quantization: Configuration-Aware LoRA for Efficient Fine-Tuning of Qu...

arXiv - AI · 4 min ·
[2509.02967] AR-KAN: Autoregressive-Weight-Enhanced Kolmogorov-Arnold Network for Time Series Forecasting
Llms

[2509.02967] AR-KAN: Autoregressive-Weight-Enhanced Kolmogorov-Arnold Network for Time Series Forecasting

Abstract page for arXiv paper 2509.02967: AR-KAN: Autoregressive-Weight-Enhanced Kolmogorov-Arnold Network for Time Series Forecasting

arXiv - AI · 4 min ·
[2508.16165] Investigating Multimodal Large Language Models to Support Usability Evaluation
Llms

[2508.16165] Investigating Multimodal Large Language Models to Support Usability Evaluation

Abstract page for arXiv paper 2508.16165: Investigating Multimodal Large Language Models to Support Usability Evaluation

arXiv - AI · 3 min ·
[2508.06869] VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding
Llms

[2508.06869] VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

Abstract page for arXiv paper 2508.06869: VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding

arXiv - AI · 4 min ·
[2508.04853] Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos
Llms

[2508.04853] Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos

Abstract page for arXiv paper 2508.04853: Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos

arXiv - AI · 4 min ·
[2506.22832] Listener-Rewarded Thinking in VLMs for Image Preferences
Machine Learning

[2506.22832] Listener-Rewarded Thinking in VLMs for Image Preferences

Abstract page for arXiv paper 2506.22832: Listener-Rewarded Thinking in VLMs for Image Preferences

arXiv - AI · 4 min ·
[2506.09067] Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations
Llms

[2506.09067] Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations

Abstract page for arXiv paper 2506.09067: Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations

arXiv - AI · 3 min ·
[2505.18600] Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
Machine Learning

[2505.18600] Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

Abstract page for arXiv paper 2505.18600: Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment

arXiv - AI · 3 min ·
[2505.12509] Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
Llms

[2505.12509] Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models

Abstract page for arXiv paper 2505.12509: Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models

arXiv - AI · 3 min ·
[2503.00035] Constraining Sequential Model Editing with Editing Anchor Compression
Llms

[2503.00035] Constraining Sequential Model Editing with Editing Anchor Compression

Abstract page for arXiv paper 2503.00035: Constraining Sequential Model Editing with Editing Anchor Compression

arXiv - AI · 4 min ·
[2502.08691] AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society
Llms

[2502.08691] AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and Society

Abstract page for arXiv paper 2502.08691: AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of ...

arXiv - AI · 4 min ·
[2502.06809] Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution
Llms

[2502.06809] Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution

Abstract page for arXiv paper 2502.06809: Neurons Speak in Ranges: Breaking Free from Discrete Neuronal Attribution

arXiv - AI · 4 min ·
[2411.10636] Mitigating Extrinsic Gender Bias for Bangla Classification Tasks
Llms

[2411.10636] Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

Abstract page for arXiv paper 2411.10636: Mitigating Extrinsic Gender Bias for Bangla Classification Tasks

arXiv - AI · 4 min ·
[2410.08559] Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture
Machine Learning

[2410.08559] Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture

Abstract page for arXiv paper 2410.08559: Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive ...

arXiv - AI · 4 min ·
[2404.10976] Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning
Machine Learning

[2404.10976] Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2404.10976: Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning

arXiv - AI · 4 min ·
[2311.14756] Task-Distributionally Robust Data-Free Meta-Learning
Machine Learning

[2311.14756] Task-Distributionally Robust Data-Free Meta-Learning

Abstract page for arXiv paper 2311.14756: Task-Distributionally Robust Data-Free Meta-Learning

arXiv - AI · 4 min ·
[2604.07956] MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems
Machine Learning

[2604.07956] MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

Abstract page for arXiv paper 2604.07956: MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

arXiv - AI · 3 min ·
Previous Page 334 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime