Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[R] Fine-tuning services report

If you have some data and want to train or run a small custom model but don't have powerful enough hardware for training, fine-tuning ser...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

[D] Does ML have a "bible"/reference textbook at the Intermediate/Advanced level?

Hello, everyone! This is my first time posting here and I apologise if the question is, perhaps, a bit too basic for this sub-reddit. A b...

Reddit - Machine Learning · 1 min · about 2 hours ago

Machine Learning

[D] ICML 2026 review policy debate: 100 responses suggest Policy B may score higher, while Policy A shows higher confidence

A week ago I made a thread asking whether ICML 2026’s review policy might have affected review outcomes, especially whether Policy A pape...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Llms

[2603.11687] SemBench: A Universal Semantic Framework for LLM Evaluation

Abstract page for arXiv paper 2603.11687: SemBench: A Universal Semantic Framework for LLM Evaluation

arXiv - AI · 4 min · 5 days ago

Llms

[2603.16179] 360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method

Abstract page for arXiv paper 2603.16179: 360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method

arXiv - AI · 4 min · 5 days ago

Llms

[2603.11583] UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization

Abstract page for arXiv paper 2603.11583: UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization

arXiv - AI · 3 min · 5 days ago

Machine Learning

[2603.11560] Theory of Dynamic Adaptive Coordination

Abstract page for arXiv paper 2603.11560: Theory of Dynamic Adaptive Coordination

arXiv - AI · 3 min · 5 days ago

Llms

[2603.11413] Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI

Abstract page for arXiv paper 2603.11413: Evaluation format, not model capability, drives triage failure in the assessment of consumer he...

arXiv - AI · 4 min · 5 days ago

Llms

[2603.16673] When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making

Abstract page for arXiv paper 2603.16673: When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Rob...

arXiv - AI · 4 min · 5 days ago

Llms

[2603.06663] Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual Prompting

Abstract page for arXiv paper 2603.06663: Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual ...

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2603.14294] Seeking Physics in Diffusion Noise

Abstract page for arXiv paper 2603.14294: Seeking Physics in Diffusion Noise

arXiv - AI · 3 min · 5 days ago

Machine Learning

[2601.07325] Robust Bayesian Inference via Variational Approximations of Generalized Rho-Posteriors

Abstract page for arXiv paper 2601.07325: Robust Bayesian Inference via Variational Approximations of Generalized Rho-Posteriors

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2512.22854] ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

Abstract page for arXiv paper 2512.22854: ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum ...

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2512.05245] STAR-GO: Improving Protein Function Prediction by Learning to Hierarchically Integrate Ontology-Informed Semantic Embeddings

Abstract page for arXiv paper 2512.05245: STAR-GO: Improving Protein Function Prediction by Learning to Hierarchically Integrate Ontology...

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2511.14427] Self-Supervised Multisensory Pretraining for Contact-Rich Robot Reinforcement Learning

Abstract page for arXiv paper 2511.14427: Self-Supervised Multisensory Pretraining for Contact-Rich Robot Reinforcement Learning

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2601.08881] TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

Abstract page for arXiv paper 2601.08881: TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2511.04454] Fitting Reinforcement Learning Model to Behavioral Data under Bandits

Abstract page for arXiv paper 2511.04454: Fitting Reinforcement Learning Model to Behavioral Data under Bandits

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2601.06394] Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification

Abstract page for arXiv paper 2601.06394: Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing an...

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2511.01464] Split-Flows: Measure Transport and Information Loss Across Molecular Resolutions

Abstract page for arXiv paper 2511.01464: Split-Flows: Measure Transport and Information Loss Across Molecular Resolutions

arXiv - Machine Learning · 3 min · 5 days ago

Llms

[2512.14698] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Abstract page for arXiv paper 2512.14698: TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

arXiv - AI · 4 min · 5 days ago

Llms

[2512.10411] SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing

Abstract page for arXiv paper 2512.10411: SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Pro...

arXiv - AI · 4 min · 5 days ago

Llms

[2510.12117] Locket: Robust Feature-Locking Technique for Language Models

Abstract page for arXiv paper 2510.12117: Locket: Robust Feature-Locking Technique for Language Models

arXiv - Machine Learning · 3 min · 5 days ago

Machine Learning

[2509.21385] Debugging Concept Bottleneck Models through Removal and Retraining

Abstract page for arXiv paper 2509.21385: Debugging Concept Bottleneck Models through Removal and Retraining

arXiv - Machine Learning · 4 min · 5 days ago

Previous Page 44 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[R] Fine-tuning services report

[D] Does ML have a "bible"/reference textbook at the Intermediate/Advanced level?

[D] ICML 2026 review policy debate: 100 responses suggest Policy B may score higher, while Policy A shows higher confidence

All Content

[2603.11687] SemBench: A Universal Semantic Framework for LLM Evaluation

[2603.16179] 360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method

[2603.11583] UtilityMax Prompting: A Formal Framework for Multi-Objective Large Language Model Optimization

[2603.11560] Theory of Dynamic Adaptive Coordination

[2603.11413] Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI

[2603.16673] When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making

[2603.06663] Graph-of-Mark: Promote Spatial Reasoning in Multimodal Language Models with Graph-Based Visual Prompting

[2603.14294] Seeking Physics in Diffusion Noise

[2601.07325] Robust Bayesian Inference via Variational Approximations of Generalized Rho-Posteriors

[2512.22854] ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning

[2512.05245] STAR-GO: Improving Protein Function Prediction by Learning to Hierarchically Integrate Ontology-Informed Semantic Embeddings

[2511.14427] Self-Supervised Multisensory Pretraining for Contact-Rich Robot Reinforcement Learning

[2601.08881] TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

[2511.04454] Fitting Reinforcement Learning Model to Behavioral Data under Bandits

[2601.06394] Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification

[2511.01464] Split-Flows: Measure Transport and Information Loss Across Molecular Resolutions

[2512.14698] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

[2512.10411] SWAA: Sliding Window Attention Adaptation for Efficient and Quality Preserving Long Context Processing

[2510.12117] Locket: Robust Feature-Locking Technique for Language Models

[2509.21385] Debugging Concept Bottleneck Models through Removal and Retraining

Related Topics

Stay updated with AI News