Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[P] I trained an AI to play Resident Evil 4 Remake using Behavioral Cloning + LSTM

I recorded gameplay trajectories in RE4's village — running, shooting, reloading, dodging — and used Behavioral Cloning to train a model ...

Reddit - Machine Learning · 1 min · 26 minutes ago

Machine Learning

[D] Why does it seem like open source materials on ML are incomplete? this is not enough...

Many times when I try to deeply understand a topic in machine learning — whether it's a new architecture, a quantization method, a full t...

Reddit - Machine Learning · 1 min · 26 minutes ago

Llms

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

GPT-5.4-mini produces shorter, terser outputs by default. Vanilla accuracy dropped from 69.5% to 47.2% across 12 tasks (1,800 evals). The...

Reddit - Machine Learning · 1 min · about 5 hours ago

All Content

Llms

[2603.25155] Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

Abstract page for arXiv paper 2603.25155: Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

arXiv - AI · 3 min · 2 days ago

Machine Learning

[2603.25150] Goodness-of-pronunciation without phoneme time alignment

Abstract page for arXiv paper 2603.25150: Goodness-of-pronunciation without phoneme time alignment

arXiv - AI · 3 min · 2 days ago

Llms

[2603.25146] Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence

Abstract page for arXiv paper 2603.25146: Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2603.25144] FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation

Abstract page for arXiv paper 2603.25144: FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2603.25068] Ultra-fast Traffic Nowcasting and Control via Differentiable Agent-based Simulation

Abstract page for arXiv paper 2603.25068: Ultra-fast Traffic Nowcasting and Control via Differentiable Agent-based Simulation

arXiv - Machine Learning · 4 min · 2 days ago

Llms

[2603.25015] Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

Abstract page for arXiv paper 2603.25015: Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

arXiv - AI · 3 min · 2 days ago

Machine Learning

[2603.25126] MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

Abstract page for arXiv paper 2603.25126: MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2603.25024] Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

Abstract page for arXiv paper 2603.25024: Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

arXiv - Machine Learning · 3 min · 2 days ago

Llms

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min · 2 days ago

Llms

[2603.24946] MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development

Abstract page for arXiv paper 2603.24946: MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application...

arXiv - Machine Learning · 4 min · 2 days ago

Machine Learning

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

Abstract page for arXiv paper 2603.25109: MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

arXiv - AI · 4 min · 2 days ago

Llms

[2603.24917] Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

Abstract page for arXiv paper 2603.24917: Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

arXiv - Machine Learning · 4 min · 2 days ago

Llms

[2603.25099] Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

Abstract page for arXiv paper 2603.25099: Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Opti...

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2603.25083] Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

Abstract page for arXiv paper 2603.25083: Learning domain-invariant features through channel-level sparsification for Out-Of Distribution...

arXiv - AI · 4 min · 2 days ago

Llms

[2603.25063] TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

Abstract page for arXiv paper 2603.25063: TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visual...

arXiv - Machine Learning · 4 min · 2 days ago

Llms

[2603.25056] The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

Abstract page for arXiv paper 2603.25056: The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Create...

arXiv - AI · 4 min · 2 days ago

Machine Learning

[2603.24764] Synthetic Cardiac MRI Image Generation using Deep Generative Models

Abstract page for arXiv paper 2603.24764: Synthetic Cardiac MRI Image Generation using Deep Generative Models

arXiv - Machine Learning · 3 min · 2 days ago

Llms

[2603.25052] Closing the Confidence-Faithfulness Gap in Large Language Models

Abstract page for arXiv paper 2603.25052: Closing the Confidence-Faithfulness Gap in Large Language Models

arXiv - AI · 3 min · 2 days ago

Machine Learning

[2603.24752] Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical-to-Quantum Transfer Learning

Abstract page for arXiv paper 2603.24752: Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical...

arXiv - Machine Learning · 4 min · 2 days ago

Machine Learning

[2603.25006] Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

Abstract page for arXiv paper 2603.25006: Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

arXiv - AI · 4 min · 2 days ago

Previous Page 10 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[P] I trained an AI to play Resident Evil 4 Remake using Behavioral Cloning + LSTM

[D] Why does it seem like open source materials on ML are incomplete? this is not enough...

[R] GPT-5.4-mini regressed 22pp on vanilla prompting vs GPT-5-mini. Nobody noticed because benchmarks don't test this. Recursive Language Models solved it.

All Content

[2603.25155] Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

[2603.25150] Goodness-of-pronunciation without phoneme time alignment

[2603.25146] Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence

[2603.25144] FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation

[2603.25068] Ultra-fast Traffic Nowcasting and Control via Differentiable Agent-based Simulation

[2603.25015] Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

[2603.25126] MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

[2603.25024] Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

[2603.24946] MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

[2603.24917] Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

[2603.25099] Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

[2603.25083] Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

[2603.25063] TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

[2603.25056] The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

[2603.24764] Synthetic Cardiac MRI Image Generation using Deep Generative Models

[2603.25052] Closing the Confidence-Faithfulness Gap in Large Language Models

[2603.24752] Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical-to-Quantum Transfer Learning

[2603.25006] Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

Related Topics

Stay updated with AI News