[P] I trained an AI to play Resident Evil 4 Remake using Behavioral Cloning + LSTM
I recorded gameplay trajectories in RE4's village — running, shooting, reloading, dodging — and used Behavioral Cloning to train a model ...
ML algorithms, training, and inference
Many times when I try to deeply understand a topic in machine learning — whether it's a new architecture, a quantization method, a full t...
GPT-5.4-mini produces shorter, terser outputs by default. Vanilla accuracy dropped from 69.5% to 47.2% across 12 tasks (1,800 evals). The...
Abstract page for arXiv paper 2603.25155: Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models
Abstract page for arXiv paper 2603.25150: Goodness-of-pronunciation without phoneme time alignment
Abstract page for arXiv paper 2603.25146: Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence
Abstract page for arXiv paper 2603.25144: FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation
Abstract page for arXiv paper 2603.25068: Ultra-fast Traffic Nowcasting and Control via Differentiable Agent-based Simulation
Abstract page for arXiv paper 2603.25015: Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models
Abstract page for arXiv paper 2603.25126: MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation
Abstract page for arXiv paper 2603.25024: Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method
Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
Abstract page for arXiv paper 2603.24946: MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application...
Abstract page for arXiv paper 2603.25109: MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness
Abstract page for arXiv paper 2603.24917: Estimating near-verbatim extraction risk in language models with decoding-constrained beam search
Abstract page for arXiv paper 2603.25099: Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Opti...
Abstract page for arXiv paper 2603.25083: Learning domain-invariant features through channel-level sparsification for Out-Of Distribution...
Abstract page for arXiv paper 2603.25063: TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visual...
Abstract page for arXiv paper 2603.25056: The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Create...
Abstract page for arXiv paper 2603.24764: Synthetic Cardiac MRI Image Generation using Deep Generative Models
Abstract page for arXiv paper 2603.25052: Closing the Confidence-Faithfulness Gap in Large Language Models
Abstract page for arXiv paper 2603.24752: Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical...
Abstract page for arXiv paper 2603.25006: Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning