Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Choosing between a specialized Data Science/ML master’s [D]

Hi everyone, I’m planning to start my master’s this October and I’m deciding between two options. Option A: A Master’s in Data Science an...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

I built AI agents that play Pokemon Showdown autonomously using free LLM APIs via tool-calling [P]

I've built a system where models like Llama 3, Qwen, and Gemma play Pokémon Showdown battles autonomously. Instead of simple prompt-respo...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

A Hackable ML Compiler Stack in 5,000 Lines of Python [P]

Hey r/MachineLearning, The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Trito...

Reddit - Machine Learning · 1 min · about 2 hours ago

All Content

Llms

[2604.03980] Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

Abstract page for arXiv paper 2604.03980: Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2604.03968] TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

Abstract page for arXiv paper 2604.03968: TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03956] VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

Abstract page for arXiv paper 2604.03956: VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03925] AdaptFuse: Training-Free Sequential Preference Learning via Externalized Bayesian Inference

Abstract page for arXiv paper 2604.03925: AdaptFuse: Training-Free Sequential Preference Learning via Externalized Bayesian Inference

arXiv - AI · 3 min · 24 days ago

Llms

[2604.03904] I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation

Abstract page for arXiv paper 2604.03904: I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03881] Enhancing behavioral nudges with large language model-based iterative personalization: A field experiment on electricity and hot-water conservation

Abstract page for arXiv paper 2604.03881: Enhancing behavioral nudges with large language model-based iterative personalization: A field ...

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.03814] InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset

Abstract page for arXiv paper 2604.03814: InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03774] When Does Multimodal AI Help? Diagnostic Complementarity of Vision-Language Models and CNNs for Spectrum Management in Satellite-Terrestrial Networks

Abstract page for arXiv paper 2604.03774: When Does Multimodal AI Help? Diagnostic Complementarity of Vision-Language Models and CNNs for...

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03755] Can Humans Tell? A Dual-Axis Study of Human Perception of LLM-Generated News

Abstract page for arXiv paper 2604.03755: Can Humans Tell? A Dual-Axis Study of Human Perception of LLM-Generated News

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03758] AutoReSpec: A Framework for Generating Specification using Large Language Models

Abstract page for arXiv paper 2604.03758: AutoReSpec: A Framework for Generating Specification using Large Language Models

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03754] Testing the Limits of Truth Directions in LLMs

Abstract page for arXiv paper 2604.03754: Testing the Limits of Truth Directions in LLMs

arXiv - AI · 3 min · 24 days ago

Llms

[2604.03750] CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering

Abstract page for arXiv paper 2604.03750: CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03677] Unlocking Prompt Infilling Capability for Diffusion Language Models

Abstract page for arXiv paper 2604.03677: Unlocking Prompt Infilling Capability for Diffusion Language Models

arXiv - AI · 3 min · 24 days ago

Llms

[2604.03688] Fusion and Alignment Enhancement with Large Language Models for Tail-item Sequential Recommendation

Abstract page for arXiv paper 2604.03688: Fusion and Alignment Enhancement with Large Language Models for Tail-item Sequential Recommenda...

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.03672] AI Appeals Processor: A Deep Learning Approach to Automated Classification of Citizen Appeals in Government Services

Abstract page for arXiv paper 2604.03672: AI Appeals Processor: A Deep Learning Approach to Automated Classification of Citizen Appeals i...

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2604.03649] ART: Adaptive Relational Transformer for Pedestrian Trajectory Prediction with Temporal-Aware Relations

Abstract page for arXiv paper 2604.03649: ART: Adaptive Relational Transformer for Pedestrian Trajectory Prediction with Temporal-Aware R...

arXiv - AI · 3 min · 24 days ago

Llms

[2604.03647] Stabilizing Unsupervised Self-Evolution of MLLMs via Continuous Softened Retracing reSampling

Abstract page for arXiv paper 2604.03647: Stabilizing Unsupervised Self-Evolution of MLLMs via Continuous Softened Retracing reSampling

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03635] A Generative Foundation Model for Multimodal Histopathology

Abstract page for arXiv paper 2604.03635: A Generative Foundation Model for Multimodal Histopathology

arXiv - AI · 4 min · 24 days ago

Llms

[2604.03632] Persistent Cross-Attempt State Optimization for Repository-Level Code Generation

Abstract page for arXiv paper 2604.03632: Persistent Cross-Attempt State Optimization for Repository-Level Code Generation

arXiv - AI · 3 min · 24 days ago

Llms

[2604.03622] Toward Executable Repository-Level Code Generation via Environment Alignment

Abstract page for arXiv paper 2604.03622: Toward Executable Repository-Level Code Generation via Environment Alignment

arXiv - AI · 4 min · 24 days ago

Previous Page 297 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Choosing between a specialized Data Science/ML master’s [D]

I built AI agents that play Pokemon Showdown autonomously using free LLM APIs via tool-calling [P]

A Hackable ML Compiler Stack in 5,000 Lines of Python [P]

All Content

[2604.03980] Gram-Anchored Prompt Learning for Vision-Language Models via Second-Order Statistics

[2604.03968] TraceGuard: Structured Multi-Dimensional Monitoring as a Collusion-Resistant Control Protocol

[2604.03956] VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

[2604.03925] AdaptFuse: Training-Free Sequential Preference Learning via Externalized Bayesian Inference

[2604.03904] I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation

[2604.03881] Enhancing behavioral nudges with large language model-based iterative personalization: A field experiment on electricity and hot-water conservation

[2604.03814] InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset

[2604.03774] When Does Multimodal AI Help? Diagnostic Complementarity of Vision-Language Models and CNNs for Spectrum Management in Satellite-Terrestrial Networks

[2604.03755] Can Humans Tell? A Dual-Axis Study of Human Perception of LLM-Generated News

[2604.03758] AutoReSpec: A Framework for Generating Specification using Large Language Models

[2604.03754] Testing the Limits of Truth Directions in LLMs

[2604.03750] CREBench: Evaluating Large Language Models in Cryptographic Binary Reverse Engineering

[2604.03677] Unlocking Prompt Infilling Capability for Diffusion Language Models

[2604.03688] Fusion and Alignment Enhancement with Large Language Models for Tail-item Sequential Recommendation

[2604.03672] AI Appeals Processor: A Deep Learning Approach to Automated Classification of Citizen Appeals in Government Services

[2604.03649] ART: Adaptive Relational Transformer for Pedestrian Trajectory Prediction with Temporal-Aware Relations

[2604.03647] Stabilizing Unsupervised Self-Evolution of MLLMs via Continuous Softened Retracing reSampling

[2604.03635] A Generative Foundation Model for Multimodal Histopathology

[2604.03632] Persistent Cross-Attempt State Optimization for Repository-Level Code Generation

[2604.03622] Toward Executable Repository-Level Code Generation via Environment Alignment

Related Topics

Stay updated with AI News