Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

What to expect from AlphaZero's value predictions [D]

An AlphaZero agent has learnt to predict the value of a game state by training on data generated by self-play by the model and a series o...

Reddit - Machine Learning · 1 min ·
Machine Learning

Open Source Projects related to CNNs to Contribute To? [D]

Around a decade a go I was tinkering a lot with CNNs for real time event detection. I enjoyed that a lot and always wanted to get back in...

Reddit - Machine Learning · 1 min ·
I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI | WIRED
Machine Learning

I Work in Hollywood. Everyone Who Used to Make TV Is Now Secretly Training AI | WIRED

For screenwriters like me—and job seekers all over—AI gig work is the new waiting tables. In eight months, I’ve done 20 of these soul-cru...

Wired - AI · 27 min ·

All Content

[2511.22893] Switching-time bioprocess control with pulse-width-modulated optogenetics
Machine Learning

[2511.22893] Switching-time bioprocess control with pulse-width-modulated optogenetics

Abstract page for arXiv paper 2511.22893: Switching-time bioprocess control with pulse-width-modulated optogenetics

arXiv - AI · 4 min ·
[2511.15204] Physics-Based Benchmarking Metrics for Multimodal Synthetic Images
Llms

[2511.15204] Physics-Based Benchmarking Metrics for Multimodal Synthetic Images

Abstract page for arXiv paper 2511.15204: Physics-Based Benchmarking Metrics for Multimodal Synthetic Images

arXiv - AI · 3 min ·
[2511.02805] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
Llms

[2511.02805] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

Abstract page for arXiv paper 2511.02805: MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Lea...

arXiv - AI · 3 min ·
[2510.16079] EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
Llms

[2510.16079] EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle

Abstract page for arXiv paper 2510.16079: EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle

arXiv - AI · 4 min ·
[2506.21582] VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents
Llms

[2506.21582] VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

Abstract page for arXiv paper 2506.21582: VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with I...

arXiv - AI · 4 min ·
[2510.22944] Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies
Llms

[2510.22944] Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

Abstract page for arXiv paper 2510.22944: Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

arXiv - AI · 4 min ·
[2510.04850] Detecting Distillation Data from Reasoning Models
Llms

[2510.04850] Detecting Distillation Data from Reasoning Models

Abstract page for arXiv paper 2510.04850: Detecting Distillation Data from Reasoning Models

arXiv - AI · 4 min ·
[2510.01685] How Do Language Models Compose Functions?
Llms

[2510.01685] How Do Language Models Compose Functions?

Abstract page for arXiv paper 2510.01685: How Do Language Models Compose Functions?

arXiv - AI · 3 min ·
[2506.14399] Factored Classifier-Free Guidance
Machine Learning

[2506.14399] Factored Classifier-Free Guidance

Abstract page for arXiv paper 2506.14399: Factored Classifier-Free Guidance

arXiv - AI · 3 min ·
[2504.11837] FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations
Llms

[2504.11837] FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations

Abstract page for arXiv paper 2504.11837: FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations

arXiv - AI · 3 min ·
[2502.01941] Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Compression
Llms

[2502.01941] Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Compression

Abstract page for arXiv paper 2502.01941: Semantic Integrity Matters: Benchmarking and Preserving High-Density Reasoning in KV Cache Comp...

arXiv - AI · 4 min ·
[2412.11194] Direction for Detection: A Survey of Automated Vulnerability Detection and all of its Pain Points
Machine Learning

[2412.11194] Direction for Detection: A Survey of Automated Vulnerability Detection and all of its Pain Points

Abstract page for arXiv paper 2412.11194: Direction for Detection: A Survey of Automated Vulnerability Detection and all of its Pain Points

arXiv - AI · 4 min ·
[2410.06347] Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning
Machine Learning

[2410.06347] Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

Abstract page for arXiv paper 2410.06347: Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

arXiv - AI · 3 min ·
[2407.04183] Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms
Llms

[2407.04183] Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms

Abstract page for arXiv paper 2407.04183: Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms

arXiv - AI · 4 min ·
[2603.09652] MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants
Llms

[2603.09652] MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

Abstract page for arXiv paper 2603.09652: MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assis...

arXiv - AI · 4 min ·
[2602.00924] Supervised sparse auto-encoders for interpretable and compositional representations
Machine Learning

[2602.00924] Supervised sparse auto-encoders for interpretable and compositional representations

Abstract page for arXiv paper 2602.00924: Supervised sparse auto-encoders for interpretable and compositional representations

arXiv - AI · 3 min ·
[2601.23143] THINKSAFE: Self-Generated Safety Alignment for Reasoning Models
Machine Learning

[2601.23143] THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

Abstract page for arXiv paper 2601.23143: THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

arXiv - AI · 3 min ·
[2601.04731] Miner:Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models
Machine Learning

[2601.04731] Miner:Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models

Abstract page for arXiv paper 2601.04731: Miner:Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models

arXiv - AI · 4 min ·
[2512.05439] BEAVER: An Efficient Deterministic LLM Verifier
Llms

[2512.05439] BEAVER: An Efficient Deterministic LLM Verifier

Abstract page for arXiv paper 2512.05439: BEAVER: An Efficient Deterministic LLM Verifier

arXiv - AI · 3 min ·
[2511.09907] Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis
Machine Learning

[2511.09907] Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis

Abstract page for arXiv paper 2511.09907: Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis

arXiv - AI · 4 min ·
Previous Page 2 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime