Machine Learning

ML algorithms, training, and inference


Top This Week

LLMs

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

(Posting Here because removed by Chatgpt Complaints moderators because the model here is 4o, and refuse to believe there were any safety ...

Reddit - Artificial Intelligence · 1 min · 1 minute ago

LLMs

Unsolved AI Mystery Is Solved Along With Lessons Learned On Why ChatGPT Became Oddly Obsessed With Gremlins And Goblins

This article discusses the resolution of an AI mystery regarding ChatGPT's unusual focus on gremlins and goblins, along with insights gai...

AI Tools & Products · 1 min · 34 minutes ago

LLMs

[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment

Abstract page for arXiv paper 2602.06869: Uncovering Cross-Objective Interference in Multi-Objective Alignment

arXiv - Machine Learning · 3 min · about 1 hour ago

All Content

Machine Learning

[2311.14756] Task-Distributionally Robust Data-Free Meta-Learning

Abstract page for arXiv paper 2311.14756: Task-Distributionally Robust Data-Free Meta-Learning

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.07956] MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

Abstract page for arXiv paper 2604.07956: MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2604.07725] Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Abstract page for arXiv paper 2604.07725: Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2604.02183] TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning

Abstract page for arXiv paper 2604.02183: TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning

arXiv - AI · 4 min · 24 days ago

LLMs

[2603.11178] PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence

Abstract page for arXiv paper 2603.11178: PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence

arXiv - AI · 4 min · 24 days ago

LLMs

[2602.02188] Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combinatorial Optimization

Abstract page for arXiv paper 2602.02188: Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combi...

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2601.23045] The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

Abstract page for arXiv paper 2601.23045: The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2601.07663] Reasoning Models Will Sometimes Lie About Their Reasoning

Abstract page for arXiv paper 2601.07663: Reasoning Models Will Sometimes Lie About Their Reasoning

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2601.02850] Sample-Efficient Neurosymbolic Deep Reinforcement Learning

Abstract page for arXiv paper 2601.02850: Sample-Efficient Neurosymbolic Deep Reinforcement Learning

arXiv - AI · 3 min · 24 days ago

LLMs

[2509.25835] Chain-in-Tree: Back to Sequential Reasoning in LLM Tree Search

Abstract page for arXiv paper 2509.25835: Chain-in-Tree: Back to Sequential Reasoning in LLM Tree Search

arXiv - AI · 3 min · 24 days ago

LLMs

[2510.07517] When Identity Skews Debate: Anonymization for Bias-Reduced Multi-Agent Reasoning

Abstract page for arXiv paper 2510.07517: When Identity Skews Debate: Anonymization for Bias-Reduced Multi-Agent Reasoning

arXiv - AI · 4 min · 24 days ago

Machine Learning

[2509.24250] Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demonstrations

Abstract page for arXiv paper 2509.24250: Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demo...

arXiv - AI · 4 min · 24 days ago

LLMs

[2508.08992] Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistemic Uncertainty

Abstract page for arXiv paper 2508.08992: Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistem...

arXiv - AI · 4 min · 24 days ago

LLMs

[2507.04736] ChipSeek: Optimizing Verilog Generation via EDA-Integrated Reinforcement Learning

Abstract page for arXiv paper 2507.04736: ChipSeek: Optimizing Verilog Generation via EDA-Integrated Reinforcement Learning

arXiv - AI · 4 min · 24 days ago

LLMs

[2506.17788] Bayesian Social Deduction with Graph-Informed Language Models

Abstract page for arXiv paper 2506.17788: Bayesian Social Deduction with Graph-Informed Language Models

arXiv - AI · 3 min · 24 days ago

Machine Learning

[2604.09537] Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision

Abstract page for arXiv paper 2604.09537: Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision

arXiv - AI · 4 min · 24 days ago

LLMs

[2604.09544] Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism

Abstract page for arXiv paper 2604.09544: Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism

arXiv - AI · 4 min · 24 days ago

LLMs

[2604.09532] Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise

Abstract page for arXiv paper 2604.09532: Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise

arXiv - AI · 4 min · 24 days ago

LLMs

[2604.09531] VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

Abstract page for arXiv paper 2604.09531: VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

arXiv - AI · 4 min · 24 days ago

LLMs

[2604.09529] VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning

Abstract page for arXiv paper 2604.09529: VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning

arXiv - AI · 4 min · 24 days ago

