Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

I am not an "anti" like this guy, but still an interesting video of person interacting with chat 4o

(Posting Here because removed by Chatgpt Complaints moderators because the model here is 4o, and refuse to believe there were any safety ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Unsolved AI Mystery Is Solved Along With Lessons Learned On Why ChatGPT Became Oddly Obsessed With Gremlins And Goblins

This article discusses the resolution of an AI mystery regarding ChatGPT's unusual focus on gremlins and goblins, along with insights gai...

AI Tools & Products · 1 min ·
[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment
Llms

[2602.06869] Uncovering Cross-Objective Interference in Multi-Objective Alignment

Abstract page for arXiv paper 2602.06869: Uncovering Cross-Objective Interference in Multi-Objective Alignment

arXiv - Machine Learning · 3 min ·

All Content

[2311.14756] Task-Distributionally Robust Data-Free Meta-Learning
Machine Learning

[2311.14756] Task-Distributionally Robust Data-Free Meta-Learning

Abstract page for arXiv paper 2311.14756: Task-Distributionally Robust Data-Free Meta-Learning

arXiv - AI · 4 min ·
[2604.07956] MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems
Machine Learning

[2604.07956] MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

Abstract page for arXiv paper 2604.07956: MONETA: Multimodal Industry Classification through Geographic Information with Multi Agent Systems

arXiv - AI · 3 min ·
[2604.07725] Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution
Machine Learning

[2604.07725] Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Abstract page for arXiv paper 2604.07725: Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

arXiv - AI · 4 min ·
[2604.02183] TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning
Machine Learning

[2604.02183] TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning

Abstract page for arXiv paper 2604.02183: TRU: Targeted Reverse Update for Efficient Multimodal Recommendation Unlearning

arXiv - AI · 4 min ·
[2603.11178] PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence
Llms

[2603.11178] PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence

Abstract page for arXiv paper 2603.11178: PACED: Distillation and On-Policy Self-Distillation at the Frontier of Student Competence

arXiv - AI · 4 min ·
[2602.02188] Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combinatorial Optimization
Llms

[2602.02188] Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combinatorial Optimization

Abstract page for arXiv paper 2602.02188: Reasoning in a Combinatorial and Constrained World: Benchmarking LLMs on Natural-Language Combi...

arXiv - AI · 4 min ·
[2601.23045] The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?
Machine Learning

[2601.23045] The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

Abstract page for arXiv paper 2601.23045: The Hot Mess of AI: How Does Misalignment Scale With Model Intelligence and Task Complexity?

arXiv - AI · 4 min ·
[2601.07663] Reasoning Models Will Sometimes Lie About Their Reasoning
Machine Learning

[2601.07663] Reasoning Models Will Sometimes Lie About Their Reasoning

Abstract page for arXiv paper 2601.07663: Reasoning Models Will Sometimes Lie About Their Reasoning

arXiv - AI · 3 min ·
[2601.02850] Sample-Efficient Neurosymbolic Deep Reinforcement Learning
Machine Learning

[2601.02850] Sample-Efficient Neurosymbolic Deep Reinforcement Learning

Abstract page for arXiv paper 2601.02850: Sample-Efficient Neurosymbolic Deep Reinforcement Learning

arXiv - AI · 3 min ·
[2509.25835] Chain-in-Tree: Back to Sequential Reasoning in LLM Tree Search
Llms

[2509.25835] Chain-in-Tree: Back to Sequential Reasoning in LLM Tree Search

Abstract page for arXiv paper 2509.25835: Chain-in-Tree: Back to Sequential Reasoning in LLM Tree Search

arXiv - AI · 3 min ·
[2510.07517] When Identity Skews Debate: Anonymization for Bias-Reduced Multi-Agent Reasoning
Llms

[2510.07517] When Identity Skews Debate: Anonymization for Bias-Reduced Multi-Agent Reasoning

Abstract page for arXiv paper 2510.07517: When Identity Skews Debate: Anonymization for Bias-Reduced Multi-Agent Reasoning

arXiv - AI · 4 min ·
[2509.24250] Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demonstrations
Machine Learning

[2509.24250] Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demonstrations

Abstract page for arXiv paper 2509.24250: Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demo...

arXiv - AI · 4 min ·
[2508.08992] Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistemic Uncertainty
Llms

[2508.08992] Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistemic Uncertainty

Abstract page for arXiv paper 2508.08992: Rethinking Prospect Theory for LLMs: Revealing the Instability of Decision-Making under Epistem...

arXiv - AI · 4 min ·
[2507.04736] ChipSeek: Optimizing Verilog Generation via EDA-Integrated Reinforcement Learning
Llms

[2507.04736] ChipSeek: Optimizing Verilog Generation via EDA-Integrated Reinforcement Learning

Abstract page for arXiv paper 2507.04736: ChipSeek: Optimizing Verilog Generation via EDA-Integrated Reinforcement Learning

arXiv - AI · 4 min ·
[2506.17788] Bayesian Social Deduction with Graph-Informed Language Models
Llms

[2506.17788] Bayesian Social Deduction with Graph-Informed Language Models

Abstract page for arXiv paper 2506.17788: Bayesian Social Deduction with Graph-Informed Language Models

arXiv - AI · 3 min ·
[2604.09537] Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision
Machine Learning

[2604.09537] Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision

Abstract page for arXiv paper 2604.09537: Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision

arXiv - AI · 4 min ·
[2604.09544] Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism
Llms

[2604.09544] Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism

Abstract page for arXiv paper 2604.09544: Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism

arXiv - AI · 4 min ·
[2604.09532] Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise
Llms

[2604.09532] Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise

Abstract page for arXiv paper 2604.09532: Seeing is Believing: Robust Vision-Guided Cross-Modal Prompt Learning under Label Noise

arXiv - AI · 4 min ·
[2604.09531] VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images
Llms

[2604.09531] VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

Abstract page for arXiv paper 2604.09531: VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images

arXiv - AI · 4 min ·
[2604.09529] VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning
Llms

[2604.09529] VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning

Abstract page for arXiv paper 2604.09529: VL-Calibration: Decoupled Confidence Calibration for Large Vision-Language Models Reasoning

arXiv - AI · 4 min ·
Previous Page 335 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime