Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

[R] Joint Embedding Variational Bayes (TMLR ’26)

Disclosure: first author. The paper was just published in TMLR, and I figured it might be of interest to some people here. It is fairly d...

Reddit - Machine Learning · 1 min ·
Llms

suggestions regarding mlops [D]

Hey, I'm starting with MLOps and am currently watching Vikash Das's videos. Is the playlist good, or should I switch to another one? PS: I've a g...

Reddit - Machine Learning · 1 min ·
OpenAI talks about not talking about goblins | The Verge
Machine Learning

References to goblins and gremlins spiked with the release of GPT-5.1’s ‘Nerdy’ personality, and then spread to other models.

The Verge - AI · 4 min ·

All Content

[2509.05892] Challenges in Deep Learning-Based Small Organ Segmentation: A Benchmarking Perspective for Medical Research with Limited Datasets
Llms

Abstract page for arXiv paper 2509.05892: Challenges in Deep Learning-Based Small Organ Segmentation: A Benchmarking Perspective for Medi...

arXiv - AI · 4 min ·
[2506.13130] ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Llms

Abstract page for arXiv paper 2506.13130: ZINA: Multimodal Fine-grained Hallucination Detection and Editing

arXiv - AI · 3 min ·
[2506.09749] Large Language Models for Combinatorial Optimization of Design Structure Matrix
Llms

Abstract page for arXiv paper 2506.09749: Large Language Models for Combinatorial Optimization of Design Structure Matrix

arXiv - AI · 4 min ·
[2505.15925] VERDI: VLM-Embedded Reasoning for Autonomous Driving
Llms

Abstract page for arXiv paper 2505.15925: VERDI: VLM-Embedded Reasoning for Autonomous Driving

arXiv - AI · 4 min ·
[2503.12575] BalancedDPO: Adaptive Multi-Metric Alignment
Machine Learning

Abstract page for arXiv paper 2503.12575: BalancedDPO: Adaptive Multi-Metric Alignment

arXiv - AI · 4 min ·
[2503.11572] Implicit Bias-Like Patterns in Reasoning Models
Llms

Abstract page for arXiv paper 2503.11572: Implicit Bias-Like Patterns in Reasoning Models

arXiv - AI · 3 min ·
[2501.11782] Human-AI Collaborative Game Testing with Vision Language Models
Llms

Abstract page for arXiv paper 2501.11782: Human-AI Collaborative Game Testing with Vision Language Models

arXiv - AI · 4 min ·
[2501.07813] Talk to Right Specialists: Iterative Routing in Multi-agent Systems for Question Answering
Machine Learning

Abstract page for arXiv paper 2501.07813: Talk to Right Specialists: Iterative Routing in Multi-agent Systems for Question Answering

arXiv - AI · 4 min ·
[2408.11871] MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models
Llms

Abstract page for arXiv paper 2408.11871: MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models

arXiv - AI · 3 min ·
[2406.14194] VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model
Llms

Abstract page for arXiv paper 2406.14194: VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

arXiv - AI · 4 min ·
[2604.01438] ClawSafety: "Safe" LLMs, Unsafe Agents
Llms

Abstract page for arXiv paper 2604.01438: ClawSafety: "Safe" LLMs, Unsafe Agents

arXiv - AI · 4 min ·
[2603.18633] An Onto-Relational-Sophic Framework for Governing Synthetic Minds
Llms

Abstract page for arXiv paper 2603.18633: An Onto-Relational-Sophic Framework for Governing Synthetic Minds

arXiv - AI · 4 min ·
[2603.09127] Collective AI can amplify tiny perturbations into divergent decisions
Llms

Abstract page for arXiv paper 2603.09127: Collective AI can amplify tiny perturbations into divergent decisions

arXiv - AI · 4 min ·
[2602.07943] IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery
Llms

Abstract page for arXiv paper 2602.07943: IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

arXiv - AI · 3 min ·
[2602.03151] Enhancing Foundation VLM Robustness to Missing Modality: Scalable Diffusion for Bi-directional Feature Restoration
Llms

Abstract page for arXiv paper 2602.03151: Enhancing Foundation VLM Robustness to Missing Modality: Scalable Diffusion for Bi-directional ...

arXiv - AI · 4 min ·
[2601.22776] TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization
Llms

Abstract page for arXiv paper 2601.22776: TSPO: Breaking the Double Homogenization Dilemma in Multi-turn Search Policy Optimization

arXiv - AI · 3 min ·
[2601.21439] The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Decision-Making
Llms

Abstract page for arXiv paper 2601.21439: The Paradox of Robustness: Decoupling Rule-Based Logic from Affective Noise in High-Stakes Deci...

arXiv - AI · 4 min ·
[2511.16383] An Agent-Based Framework for the Automatic Validation of Mathematical Optimization Models
Llms

Abstract page for arXiv paper 2511.16383: An Agent-Based Framework for the Automatic Validation of Mathematical Optimization Models

arXiv - AI · 3 min ·
[2601.05656] HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation
Llms

Abstract page for arXiv paper 2601.05656: HAG: Hierarchical Demographic Tree-based Agent Generation for Topic-Adaptive Simulation

arXiv - AI · 3 min ·
[2512.13168] Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows
Machine Learning

Abstract page for arXiv paper 2512.13168: Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

arXiv - AI · 4 min ·