Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

Quantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]

Hi all, Stjepan from Manning here. The mods said it's fine if I post this here. I wanted to share a new MEAP (early access) release we th...

Reddit - Machine Learning · 1 min ·
Llms

We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”

What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...

Reddit - Artificial Intelligence · 1 min ·
Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained | WIRED
Machine Learning

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained | WIRED

Today on “Uncanny Valley,” we’re diving into recent reports that the Trump administration is considering an executive order that would es...

Wired - AI · 29 min ·

All Content

[2604.06185] Benchmarking LLM Tool-Use in the Wild
Llms

[2604.06185] Benchmarking LLM Tool-Use in the Wild

Abstract page for arXiv paper 2604.06185: Benchmarking LLM Tool-Use in the Wild

arXiv - AI · 3 min ·
[2604.06176] Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model
Machine Learning

[2604.06176] Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model

Abstract page for arXiv paper 2604.06176: Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qw...

arXiv - AI · 3 min ·
[2604.06172] EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation
Llms

[2604.06172] EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation

Abstract page for arXiv paper 2604.06172: EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation

arXiv - AI · 3 min ·
[2604.06173] Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA
Machine Learning

[2604.06173] Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA

Abstract page for arXiv paper 2604.06173: Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA

arXiv - AI · 3 min ·
[2511.10354] Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates
Llms

[2511.10354] Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates

Abstract page for arXiv paper 2511.10354: Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engine...

arXiv - AI · 4 min ·
[2405.03420] Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation
Machine Learning

[2405.03420] Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation

Abstract page for arXiv paper 2405.03420: Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segment...

arXiv - AI · 3 min ·
[2604.07236] How Much LLM Does a Self-Revising Agent Actually Need?
Llms

[2604.07236] How Much LLM Does a Self-Revising Agent Actually Need?

Abstract page for arXiv paper 2604.07236: How Much LLM Does a Self-Revising Agent Actually Need?

arXiv - AI · 4 min ·
[2604.07003] EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration
Llms

[2604.07003] EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration

Abstract page for arXiv paper 2604.07003: EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayes...

arXiv - AI · 4 min ·
[2604.06695] Reasoning Fails Where Step Flow Breaks
Machine Learning

[2604.06695] Reasoning Fails Where Step Flow Breaks

Abstract page for arXiv paper 2604.06695: Reasoning Fails Where Step Flow Breaks

arXiv - AI · 3 min ·
[2604.06820] Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation
Llms

[2604.06820] Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

Abstract page for arXiv paper 2604.06820: Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

arXiv - AI · 3 min ·
[2604.06747] TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design
Llms

[2604.06747] TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design

Abstract page for arXiv paper 2604.06747: TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design

arXiv - AI · 4 min ·
[2604.06779] FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling
Machine Learning

[2604.06779] FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

Abstract page for arXiv paper 2604.06779: FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

arXiv - AI · 3 min ·
[2604.06691] KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning
Machine Learning

[2604.06691] KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2604.06691: KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning

arXiv - AI · 4 min ·
[2604.06628] Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability
Llms

[2604.06628] Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Abstract page for arXiv paper 2604.06628: Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and M...

arXiv - AI · 3 min ·
[2604.06562] On Emotion-Sensitive Decision Making of Small Language Model Agents
Llms

[2604.06562] On Emotion-Sensitive Decision Making of Small Language Model Agents

Abstract page for arXiv paper 2604.06562: On Emotion-Sensitive Decision Making of Small Language Model Agents

arXiv - AI · 3 min ·
[2604.06389] SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio
Llms

[2604.06389] SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio

Abstract page for arXiv paper 2604.06389: SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio

arXiv - AI · 4 min ·
[2604.06233] Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules
Llms

[2604.06233] Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules

Abstract page for arXiv paper 2604.06233: Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules

arXiv - AI · 4 min ·
[2603.17812] ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation
Machine Learning

[2603.17812] ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

Abstract page for arXiv paper 2603.17812: ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

arXiv - AI · 3 min ·
[2603.14135] Conditional flow matching for physics-constrained inverse problems with finite training data
Machine Learning

[2603.14135] Conditional flow matching for physics-constrained inverse problems with finite training data

Abstract page for arXiv paper 2603.14135: Conditional flow matching for physics-constrained inverse problems with finite training data

arXiv - Machine Learning · 4 min ·
[2603.13354] AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification
Llms

[2603.13354] AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

Abstract page for arXiv paper 2603.13354: AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

arXiv - Machine Learning · 4 min ·
Previous Page 354 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime