Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Quantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]

Hi all, Stjepan from Manning here. The mods said it's fine if I post this here. I wanted to share a new MEAP (early access) release we th...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”

What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained | WIRED

Today on “Uncanny Valley,” we’re diving into recent reports that the Trump administration is considering an executive order that would es...

Wired - AI · 29 min · about 1 hour ago

All Content

Llms

[2604.06185] Benchmarking LLM Tool-Use in the Wild

Abstract page for arXiv paper 2604.06185: Benchmarking LLM Tool-Use in the Wild

arXiv - AI · 3 min · 29 days ago

Machine Learning

[2604.06176] Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model

Abstract page for arXiv paper 2604.06176: Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qw...

arXiv - AI · 3 min · 29 days ago

Llms

[2604.06172] EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation

Abstract page for arXiv paper 2604.06172: EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation

arXiv - AI · 3 min · 29 days ago

Machine Learning

[2604.06173] Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA

Abstract page for arXiv paper 2604.06173: Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA

arXiv - AI · 3 min · 29 days ago

Llms

[2511.10354] Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates

Abstract page for arXiv paper 2511.10354: Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engine...

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2405.03420] Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation

Abstract page for arXiv paper 2405.03420: Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segment...

arXiv - AI · 3 min · 29 days ago

Llms

[2604.07236] How Much LLM Does a Self-Revising Agent Actually Need?

Abstract page for arXiv paper 2604.07236: How Much LLM Does a Self-Revising Agent Actually Need?

arXiv - AI · 4 min · 29 days ago

Llms

[2604.07003] EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration

Abstract page for arXiv paper 2604.07003: EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayes...

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2604.06695] Reasoning Fails Where Step Flow Breaks

Abstract page for arXiv paper 2604.06695: Reasoning Fails Where Step Flow Breaks

arXiv - AI · 3 min · 29 days ago

Llms

[2604.06820] Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

Abstract page for arXiv paper 2604.06820: Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

arXiv - AI · 3 min · 29 days ago

Llms

[2604.06747] TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design

Abstract page for arXiv paper 2604.06747: TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2604.06779] FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

Abstract page for arXiv paper 2604.06779: FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

arXiv - AI · 3 min · 29 days ago

Machine Learning

[2604.06691] KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2604.06691: KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning

arXiv - AI · 4 min · 29 days ago

Llms

[2604.06628] Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Abstract page for arXiv paper 2604.06628: Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and M...

arXiv - AI · 3 min · 29 days ago

Llms

[2604.06562] On Emotion-Sensitive Decision Making of Small Language Model Agents

Abstract page for arXiv paper 2604.06562: On Emotion-Sensitive Decision Making of Small Language Model Agents

arXiv - AI · 3 min · 29 days ago

Llms

[2604.06389] SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio

Abstract page for arXiv paper 2604.06389: SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio

arXiv - AI · 4 min · 29 days ago

Llms

[2604.06233] Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules

Abstract page for arXiv paper 2604.06233: Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2603.17812] ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

Abstract page for arXiv paper 2603.17812: ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

arXiv - AI · 3 min · 29 days ago

Machine Learning

[2603.14135] Conditional flow matching for physics-constrained inverse problems with finite training data

Abstract page for arXiv paper 2603.14135: Conditional flow matching for physics-constrained inverse problems with finite training data

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2603.13354] AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

Abstract page for arXiv paper 2603.13354: AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

arXiv - Machine Learning · 4 min · 29 days ago

Previous Page 354 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Quantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]

We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained | WIRED

All Content

[2604.06185] Benchmarking LLM Tool-Use in the Wild

[2604.06176] Robustness Risk of Conversational Retrieval: Identifying and Mitigating Noise Sensitivity in Qwen3-Embedding Model

[2604.06172] EviSnap: Faithful Evidence-Cited Explanations for Cold-Start Cross-Domain Recommendation

[2604.06173] Beyond Case Law: Evaluating Structure-Aware Retrieval and Safety in Statute-Centric Legal QA

[2511.10354] Knowledge Graphs Generation from Cultural Heritage Texts: Combining LLMs and Ontological Engineering for Scholarly Debates

[2405.03420] Implantable Adaptive Cells: A Novel Enhancement for Pre-Trained U-Nets in Medical Image Segmentation

[2604.07236] How Much LLM Does a Self-Revising Agent Actually Need?

[2604.07003] EmoMAS: Emotion-Aware Multi-Agent System for High-Stakes Edge-Deployable Negotiation with Bayesian Orchestration

[2604.06695] Reasoning Fails Where Step Flow Breaks

[2604.06820] Beyond Surface Judgments: Human-Grounded Risk Evaluation of LLM-Generated Disinformation

[2604.06747] TurboAgent: An LLM-Driven Autonomous Multi-Agent Framework for Turbomachinery Aerodynamic Design

[2604.06779] FVD: Inference-Time Alignment of Diffusion Models via Fleming-Viot Resampling

[2604.06691] KD-MARL: Resource-Aware Knowledge Distillation in Multi-Agent Reinforcement Learning

[2604.06628] Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

[2604.06562] On Emotion-Sensitive Decision Making of Small Language Model Agents

[2604.06389] SELFDOUBT: Uncertainty Quantification for Reasoning LLMs via the Hedge-to-Verify Ratio

[2604.06233] Blind Refusal: Language Models Refuse to Help Users Evade Unjust, Absurd, and Illegitimate Rules

[2603.17812] ChopGrad: Pixel-Wise Losses for Latent Video Diffusion via Truncated Backpropagation

[2603.14135] Conditional flow matching for physics-constrained inverse problems with finite training data

[2603.13354] AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

Related Topics

Stay updated with AI News