Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P]

Most embedding models are not Matryoshka-trained, so naive dimension truncation tends to destroy them. I tested a simple alternative: fit...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

Looking for Feedback & Improvement Ideas[P]

Hey everyone, I recently built a machine learning project and would really appreciate some honest feedback from this community. LINK- htt...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

Why Anthropic’s new model has cybersecurity experts rattled

submitted by /u/ThereWas [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

All Content

Llms

[2603.25015] Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

Abstract page for arXiv paper 2603.25015: Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

arXiv - AI · 3 min · 13 days ago

Machine Learning

[2603.25126] MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

Abstract page for arXiv paper 2603.25126: MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

arXiv - AI · 4 min · 13 days ago

Machine Learning

[2603.25024] Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

Abstract page for arXiv paper 2603.25024: Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

arXiv - AI · 4 min · 13 days ago

Llms

[2603.24946] MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development

Abstract page for arXiv paper 2603.24946: MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application...

arXiv - Machine Learning · 4 min · 13 days ago

Machine Learning

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

Abstract page for arXiv paper 2603.25109: MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

arXiv - AI · 4 min · 13 days ago

Llms

[2603.24917] Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

Abstract page for arXiv paper 2603.24917: Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.25099] Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

Abstract page for arXiv paper 2603.25099: Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Opti...

arXiv - AI · 4 min · 13 days ago

Machine Learning

[2603.25083] Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

Abstract page for arXiv paper 2603.25083: Learning domain-invariant features through channel-level sparsification for Out-Of Distribution...

arXiv - AI · 4 min · 13 days ago

Llms

[2603.25063] TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

Abstract page for arXiv paper 2603.25063: TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visual...

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.25056] The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

Abstract page for arXiv paper 2603.25056: The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Create...

arXiv - AI · 4 min · 13 days ago

Machine Learning

[2603.24764] Synthetic Cardiac MRI Image Generation using Deep Generative Models

Abstract page for arXiv paper 2603.24764: Synthetic Cardiac MRI Image Generation using Deep Generative Models

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.25052] Closing the Confidence-Faithfulness Gap in Large Language Models

Abstract page for arXiv paper 2603.25052: Closing the Confidence-Faithfulness Gap in Large Language Models

arXiv - AI · 3 min · 13 days ago

Machine Learning

[2603.24752] Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical-to-Quantum Transfer Learning

Abstract page for arXiv paper 2603.24752: Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical...

arXiv - Machine Learning · 4 min · 13 days ago

Machine Learning

[2603.25006] Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

Abstract page for arXiv paper 2603.25006: Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

arXiv - AI · 4 min · 13 days ago

Llms

[2603.24989] Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model

Abstract page for arXiv paper 2603.24989: Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model

arXiv - AI · 4 min · 13 days ago

Llms

[2603.24986] Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

Abstract page for arXiv paper 2603.24986: Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

arXiv - AI · 3 min · 13 days ago

Machine Learning

[2603.24705] Amortized Inference for Correlated Discrete Choice Models via Equivariant Neural Networks

Abstract page for arXiv paper 2603.24705: Amortized Inference for Correlated Discrete Choice Models via Equivariant Neural Networks

arXiv - Machine Learning · 4 min · 13 days ago

Machine Learning

[2603.24704] Conformal Selective Prediction with General Risk Control

Abstract page for arXiv paper 2603.24704: Conformal Selective Prediction with General Risk Control

arXiv - Machine Learning · 4 min · 13 days ago

Machine Learning

[2603.24654] Spectral methods: crucial for machine learning, natural for quantum computers?

Abstract page for arXiv paper 2603.24654: Spectral methods: crucial for machine learning, natural for quantum computers?

arXiv - Machine Learning · 4 min · 13 days ago

Previous Page 163 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P]

Looking for Feedback & Improvement Ideas[P]

Why Anthropic’s new model has cybersecurity experts rattled

All Content

[2603.25015] Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models

[2603.25126] MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation

[2603.25024] Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

[2603.25112] Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

[2603.24946] MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application Development

[2603.25109] MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness

[2603.24917] Estimating near-verbatim extraction risk in language models with decoding-constrained beam search

[2603.25099] Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization

[2603.25083] Learning domain-invariant features through channel-level sparsification for Out-Of Distribution Generalization

[2603.25063] TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visualization

[2603.25056] The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities

[2603.24764] Synthetic Cardiac MRI Image Generation using Deep Generative Models

[2603.25052] Closing the Confidence-Faithfulness Gap in Large Language Models

[2603.24752] Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical-to-Quantum Transfer Learning

[2603.25006] Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning

[2603.24989] Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model

[2603.24986] Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators

[2603.24705] Amortized Inference for Correlated Discrete Choice Models via Equivariant Neural Networks

[2603.24704] Conformal Selective Prediction with General Risk Control

[2603.24654] Spectral methods: crucial for machine learning, natural for quantum computers?

Related Topics

Stay updated with AI News