[P] PCA before truncation makes non-Matryoshka embeddings compressible: results on BGE-M3 [P]
Most embedding models are not Matryoshka-trained, so naive dimension truncation tends to destroy them. I tested a simple alternative: fit...
ML algorithms, training, and inference
Most embedding models are not Matryoshka-trained, so naive dimension truncation tends to destroy them. I tested a simple alternative: fit...
Hey everyone, I recently built a machine learning project and would really appreciate some honest feedback from this community. LINK- htt...
submitted by /u/ThereWas [link] [comments]
Abstract page for arXiv paper 2603.25015: Imperative Interference: Social Register Shapes Instruction Topology in Large Language Models
Abstract page for arXiv paper 2603.25126: MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation
Abstract page for arXiv paper 2603.25024: Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method
Abstract page for arXiv paper 2603.25112: Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
Abstract page for arXiv paper 2603.24946: MobileDev-Bench: A Comprehensive Benchmark for Evaluating Language Models on Mobile Application...
Abstract page for arXiv paper 2603.25109: MoireMix: A Formula-Based Data Augmentation for Improving Image Classification Robustness
Abstract page for arXiv paper 2603.24917: Estimating near-verbatim extraction risk in language models with decoding-constrained beam search
Abstract page for arXiv paper 2603.25099: Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Opti...
Abstract page for arXiv paper 2603.25083: Learning domain-invariant features through channel-level sparsification for Out-Of Distribution...
Abstract page for arXiv paper 2603.25063: TopoPilot: Reliable Conversational Workflow Automation for Topological Data Analysis and Visual...
Abstract page for arXiv paper 2603.25056: The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Create...
Abstract page for arXiv paper 2603.24764: Synthetic Cardiac MRI Image Generation using Deep Generative Models
Abstract page for arXiv paper 2603.25052: Closing the Confidence-Faithfulness Gap in Large Language Models
Abstract page for arXiv paper 2603.24752: Autotuning T-PaiNN: Enabling Data-Efficient GNN Interatomic Potential Development via Classical...
Abstract page for arXiv paper 2603.25006: Improving Fine-Grained Rice Leaf Disease Detection via Angular-Compactness Dual Loss Learning
Abstract page for arXiv paper 2603.24989: Learning Rollout from Sampling:An R1-Style Tokenized Traffic Simulation Model
Abstract page for arXiv paper 2603.24986: Rethinking Health Agents: From Siloed AI to Collaborative Decision Mediators
Abstract page for arXiv paper 2603.24705: Amortized Inference for Correlated Discrete Choice Models via Equivariant Neural Networks
Abstract page for arXiv paper 2603.24704: Conformal Selective Prediction with General Risk Control
Abstract page for arXiv paper 2603.24654: Spectral methods: crucial for machine learning, natural for quantum computers?
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime