Data Science

Data analysis, statistics, and data engineering

Top This Week

Machine Learning

I tried building a memory-first AI… and ended up discovering smaller models can beat larger ones

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[R] Are there ML approaches for prioritizing and routing “important” signals across complex systems?

I’ve been reading more about attention mechanisms in transformers and how they effectively learn to weight and prioritize relevant inputs...

Reddit - Machine Learning · 1 min ·
Machine Learning

[R] Structure Over Scale: Memory-First Reasoning and Depth-Pruned Efficiency in Magnus and Seed Architecture Auto-Discovery

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Machine Learning · 1 min ·

All Content

[2602.22376] AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction
Machine Learning

[2602.22376] AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction

AeroDGS presents a novel framework for 4D reconstruction from monocular UAV videos, addressing challenges in depth ambiguity and motion e...

arXiv - AI · 4 min ·
[2602.22278] RETLLM: Training and Data-Free MLLMs for Multimodal Information Retrieval
Llms

[2602.22278] RETLLM: Training and Data-Free MLLMs for Multimodal Information Retrieval

The paper presents RETLLM, a novel framework for multimodal information retrieval (MMIR) that operates without the need for training or l...

arXiv - Machine Learning · 4 min ·
[2602.22275] Deep Accurate Solver for the Geodesic Problem
Nlp

[2602.22275] Deep Accurate Solver for the Geodesic Problem

This article presents a novel deep learning approach for accurately solving the geodesic problem on continuous surfaces, achieving third-...

arXiv - Machine Learning · 4 min ·
[2602.22239] VAE-MS: An Asymmetric Variational Autoencoder for Mutational Signature Extraction
Machine Learning

[2602.22239] VAE-MS: An Asymmetric Variational Autoencoder for Mutational Signature Extraction

The paper introduces VAE-MS, an Asymmetric Variational Autoencoder designed to enhance mutational signature extraction in cancer research...

arXiv - Machine Learning · 4 min ·
[2602.22236] CrossLLM-Mamba: Multimodal State Space Fusion of LLMs for RNA Interaction Prediction
Llms

[2602.22236] CrossLLM-Mamba: Multimodal State Space Fusion of LLMs for RNA Interaction Prediction

The article presents CrossLLM-Mamba, a novel framework for RNA interaction prediction that utilizes multimodal state space fusion of larg...

arXiv - Machine Learning · 4 min ·
[2602.22299] Decoding the Hook: A Multimodal LLM Framework for Analyzing the Hooking Period of Video Ads
Llms

[2602.22299] Decoding the Hook: A Multimodal LLM Framework for Analyzing the Hooking Period of Video Ads

This article presents a framework using multimodal large language models (MLLMs) to analyze the 'hooking period' of video ads, focusing o...

arXiv - Machine Learning · 4 min ·
[2602.22223] SQaLe: A Large Text-to-SQL Corpus Grounded in Real Schemas
Llms

[2602.22223] SQaLe: A Large Text-to-SQL Corpus Grounded in Real Schemas

The paper introduces SQaLe, a large-scale text-to-SQL dataset designed to enhance the development of models that convert natural language...

arXiv - Machine Learning · 3 min ·
[2602.22279] Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging
Machine Learning

[2602.22279] Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging

This paper presents a novel approach to reconstruct audio and images from clipped measurements using self-supervised learning, addressing...

arXiv - AI · 3 min ·
[2602.23360] Model Agreement via Anchoring
Machine Learning

[2602.23360] Model Agreement via Anchoring

The paper presents a method for reducing model disagreement in machine learning by using an anchoring technique, demonstrating its effect...

arXiv - AI · 4 min ·
[2602.23358] A Dataset is Worth 1 MB
Machine Learning

[2602.23358] A Dataset is Worth 1 MB

The paper presents PLADA, a novel method for efficient dataset transmission in machine learning, significantly reducing payload size whil...

arXiv - Machine Learning · 4 min ·
[2602.23349] FlashOptim: Optimizers for Memory Efficient Training
Machine Learning

[2602.23349] FlashOptim: Optimizers for Memory Efficient Training

FlashOptim introduces innovative optimizers that significantly reduce memory usage in neural network training, enhancing efficiency witho...

arXiv - AI · 4 min ·
[2602.22263] CryoNet.Refine: A One-step Diffusion Model for Rapid Refinement of Structural Models with Cryo-EM Density Map Restraints
Machine Learning

[2602.22263] CryoNet.Refine: A One-step Diffusion Model for Rapid Refinement of Structural Models with Cryo-EM Density Map Restraints

CryoNet.Refine introduces a one-step diffusion model for efficiently refining structural models using cryo-EM density maps, offering a si...

arXiv - AI · 4 min ·
[2602.22258] Poisoned Acoustics
Machine Learning

[2602.22258] Poisoned Acoustics

The paper 'Poisoned Acoustics' explores training-data poisoning attacks on deep neural networks, demonstrating significant vulnerabilitie...

arXiv - AI · 3 min ·
[2602.23341] Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms
Machine Learning

[2602.23341] Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms

This article presents efficient algorithms for estimating the mean from coarse data, addressing key questions in Gaussian mean estimation...

arXiv - Machine Learning · 4 min ·
[2602.23336] Differentiable Zero-One Loss via Hypersimplex Projections
Machine Learning

[2602.23336] Differentiable Zero-One Loss via Hypersimplex Projections

This paper presents a novel differentiable approximation to the zero-one loss, enhancing gradient-based optimization in machine learning ...

arXiv - Machine Learning · 3 min ·
[2602.23305] A Proper Scoring Rule for Virtual Staining
Machine Learning

[2602.23305] A Proper Scoring Rule for Virtual Staining

The paper introduces a novel scoring rule for evaluating generative virtual staining models in high-throughput screening, emphasizing the...

arXiv - Machine Learning · 3 min ·
[2602.23303] Inferential Mechanics Part 1: Causal Mechanistic Theories of Machine Learning in Chemical Biology with Implications
Machine Learning

[2602.23303] Inferential Mechanics Part 1: Causal Mechanistic Theories of Machine Learning in Chemical Biology with Implications

This article introduces 'Inferential Mechanics,' a framework combining causal theories with machine learning in chemical biology, address...

arXiv - Machine Learning · 4 min ·
[2602.22247] Multi-Dimensional Spectral Geometry of Biological Knowledge in Single-Cell Transformer Representations
Llms

[2602.22247] Multi-Dimensional Spectral Geometry of Biological Knowledge in Single-Cell Transformer Representations

This article explores how single-cell foundation models like scGPT encode biological knowledge through high-dimensional gene representati...

arXiv - Machine Learning · 4 min ·
[2602.23296] Conformalized Neural Networks for Federated Uncertainty Quantification under Dual Heterogeneity
Machine Learning

[2602.23296] Conformalized Neural Networks for Federated Uncertainty Quantification under Dual Heterogeneity

This article presents FedWQ-CP, a novel approach to federated uncertainty quantification that addresses dual heterogeneity in data and mo...

arXiv - Machine Learning · 4 min ·
[2602.23280] Physics Informed Viscous Value Representations
Nlp

[2602.23280] Physics Informed Viscous Value Representations

This paper presents a novel approach to offline goal-conditioned reinforcement learning by introducing a physics-informed regularization ...

arXiv - Machine Learning · 3 min ·
Previous Page 33 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime