Data Science

Data analysis, statistics, and data engineering

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

I tried building a memory-first AI… and ended up discovering smaller models can beat larger ones

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

[R] Are there ML approaches for prioritizing and routing “important” signals across complex systems?

I’ve been reading more about attention mechanisms in transformers and how they effectively learn to weight and prioritize relevant inputs...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

[R] Structure Over Scale: Memory-First Reasoning and Depth-Pruned Efficiency in Magnus and Seed Architecture Auto-Discovery

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

Machine Learning

[2602.22376] AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction

AeroDGS presents a novel framework for 4D reconstruction from monocular UAV videos, addressing challenges in depth ambiguity and motion e...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.22278] RETLLM: Training and Data-Free MLLMs for Multimodal Information Retrieval

The paper presents RETLLM, a novel framework for multimodal information retrieval (MMIR) that operates without the need for training or l...

arXiv - Machine Learning · 4 min · about 1 month ago

Nlp

[2602.22275] Deep Accurate Solver for the Geodesic Problem

This article presents a novel deep learning approach for accurately solving the geodesic problem on continuous surfaces, achieving third-...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.22239] VAE-MS: An Asymmetric Variational Autoencoder for Mutational Signature Extraction

The paper introduces VAE-MS, an Asymmetric Variational Autoencoder designed to enhance mutational signature extraction in cancer research...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22236] CrossLLM-Mamba: Multimodal State Space Fusion of LLMs for RNA Interaction Prediction

The article presents CrossLLM-Mamba, a novel framework for RNA interaction prediction that utilizes multimodal state space fusion of larg...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22299] Decoding the Hook: A Multimodal LLM Framework for Analyzing the Hooking Period of Video Ads

This article presents a framework using multimodal large language models (MLLMs) to analyze the 'hooking period' of video ads, focusing o...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22223] SQaLe: A Large Text-to-SQL Corpus Grounded in Real Schemas

The paper introduces SQaLe, a large-scale text-to-SQL dataset designed to enhance the development of models that convert natural language...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.22279] Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging

This paper presents a novel approach to reconstruct audio and images from clipped measurements using self-supervised learning, addressing...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.23360] Model Agreement via Anchoring

The paper presents a method for reducing model disagreement in machine learning by using an anchoring technique, demonstrating its effect...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.23358] A Dataset is Worth 1 MB

The paper presents PLADA, a novel method for efficient dataset transmission in machine learning, significantly reducing payload size whil...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23349] FlashOptim: Optimizers for Memory Efficient Training

FlashOptim introduces innovative optimizers that significantly reduce memory usage in neural network training, enhancing efficiency witho...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.22263] CryoNet.Refine: A One-step Diffusion Model for Rapid Refinement of Structural Models with Cryo-EM Density Map Restraints

CryoNet.Refine introduces a one-step diffusion model for efficiently refining structural models using cryo-EM density maps, offering a si...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.22258] Poisoned Acoustics

The paper 'Poisoned Acoustics' explores training-data poisoning attacks on deep neural networks, demonstrating significant vulnerabilitie...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.23341] Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms

This article presents efficient algorithms for estimating the mean from coarse data, addressing key questions in Gaussian mean estimation...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23336] Differentiable Zero-One Loss via Hypersimplex Projections

This paper presents a novel differentiable approximation to the zero-one loss, enhancing gradient-based optimization in machine learning ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.23305] A Proper Scoring Rule for Virtual Staining

The paper introduces a novel scoring rule for evaluating generative virtual staining models in high-throughput screening, emphasizing the...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.23303] Inferential Mechanics Part 1: Causal Mechanistic Theories of Machine Learning in Chemical Biology with Implications

This article introduces 'Inferential Mechanics,' a framework combining causal theories with machine learning in chemical biology, address...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22247] Multi-Dimensional Spectral Geometry of Biological Knowledge in Single-Cell Transformer Representations

This article explores how single-cell foundation models like scGPT encode biological knowledge through high-dimensional gene representati...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23296] Conformalized Neural Networks for Federated Uncertainty Quantification under Dual Heterogeneity

This article presents FedWQ-CP, a novel approach to federated uncertainty quantification that addresses dual heterogeneity in data and mo...

arXiv - Machine Learning · 4 min · about 1 month ago

Nlp

[2602.23280] Physics Informed Viscous Value Representations

This paper presents a novel approach to offline goal-conditioned reinforcement learning by introducing a physics-informed regularization ...

arXiv - Machine Learning · 3 min · about 1 month ago

Previous Page 33 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Data Science

Top This Week

I tried building a memory-first AI… and ended up discovering smaller models can beat larger ones

[R] Are there ML approaches for prioritizing and routing “important” signals across complex systems?

[R] Structure Over Scale: Memory-First Reasoning and Depth-Pruned Efficiency in Magnus and Seed Architecture Auto-Discovery

All Content

[2602.22376] AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction

[2602.22278] RETLLM: Training and Data-Free MLLMs for Multimodal Information Retrieval

[2602.22275] Deep Accurate Solver for the Geodesic Problem

[2602.22239] VAE-MS: An Asymmetric Variational Autoencoder for Mutational Signature Extraction

[2602.22236] CrossLLM-Mamba: Multimodal State Space Fusion of LLMs for RNA Interaction Prediction

[2602.22299] Decoding the Hook: A Multimodal LLM Framework for Analyzing the Hooking Period of Video Ads

[2602.22223] SQaLe: A Large Text-to-SQL Corpus Grounded in Real Schemas

[2602.22279] Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging

[2602.23360] Model Agreement via Anchoring

[2602.23358] A Dataset is Worth 1 MB

[2602.23349] FlashOptim: Optimizers for Memory Efficient Training

[2602.22263] CryoNet.Refine: A One-step Diffusion Model for Rapid Refinement of Structural Models with Cryo-EM Density Map Restraints

[2602.22258] Poisoned Acoustics

[2602.23341] Mean Estimation from Coarse Data: Characterizations and Efficient Algorithms

[2602.23336] Differentiable Zero-One Loss via Hypersimplex Projections

[2602.23305] A Proper Scoring Rule for Virtual Staining

[2602.23303] Inferential Mechanics Part 1: Causal Mechanistic Theories of Machine Learning in Chemical Biology with Implications

[2602.22247] Multi-Dimensional Spectral Geometry of Biological Knowledge in Single-Cell Transformer Representations

[2602.23296] Conformalized Neural Networks for Federated Uncertainty Quantification under Dual Heterogeneity

[2602.23280] Physics Informed Viscous Value Representations

Related Topics

Stay updated with AI News