Data Science

Data analysis, statistics, and data engineering

Top This Week

Top 10 AI certifications and courses for 2026
AI Startups

This article reviews the top 10 AI certifications and courses for 2026, highlighting their significance in a rapidly evolving field and t...

AI Events · 15 min ·
[2603.18109] Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions
Machine Learning

arXiv - AI · 4 min ·
[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?
LLMs

arXiv - AI · 4 min ·

All Content

[2408.07110] Physics-informed graph neural networks for flow field estimation in carotid arteries
Machine Learning

This article presents a novel approach using physics-informed graph neural networks to estimate hemodynamic flow fields in carotid arteri...

arXiv - Machine Learning · 4 min ·
[2312.12715] Learning Performance Maximizing Ensembles with Explainability Guarantees
Machine Learning

This paper presents a method for optimizing the allocation of observations between explainable and black box models, aiming to maximize e...

arXiv - Machine Learning · 3 min ·
[2112.05128] Fair Community Detection and Structure Learning in Heterogeneous Graphical Models
Machine Learning

This paper presents a novel approach for fair community detection in heterogeneous graphical models, ensuring demographic representation ...

arXiv - Machine Learning · 3 min ·
[2602.07875] Harpoon: Generalised Manifold Guidance for Conditional Tabular Diffusion
Machine Learning

The paper introduces HARPOON, a novel method for generating tabular data using generalized manifold guidance, addressing limitations in e...

arXiv - Machine Learning · 3 min ·
[2602.12162] Amortized Molecular Optimization via Group Relative Policy Optimization
Machine Learning

The paper presents GRXForm, a novel approach for molecular optimization using Group Relative Policy Optimization, addressing the limitati...

arXiv - Machine Learning · 3 min ·
[2602.04908] Temporal Pair Consistency for Variance-Reduced Flow Matching
Machine Learning

The paper introduces Temporal Pair Consistency (TPC), a novel approach to reduce variance in flow matching for continuous-time generative...

arXiv - Machine Learning · 3 min ·
[2602.03875] Reversible Deep Learning for 13C NMR in Chemoinformatics: On Structures and Spectra
Machine Learning

This article presents a reversible deep learning model for 13C NMR in chemoinformatics, utilizing an invertible neural network to predict...

arXiv - Machine Learning · 4 min ·
[2602.03175] Probe-then-Commit Multi-Objective Bandits: Theoretical Benefits of Limited Multi-Arm Feedback
Machine Learning

This article presents a novel approach to multi-objective bandit problems through the Probe-then-Commit (PtC) strategy, demonstrating the...

arXiv - Machine Learning · 4 min ·
[2601.11924] Communication-Corruption Coupling and Verification in Cooperative Multi-Objective Bandits
Machine Learning

This paper explores cooperative multi-objective bandits under adversarial corruption, presenting a communication-corruption coupling that...

arXiv - Machine Learning · 4 min ·
[2601.01703] Beyond Homophily: Community Search on Heterophilic Graphs
Data Science

This paper presents Adaptive Community Search (AdaptCS), a novel framework designed to improve community search in heterophilic graphs, o...

arXiv - AI · 4 min ·
[2512.19223] Phase-space entropy at acquisition reflects downstream learnability
Machine Learning

The paper explores how phase-space entropy at the acquisition stage can predict the learnability of downstream models, offering a new met...

arXiv - Machine Learning · 4 min ·
[2512.10877] Guided Transfer Learning for Discrete Diffusion Models
Machine Learning

This paper introduces Guided Transfer Learning (GTL) for discrete diffusion models, addressing challenges in small-data scenarios and off...

arXiv - Machine Learning · 4 min ·
[2512.04954] Amortized Inference of Multi-Modal Posteriors using Likelihood-Weighted Normalizing Flows
Machine Learning

This paper introduces a novel technique for amortized posterior estimation using Normalizing Flows, enhancing inference in high-dimension...

arXiv - Machine Learning · 3 min ·
[2510.13887] Incomplete Multi-view Clustering via Hierarchical Semantic Alignment and Cooperative Completion
AI Safety

This paper presents a novel framework for incomplete multi-view clustering using Hierarchical Semantic Alignment and Cooperative Completi...

arXiv - Machine Learning · 4 min ·
[2511.18945] MIST: Mutual Information Estimation Via Supervised Training
Machine Learning

The paper presents MIST, a novel approach for estimating mutual information using a neural network trained on a large dataset of syntheti...

arXiv - Machine Learning · 4 min ·
[2511.02872] FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels
LLMs

The paper introduces FATE, a benchmark series for formal algebra, designed to assess large language models' capabilities in advanced math...

arXiv - Machine Learning · 4 min ·
[2509.12253] Physics-Informed Neural Networks vs. Physics Models for Non-Invasive Glucose Monitoring: A Comparative Study Under Noise-Stressed Synthetic Conditions
Machine Learning

This study compares Physics-Informed Neural Networks (PINNs) and traditional physics models for non-invasive glucose monitoring under noi...

arXiv - Machine Learning · 3 min ·
[2510.18322] Uncertainty Estimation by Flexible Evidential Deep Learning
Machine Learning

This paper introduces Flexible Evidential Deep Learning (F-EDL), enhancing uncertainty quantification in machine learning by extending th...

arXiv - Machine Learning · 3 min ·
[2509.00479] A Novel Method to Determine Total Oxidant Concentration Produced by Non-Thermal Plasma Based on Image Processing and Machine Learning
Machine Learning

This article presents a novel method for accurately determining total oxidant concentration in non-thermal plasma systems using image pro...

arXiv - Machine Learning · 4 min ·
[2510.09658] Gradient-Sign Masking for Task Vector Transport Across Pre-Trained Models
LLMs

This paper presents Gradient-Sign Masking, a method for transferring task vectors across pre-trained models without additional fine-tunin...

arXiv - Machine Learning · 4 min ·
