Data Science

Data analysis, statistics, and data engineering

Top This Week

Top 10 AI certifications and courses for 2026
Ai Startups

Top 10 AI certifications and courses for 2026

This article reviews the top 10 AI certifications and courses for 2026, highlighting their significance in a rapidly evolving field and t...

AI Events · 15 min ·
[2603.18109] Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions
Machine Learning

[2603.18109] Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions

Abstract page for arXiv paper 2603.18109: Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions

arXiv - AI · 4 min ·
[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?
Llms

[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?

Abstract page for arXiv paper 2509.22367: What Is The Political Content in LLMs' Pre- and Post-Training Data?

arXiv - AI · 4 min ·

All Content

Why is AI so bad at reading PDFs? | The Verge
Ai Infrastructure

Why is AI so bad at reading PDFs? | The Verge

The article explores the challenges AI faces in parsing PDFs, highlighting the limitations of current models and the innovative solutions...

The Verge - AI · 14 min ·
Machine Learning

[D] SIGIR 2026 Reviews are (likely) done. Why the delay in releasing scores?

The wait for SIGIR 2026 review scores feels unusually long this year, raising concerns about the impact on researchers' timelines and pro...

Reddit - Machine Learning · 1 min ·
Inside Chicago’s surveillance panopticon | MIT Technology Review
Nlp

Inside Chicago’s surveillance panopticon | MIT Technology Review

The article explores Chicago's extensive surveillance system, highlighting its implications for public safety and civil liberties, partic...

MIT Technology Review - AI · 20 min ·
Gallup poll reveals more Americans use AI at work
Ai Infrastructure

Gallup poll reveals more Americans use AI at work

A recent Gallup poll reveals that AI adoption among American workers has surged, with 12% using it daily and nearly half using it at leas...

AI Tools & Products · 4 min ·
[2601.01679] Simplex Deep Linear Discriminant Analysis
Machine Learning

[2601.01679] Simplex Deep Linear Discriminant Analysis

The paper presents a novel approach to Deep Linear Discriminant Analysis (Deep LDA) by introducing a constrained formulation that stabili...

arXiv - Machine Learning · 4 min ·
[2511.18554] Online Smoothed Demand Management
Nlp

[2511.18554] Online Smoothed Demand Management

The paper introduces Online Smoothed Demand Management (OSDM), a framework for optimizing energy purchasing and delivery in data centers,...

arXiv - Machine Learning · 4 min ·
[2511.18555] A joint optimization approach to identifying sparse dynamics using least squares kernel collocation
Machine Learning

[2511.18555] A joint optimization approach to identifying sparse dynamics using least squares kernel collocation

The paper presents a novel modeling framework for learning ordinary differential equations (ODEs) from limited and noisy data, enhancing ...

arXiv - Machine Learning · 3 min ·
[2510.15058] The Minimax Lower Bound of Kernel Stein Discrepancy Estimation
Machine Learning

[2510.15058] The Minimax Lower Bound of Kernel Stein Discrepancy Estimation

This paper establishes the minimax lower bound of Kernel Stein Discrepancy (KSD) estimation, demonstrating its optimality and implication...

arXiv - Machine Learning · 3 min ·
[2510.00545] Bayesian Neural Networks for Functional ANOVA model
Machine Learning

[2510.00545] Bayesian Neural Networks for Functional ANOVA model

This paper introduces Bayesian-TPNN, a Bayesian inference approach for the functional ANOVA model using Tensor Product Neural Networks, i...

arXiv - Machine Learning · 3 min ·
[2510.00463] On the Adversarial Robustness of Learning-based Conformal Novelty Detection
Machine Learning

[2510.00463] On the Adversarial Robustness of Learning-based Conformal Novelty Detection

This paper investigates the adversarial robustness of learning-based conformal novelty detection methods, revealing significant vulnerabi...

arXiv - Machine Learning · 4 min ·
[2508.06118] Ensemble-based graph representation of fMRI data for cognitive brain state classification
Machine Learning

[2508.06118] Ensemble-based graph representation of fMRI data for cognitive brain state classification

This article presents an ensemble-based graph representation method for classifying cognitive brain states using fMRI data, achieving hig...

arXiv - Machine Learning · 4 min ·
[2507.17316] Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability
Machine Learning

[2507.17316] Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability

This paper presents a comprehensive study on estimating discrete distributions using Kullback-Leibler divergence, establishing minimax ra...

arXiv - Machine Learning · 4 min ·
[2507.12182] Asymptotic behavior of eigenvalues of large rank perturbations of large random matrices
Machine Learning

[2507.12182] Asymptotic behavior of eigenvalues of large rank perturbations of large random matrices

This paper explores the asymptotic behavior of eigenvalues in large random matrices, particularly focusing on the impact of rank perturba...

arXiv - Machine Learning · 3 min ·
[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model
Llms

[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

AstroMLab 4 introduces a 70B-parameter AI model specialized for astronomy, achieving benchmark-topping performance in Q&A tasks, surpassi...

arXiv - Machine Learning · 4 min ·
[2505.11228] Learning hidden cascades via classification
Machine Learning

[2505.11228] Learning hidden cascades via classification

The paper presents a novel machine learning framework for inferring hidden statuses in social networks, enhancing the understanding of sp...

arXiv - Machine Learning · 4 min ·
[2504.21035] A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage
Data Science

[2504.21035] A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

This article evaluates the effectiveness of textual data sanitization methods, revealing that current techniques may provide a false sens...

arXiv - Machine Learning · 4 min ·
[2503.07313] The influence of missing data mechanisms and simple missing data handling techniques on fairness
Machine Learning

[2503.07313] The influence of missing data mechanisms and simple missing data handling techniques on fairness

This article explores how different missing data mechanisms and handling techniques affect the fairness of machine learning algorithms, r...

arXiv - Machine Learning · 4 min ·
[2502.05351] Deep Generative model that uses physical quantities to generate and retrieve solar magnetic active regions
Machine Learning

[2502.05351] Deep Generative model that uses physical quantities to generate and retrieve solar magnetic active regions

This article presents a deep generative model that utilizes physical quantities to generate and retrieve solar magnetic active regions, e...

arXiv - Machine Learning · 4 min ·
[2502.17160] A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis
Machine Learning

[2502.17160] A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis

This article discusses the limitations of using Fréchet Inception Distance (FID) as an evaluation metric for generative models in retinal...

arXiv - Machine Learning · 4 min ·
[2408.03099] Topic Modeling with Fine-tuning LLMs and Bag of Sentences
Llms

[2408.03099] Topic Modeling with Fine-tuning LLMs and Bag of Sentences

This paper presents FT-Topic, a novel approach for topic modeling that fine-tunes large language models (LLMs) using bags of sentences, o...

arXiv - Machine Learning · 4 min ·
Previous Page 84 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime