Data Science

Data analysis, statistics, and data engineering

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Startups

Top 10 AI certifications and courses for 2026

This article reviews the top 10 AI certifications and courses for 2026, highlighting their significance in a rapidly evolving field and t...

AI Events · 15 min · about 4 hours ago

Machine Learning

[2603.18109] Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions

Abstract page for arXiv paper 2603.18109: Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions

arXiv - AI · 4 min · about 5 hours ago

Llms

[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?

Abstract page for arXiv paper 2509.22367: What Is The Political Content in LLMs' Pre- and Post-Training Data?

arXiv - AI · 4 min · about 5 hours ago

All Content

Ai Infrastructure

Why is AI so bad at reading PDFs? | The Verge

The article explores the challenges AI faces in parsing PDFs, highlighting the limitations of current models and the innovative solutions...

The Verge - AI · 14 min · about 1 month ago

Machine Learning

[D] SIGIR 2026 Reviews are (likely) done. Why the delay in releasing scores?

The wait for SIGIR 2026 review scores feels unusually long this year, raising concerns about the impact on researchers' timelines and pro...

Reddit - Machine Learning · 1 min · about 1 month ago

Nlp

Inside Chicago’s surveillance panopticon | MIT Technology Review

The article explores Chicago's extensive surveillance system, highlighting its implications for public safety and civil liberties, partic...

MIT Technology Review - AI · 20 min · about 1 month ago

Ai Infrastructure

Gallup poll reveals more Americans use AI at work

A recent Gallup poll reveals that AI adoption among American workers has surged, with 12% using it daily and nearly half using it at leas...

AI Tools & Products · 4 min · about 1 month ago

Machine Learning

[2601.01679] Simplex Deep Linear Discriminant Analysis

The paper presents a novel approach to Deep Linear Discriminant Analysis (Deep LDA) by introducing a constrained formulation that stabili...

arXiv - Machine Learning · 4 min · about 1 month ago

Nlp

[2511.18554] Online Smoothed Demand Management

The paper introduces Online Smoothed Demand Management (OSDM), a framework for optimizing energy purchasing and delivery in data centers,...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2511.18555] A joint optimization approach to identifying sparse dynamics using least squares kernel collocation

The paper presents a novel modeling framework for learning ordinary differential equations (ODEs) from limited and noisy data, enhancing ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.15058] The Minimax Lower Bound of Kernel Stein Discrepancy Estimation

This paper establishes the minimax lower bound of Kernel Stein Discrepancy (KSD) estimation, demonstrating its optimality and implication...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.00545] Bayesian Neural Networks for Functional ANOVA model

This paper introduces Bayesian-TPNN, a Bayesian inference approach for the functional ANOVA model using Tensor Product Neural Networks, i...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.00463] On the Adversarial Robustness of Learning-based Conformal Novelty Detection

This paper investigates the adversarial robustness of learning-based conformal novelty detection methods, revealing significant vulnerabi...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2508.06118] Ensemble-based graph representation of fMRI data for cognitive brain state classification

This article presents an ensemble-based graph representation method for classifying cognitive brain states using fMRI data, achieving hig...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.17316] Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability

This paper presents a comprehensive study on estimating discrete distributions using Kullback-Leibler divergence, establishing minimax ra...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.12182] Asymptotic behavior of eigenvalues of large rank perturbations of large random matrices

This paper explores the asymptotic behavior of eigenvalues in large random matrices, particularly focusing on the impact of rank perturba...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

AstroMLab 4 introduces a 70B-parameter AI model specialized for astronomy, achieving benchmark-topping performance in Q&A tasks, surpassi...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2505.11228] Learning hidden cascades via classification

The paper presents a novel machine learning framework for inferring hidden statuses in social networks, enhancing the understanding of sp...

arXiv - Machine Learning · 4 min · about 1 month ago

Data Science

[2504.21035] A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

This article evaluates the effectiveness of textual data sanitization methods, revealing that current techniques may provide a false sens...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2503.07313] The influence of missing data mechanisms and simple missing data handling techniques on fairness

This article explores how different missing data mechanisms and handling techniques affect the fairness of machine learning algorithms, r...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2502.05351] Deep Generative model that uses physical quantities to generate and retrieve solar magnetic active regions

This article presents a deep generative model that utilizes physical quantities to generate and retrieve solar magnetic active regions, e...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2502.17160] A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis

This article discusses the limitations of using Fréchet Inception Distance (FID) as an evaluation metric for generative models in retinal...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2408.03099] Topic Modeling with Fine-tuning LLMs and Bag of Sentences

This paper presents FT-Topic, a novel approach for topic modeling that fine-tunes large language models (LLMs) using bags of sentences, o...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 84 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Data Science

Top This Week

Top 10 AI certifications and courses for 2026

[2603.18109] Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions

[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?

All Content

Why is AI so bad at reading PDFs? | The Verge

[D] SIGIR 2026 Reviews are (likely) done. Why the delay in releasing scores?

Inside Chicago’s surveillance panopticon | MIT Technology Review

Gallup poll reveals more Americans use AI at work

[2601.01679] Simplex Deep Linear Discriminant Analysis

[2511.18554] Online Smoothed Demand Management

[2511.18555] A joint optimization approach to identifying sparse dynamics using least squares kernel collocation

[2510.15058] The Minimax Lower Bound of Kernel Stein Discrepancy Estimation

[2510.00545] Bayesian Neural Networks for Functional ANOVA model

[2510.00463] On the Adversarial Robustness of Learning-based Conformal Novelty Detection

[2508.06118] Ensemble-based graph representation of fMRI data for cognitive brain state classification

[2507.17316] Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability

[2507.12182] Asymptotic behavior of eigenvalues of large rank perturbations of large random matrices

[2505.17592] AstroMLab 4: Benchmark-Topping Performance in Astronomy Q&A with a 70B-Parameter Domain-Specialized Reasoning Model

[2505.11228] Learning hidden cascades via classification

[2504.21035] A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

[2503.07313] The influence of missing data mechanisms and simple missing data handling techniques on fairness

[2502.05351] Deep Generative model that uses physical quantities to generate and retrieve solar magnetic active regions

[2502.17160] A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis

[2408.03099] Topic Modeling with Fine-tuning LLMs and Bag of Sentences

Related Topics

Stay updated with AI News