Data Science

Data analysis, statistics, and data engineering

Top This Week

Harvard opens more free online courses in AI, data science, programming: Check full list and direct links
Data Science

Harvard opens more free online courses in AI, data science, programming: Check full list and direct links

AI News - General · 9 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[D] Offering licensed Indian language speech datasets (with explicit contributor consent)

Hi everyone, I run a small data initiative where we collect speech datasets in multiple Indian languages directly from contributors who p...

Reddit - Machine Learning · 1 min ·

All Content

[2602.18718] Stochastic Gradient Variational Inference with Price's Gradient Estimator from Bures-Wasserstein to Parameter Space
Machine Learning

[2602.18718] Stochastic Gradient Variational Inference with Price's Gradient Estimator from Bures-Wasserstein to Parameter Space

This paper presents advancements in Stochastic Gradient Variational Inference (SGVI) using Price's Gradient Estimator, demonstrating comp...

arXiv - Machine Learning · 4 min ·
[2602.19177] Next Reply Prediction X Dataset: Linguistic Discrepancies in Naively Generated Content
Llms

[2602.19177] Next Reply Prediction X Dataset: Linguistic Discrepancies in Naively Generated Content

The paper introduces the Next Reply Prediction X Dataset, addressing linguistic discrepancies in content generated by Large Language Mode...

arXiv - AI · 3 min ·
[2602.18715] A Data-Driven Method to Map the Functional Organisation of Human Brain White Matter
Machine Learning

[2602.18715] A Data-Driven Method to Map the Functional Organisation of Human Brain White Matter

This article presents a data-driven method to map the functional organization of human brain white matter, integrating diffusion and func...

arXiv - Machine Learning · 4 min ·
[2602.19171] HistCAD: Geometrically Constrained Parametric History-based CAD Dataset
Machine Learning

[2602.19171] HistCAD: Geometrically Constrained Parametric History-based CAD Dataset

The paper presents HistCAD, a comprehensive dataset for parametric CAD modeling that incorporates geometric constraints and functional se...

arXiv - AI · 3 min ·
[2602.19153] Constrained Diffusion for Accelerated Structure Relaxation of Inorganic Solids with Point Defects
Generative Ai

[2602.19153] Constrained Diffusion for Accelerated Structure Relaxation of Inorganic Solids with Point Defects

This article presents a novel generative framework for simulating point defects in inorganic solids, enhancing structure relaxation proce...

arXiv - Machine Learning · 3 min ·
[2602.19156] Artefact-Aware Fungal Detection in Dermatophytosis: A Real-Time Transformer-Based Approach for KOH Microscopy
Machine Learning

[2602.19156] Artefact-Aware Fungal Detection in Dermatophytosis: A Real-Time Transformer-Based Approach for KOH Microscopy

This study presents a transformer-based framework for detecting fungal elements in dermatophytosis using KOH microscopy, achieving high a...

arXiv - AI · 4 min ·
[2602.18642] Auto Quantum Machine Learning for Multisource Classification
Machine Learning

[2602.18642] Auto Quantum Machine Learning for Multisource Classification

The paper presents an automated quantum machine learning (AQML) approach for multisource classification, demonstrating improved accuracy ...

arXiv - Machine Learning · 3 min ·
[2602.19138] CRCC: Contrast-Based Robust Cross-Subject and Cross-Site Representation Learning for EEG
Machine Learning

[2602.19138] CRCC: Contrast-Based Robust Cross-Subject and Cross-Site Representation Learning for EEG

The paper presents CRCC, a novel framework for improving EEG-based neural decoding models' generalization across different acquisition si...

arXiv - AI · 3 min ·
[2602.18573] Multiclass Calibration Assessment and Recalibration of Probability Predictions via the Linear Log Odds Calibration Function
Machine Learning

[2602.18573] Multiclass Calibration Assessment and Recalibration of Probability Predictions via the Linear Log Odds Calibration Function

The paper presents a novel method for assessing and recalibrating probability predictions in multiclass classification tasks, addressing ...

arXiv - Machine Learning · 4 min ·
[2602.18525] Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity
Machine Learning

[2602.18525] Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity

This paper evaluates the effectiveness of generative metrics in predicting the performance of YOLO object detection models across various...

arXiv - Machine Learning · 4 min ·
[2602.19087] Detecting Cybersecurity Threats by Integrating Explainable AI with SHAP Interpretability and Strategic Data Sampling
Machine Learning

[2602.19087] Detecting Cybersecurity Threats by Integrating Explainable AI with SHAP Interpretability and Strategic Data Sampling

This article presents a novel framework for detecting cybersecurity threats by integrating Explainable AI (XAI) with SHAP interpretabilit...

arXiv - Machine Learning · 3 min ·
[2602.18487] The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder
Nlp

[2602.18487] The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

The paper presents GLiNER-bi-Encoder, a new architecture for Named Entity Recognition (NER) that enhances efficiency and scalability, ena...

arXiv - Machine Learning · 3 min ·
[2602.18489] DCInject: Persistent Backdoor Attacks via Frequency Manipulation in Personal Federated Learning
Machine Learning

[2602.18489] DCInject: Persistent Backdoor Attacks via Frequency Manipulation in Personal Federated Learning

The paper presents DCInject, a novel backdoor attack method targeting personalized federated learning (PFL) systems, demonstrating high a...

arXiv - Machine Learning · 3 min ·
[2602.18482] Boltzmann Generators for Condensed Matter via Riemannian Flow Matching
Machine Learning

[2602.18482] Boltzmann Generators for Condensed Matter via Riemannian Flow Matching

This article presents a novel approach using Riemannian flow matching to enhance Boltzmann generators for sampling equilibrium distributi...

arXiv - Machine Learning · 3 min ·
[2602.19028] The Metaphysics We Train: A Heideggerian Reading of Machine Learning
Machine Learning

[2602.19028] The Metaphysics We Train: A Heideggerian Reading of Machine Learning

This paper explores machine learning through a Heideggerian lens, highlighting insights on algorithmic opacity, the limitations of calcul...

arXiv - Machine Learning · 3 min ·
[2602.19025] Routing-Aware Explanations for Mixture of Experts Graph Models in Malware Detection
Machine Learning

[2602.19025] Routing-Aware Explanations for Mixture of Experts Graph Models in Malware Detection

This article presents a novel approach to malware detection using Mixture-of-Experts (MoE) graph models, emphasizing routing-aware explan...

arXiv - AI · 4 min ·
[2602.19022] An interpretable framework using foundation models for fish sex identification
Llms

[2602.19022] An interpretable framework using foundation models for fish sex identification

The paper presents FishProtoNet, a non-invasive computer vision framework for accurately identifying the sex of delta smelt, an endangere...

arXiv - AI · 4 min ·
[2602.18962] NeuroWise: A Multi-Agent LLM "Glass-Box" System for Practicing Double-Empathy Communication with Autistic Partners
Llms

[2602.18962] NeuroWise: A Multi-Agent LLM "Glass-Box" System for Practicing Double-Empathy Communication with Autistic Partners

NeuroWise is a multi-agent LLM system designed to enhance double-empathy communication between neurotypical and autistic individuals, dem...

arXiv - AI · 3 min ·
[2602.20152] Behavior Learning (BL): Learning Hierarchical Optimization Structures from Data
Machine Learning

[2602.20152] Behavior Learning (BL): Learning Hierarchical Optimization Structures from Data

The paper introduces Behavior Learning (BL), a machine learning framework that learns interpretable optimization structures from data, en...

arXiv - AI · 3 min ·
[2602.20111] Reliable Abstention under Adversarial Injections: Tight Lower Bounds and New Upper Bounds
Machine Learning

[2602.20111] Reliable Abstention under Adversarial Injections: Tight Lower Bounds and New Upper Bounds

This paper explores reliable abstention in online learning under adversarial injections, presenting new lower and upper bounds for error ...

arXiv - Machine Learning · 4 min ·
Previous Page 74 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime