Data Science

Data analysis, statistics, and data engineering

Top This Week

Harvard opens more free online courses in AI, data science, programming: Check full list and direct links
Data Science

Harvard opens more free online courses in AI, data science, programming: Check full list and direct links

AI News - General · 9 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[D] Offering licensed Indian language speech datasets (with explicit contributor consent)

Hi everyone, I run a small data initiative where we collect speech datasets in multiple Indian languages directly from contributors who p...

Reddit - Machine Learning · 1 min ·

All Content

[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation
Machine Learning

[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

The paper presents a novel two-stage training approach for Automatic Chord Recognition (ACR), utilizing pseudo-labeling and knowledge dis...

arXiv - Machine Learning · 4 min ·
[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization
Machine Learning

[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization

The paper presents a novel approach to exact discrete stochastic simulation using deep-learning-scale gradient optimization, enhancing sc...

arXiv - Machine Learning · 3 min ·
[2602.19761] Ensemble Machine Learning and Statistical Procedures for Dynamic Predictions of Time-to-Event Outcomes
Machine Learning

[2602.19761] Ensemble Machine Learning and Statistical Procedures for Dynamic Predictions of Time-to-Event Outcomes

This article discusses the use of ensemble machine learning techniques, specifically the Super Learner framework, to improve dynamic pred...

arXiv - Machine Learning · 4 min ·
[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics
Machine Learning

[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics

This article presents a novel approach for continuous telemonitoring of heart failure through personalized speech dynamics, showcasing si...

arXiv - AI · 4 min ·
[2602.19668] Personalized Longitudinal Medical Report Generation via Temporally-Aware Federated Adaptation
Machine Learning

[2602.19668] Personalized Longitudinal Medical Report Generation via Temporally-Aware Federated Adaptation

This article presents a novel framework, FedTAR, for generating personalized longitudinal medical reports using federated learning that a...

arXiv - Machine Learning · 3 min ·
[2602.19600] Manifold-Aligned Generative Transport
Machine Learning

[2602.19600] Manifold-Aligned Generative Transport

The paper presents Manifold-Aligned Generative Transport (MAGT), a novel generative model that efficiently samples from high-dimensional ...

arXiv - Machine Learning · 3 min ·
[2602.19578] Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization
Machine Learning

[2602.19578] Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization

The paper presents Goal-Oriented Influence-Maximizing Data Acquisition (GOIMDA), a novel algorithm for active data acquisition in machine...

arXiv - Machine Learning · 3 min ·
[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining
Llms

[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

This paper explores the limitations of using a single extractor for HTML-to-text conversion in LLM pretraining, proposing a union of mult...

arXiv - Machine Learning · 3 min ·
[2602.19539] Can a Teenager Fool an AI? Evaluating Low-Cost Cosmetic Attacks on Age Estimation Systems
Computer Vision

[2602.19539] Can a Teenager Fool an AI? Evaluating Low-Cost Cosmetic Attacks on Age Estimation Systems

This paper evaluates the effectiveness of low-cost cosmetic modifications in deceiving AI age estimation systems, revealing significant v...

arXiv - Machine Learning · 4 min ·
[2602.19608] Satellite-Based Detection of Looted Archaeological Sites Using Machine Learning
Machine Learning

[2602.19608] Satellite-Based Detection of Looted Archaeological Sites Using Machine Learning

This article presents a machine learning approach to detect looted archaeological sites using satellite imagery, demonstrating significan...

arXiv - AI · 4 min ·
[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis
Nlp

[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis

The paper presents a Tri-Subspace Disentanglement (TSD) framework for Multimodal Sentiment Analysis, enhancing representation by factorin...

arXiv - AI · 3 min ·
[2602.19411] MACE-POLAR-1: A Polarisable Electrostatic Foundation Model for Molecular Chemistry
Llms

[2602.19411] MACE-POLAR-1: A Polarisable Electrostatic Foundation Model for Molecular Chemistry

The paper presents MACE-POLAR-1, a new electrostatic foundation model for molecular chemistry that improves the accuracy of modeling long...

arXiv - Machine Learning · 4 min ·
[2602.19540] A Green Learning Approach to LDCT Image Restoration
Machine Learning

[2602.19540] A Green Learning Approach to LDCT Image Restoration

This paper presents a Green Learning approach for restoring low-dose computed tomography (LDCT) images, emphasizing mathematical transpar...

arXiv - AI · 3 min ·
[2602.19385] Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition
Llms

[2602.19385] Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition

The paper presents ADAMAB, a novel framework for efficient embedding calibration in few-shot pattern recognition, leveraging adaptive dat...

arXiv - Machine Learning · 4 min ·
[2602.19381] Regularity of Second-Order Elliptic PDEs in Spectral Barron Spaces
Machine Learning

[2602.19381] Regularity of Second-Order Elliptic PDEs in Spectral Barron Spaces

This paper establishes a regularity theorem for second-order elliptic PDEs in spectral Barron spaces, demonstrating that solutions can ac...

arXiv - Machine Learning · 3 min ·
[2602.19357] MentalBlackboard: Evaluating Spatial Visualization via Mathematical Transformations
Llms

[2602.19357] MentalBlackboard: Evaluating Spatial Visualization via Mathematical Transformations

The paper 'MentalBlackboard' evaluates spatial visualization capabilities of Vision-Language Models (VLMs) through mathematical transform...

arXiv - Machine Learning · 3 min ·
[2602.19339] SplitLight: An Exploratory Toolkit for Recommender Systems Datasets and Splits
Machine Learning

[2602.19339] SplitLight: An Exploratory Toolkit for Recommender Systems Datasets and Splits

SplitLight is an open-source toolkit designed to enhance the evaluation of recommender systems by providing measurable and comparable dat...

arXiv - Machine Learning · 3 min ·
[2602.19509] Pyramid MoA: A Probabilistic Framework for Cost-Optimized Anytime Inference
Llms

[2602.19509] Pyramid MoA: A Probabilistic Framework for Cost-Optimized Anytime Inference

The article presents Pyramid MoA, a probabilistic framework designed to optimize inference costs in large language models (LLMs) while ma...

arXiv - Machine Learning · 3 min ·
[2602.19329] Dynamic Elasticity Between Forest Loss and Carbon Emissions: A Subnational Panel Analysis of the United States
Data Science

[2602.19329] Dynamic Elasticity Between Forest Loss and Carbon Emissions: A Subnational Panel Analysis of the United States

This article analyzes the dynamic relationship between forest loss and carbon emissions in the U.S. using a comprehensive dataset from 20...

arXiv - Machine Learning · 4 min ·
[2602.19263] Prognostics of Multisensor Systems with Unknown and Unlabeled Failure Modes via Bayesian Nonparametric Process Mixtures
Machine Learning

[2602.19263] Prognostics of Multisensor Systems with Unknown and Unlabeled Failure Modes via Bayesian Nonparametric Process Mixtures

This article presents a novel Bayesian nonparametric framework for prognostics in multisensor systems, addressing challenges with unknown...

arXiv - Machine Learning · 4 min ·
Previous Page 72 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime