Data Science
Data analysis, statistics, and data engineering
Top This Week
UMKC Announces New Master of Science in Artificial Intelligence
UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...
[D] Offering licensed Indian language speech datasets (with explicit contributor consent)
Hi everyone, I run a small data initiative where we collect speech datasets in multiple Indian languages directly from contributors who p...
All Content
[2602.19778] Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation
The paper presents a novel two-stage training approach for Automatic Chord Recognition (ACR), utilizing pseudo-labeling and knowledge dis...
[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization
The paper presents a novel approach to exact discrete stochastic simulation using deep-learning-scale gradient optimization, enhancing sc...
[2602.19761] Ensemble Machine Learning and Statistical Procedures for Dynamic Predictions of Time-to-Event Outcomes
This article discusses the use of ensemble machine learning techniques, specifically the Super Learner framework, to improve dynamic pred...
[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics
This article presents a novel approach for continuous telemonitoring of heart failure through personalized speech dynamics, showcasing si...
[2602.19668] Personalized Longitudinal Medical Report Generation via Temporally-Aware Federated Adaptation
This article presents a novel framework, FedTAR, for generating personalized longitudinal medical reports using federated learning that a...
[2602.19600] Manifold-Aligned Generative Transport
The paper presents Manifold-Aligned Generative Transport (MAGT), a novel generative model that efficiently samples from high-dimensional ...
[2602.19578] Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization
The paper presents Goal-Oriented Influence-Maximizing Data Acquisition (GOIMDA), a novel algorithm for active data acquisition in machine...
[2602.19548] Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining
This paper explores the limitations of using a single extractor for HTML-to-text conversion in LLM pretraining, proposing a union of mult...
[2602.19539] Can a Teenager Fool an AI? Evaluating Low-Cost Cosmetic Attacks on Age Estimation Systems
This paper evaluates the effectiveness of low-cost cosmetic modifications in deceiving AI age estimation systems, revealing significant v...
[2602.19608] Satellite-Based Detection of Looted Archaeological Sites Using Machine Learning
This article presents a machine learning approach to detect looted archaeological sites using satellite imagery, demonstrating significan...
[2602.19585] Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis
The paper presents a Tri-Subspace Disentanglement (TSD) framework for Multimodal Sentiment Analysis, enhancing representation by factorin...
[2602.19411] MACE-POLAR-1: A Polarisable Electrostatic Foundation Model for Molecular Chemistry
The paper presents MACE-POLAR-1, a new electrostatic foundation model for molecular chemistry that improves the accuracy of modeling long...
[2602.19540] A Green Learning Approach to LDCT Image Restoration
This paper presents a Green Learning approach for restoring low-dose computed tomography (LDCT) images, emphasizing mathematical transpar...
[2602.19385] Adaptive Data Augmentation with Multi-armed Bandit: Sample-Efficient Embedding Calibration for Implicit Pattern Recognition
The paper presents ADAMAB, a novel framework for efficient embedding calibration in few-shot pattern recognition, leveraging adaptive dat...
[2602.19381] Regularity of Second-Order Elliptic PDEs in Spectral Barron Spaces
This paper establishes a regularity theorem for second-order elliptic PDEs in spectral Barron spaces, demonstrating that solutions can ac...
[2602.19357] MentalBlackboard: Evaluating Spatial Visualization via Mathematical Transformations
The paper 'MentalBlackboard' evaluates spatial visualization capabilities of Vision-Language Models (VLMs) through mathematical transform...
[2602.19339] SplitLight: An Exploratory Toolkit for Recommender Systems Datasets and Splits
SplitLight is an open-source toolkit designed to enhance the evaluation of recommender systems by providing measurable and comparable dat...
[2602.19509] Pyramid MoA: A Probabilistic Framework for Cost-Optimized Anytime Inference
The article presents Pyramid MoA, a probabilistic framework designed to optimize inference costs in large language models (LLMs) while ma...
[2602.19329] Dynamic Elasticity Between Forest Loss and Carbon Emissions: A Subnational Panel Analysis of the United States
This article analyzes the dynamic relationship between forest loss and carbon emissions in the U.S. using a comprehensive dataset from 20...
[2602.19263] Prognostics of Multisensor Systems with Unknown and Unlabeled Failure Modes via Bayesian Nonparametric Process Mixtures
This article presents a novel Bayesian nonparametric framework for prognostics in multisensor systems, addressing challenges with unknown...