Data Science

Data analysis, statistics, and data engineering

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2601.21463] Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs
Llms

[2601.21463] Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs

Abstract page for arXiv paper 2601.21463: Unifying Speech Editing Detection and Content Localization via Prior-Enhanced Audio LLMs

arXiv - AI · 4 min ·
[2601.02627] Improved Evidence Extraction and Metrics for Document Inconsistency Detection with LLMs
Llms

[2601.02627] Improved Evidence Extraction and Metrics for Document Inconsistency Detection with LLMs

Abstract page for arXiv paper 2601.02627: Improved Evidence Extraction and Metrics for Document Inconsistency Detection with LLMs

arXiv - AI · 3 min ·

All Content

[2408.14073] Score-based change point detection via tracking the best of infinitely many experts
Machine Learning

[2408.14073] Score-based change point detection via tracking the best of infinitely many experts

This paper presents a novel algorithm for nonparametric online change point detection, utilizing a score-based approach to track the best...

arXiv - Machine Learning · 3 min ·
[2407.12226] Individualized Federated Learning for Traffic Prediction with Error Driven Aggregation
Machine Learning

[2407.12226] Individualized Federated Learning for Traffic Prediction with Error Driven Aggregation

The paper presents NeighborFL, an individualized federated learning approach for traffic prediction that enhances real-time model updates...

arXiv - Machine Learning · 4 min ·
[2602.15189] ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction
Llms

[2602.15189] ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction

ScrapeGraphAI-100k introduces a large-scale dataset for LLM-based web information extraction, addressing limitations of existing datasets...

arXiv - AI · 3 min ·
[2405.21012] IGC-Net for conditional average potential outcome estimation over time
Nlp

[2405.21012] IGC-Net for conditional average potential outcome estimation over time

The paper introduces IGC-Net, a novel neural model designed for estimating conditional average potential outcomes (CAPOs) over time, addr...

arXiv - Machine Learning · 4 min ·
[2602.15138] MB-DSMIL-CL-PL: Scalable Weakly Supervised Ovarian Cancer Subtype Classification and Localisation Using Contrastive and Prototype Learning with Frozen Patch Features
Machine Learning

[2602.15138] MB-DSMIL-CL-PL: Scalable Weakly Supervised Ovarian Cancer Subtype Classification and Localisation Using Contrastive and Prototype Learning with Frozen Patch Features

This paper presents a novel approach for classifying and localizing ovarian cancer subtypes using weakly supervised learning techniques, ...

arXiv - AI · 4 min ·
[2602.15830] Ensemble-size-dependence of deep-learning post-processing methods that minimize an (un)fair score: motivating examples and a proof-of-concept solution
Machine Learning

[2602.15830] Ensemble-size-dependence of deep-learning post-processing methods that minimize an (un)fair score: motivating examples and a proof-of-concept solution

This paper explores the ensemble-size dependence of deep-learning post-processing methods aimed at minimizing unfair scores in ensemble f...

arXiv - Machine Learning · 4 min ·
[2602.15781] Neural Scaling Laws for Boosted Jet Tagging
Llms

[2602.15781] Neural Scaling Laws for Boosted Jet Tagging

The paper explores neural scaling laws for boosted jet tagging in high energy physics, highlighting the relationship between compute reso...

arXiv - Machine Learning · 4 min ·
[2602.15074] Structure-Aware Piano Accompaniment via Style Planning and Dataset-Aligned Pattern Retrieval
Machine Learning

[2602.15074] Structure-Aware Piano Accompaniment via Style Planning and Dataset-Aligned Pattern Retrieval

This paper presents a structure-aware method for generating piano accompaniments using a transformer model for style planning and dataset...

arXiv - AI · 3 min ·
[2602.15738] Beyond Labels: Information-Efficient Human-in-the-Loop Learning using Ranking and Selection Queries
Machine Learning

[2602.15738] Beyond Labels: Information-Efficient Human-in-the-Loop Learning using Ranking and Selection Queries

This article presents a novel human-in-the-loop framework for machine learning that enhances information efficiency by utilizing ranking ...

arXiv - Machine Learning · 4 min ·
[2602.15070] An effective Genetic Programming Hyper-Heuristic for Uncertain Agile Satellite Scheduling
Machine Learning

[2602.15070] An effective Genetic Programming Hyper-Heuristic for Uncertain Agile Satellite Scheduling

This paper presents a Genetic Programming Hyper-Heuristic (GPHH) designed for the Uncertain Agile Earth Observation Satellite Scheduling ...

arXiv - AI · 3 min ·
[2602.15632] Neural-POD: A Plug-and-Play Neural Operator Framework for Infinite-Dimensional Functional Nonlinear Proper Orthogonal Decomposition
Machine Learning

[2602.15632] Neural-POD: A Plug-and-Play Neural Operator Framework for Infinite-Dimensional Functional Nonlinear Proper Orthogonal Decomposition

The Neural-POD framework introduces a novel approach to constructing nonlinear orthogonal basis functions in infinite-dimensional spaces ...

arXiv - Machine Learning · 4 min ·
[2602.15568] Scenario Approach with Post-Design Certification of User-Specified Properties
Data Science

[2602.15568] Scenario Approach with Post-Design Certification of User-Specified Properties

This paper introduces a scenario approach for post-design certification of user-specified properties, enhancing reliability without addit...

arXiv - Machine Learning · 3 min ·
[2602.15552] Latent Regularization in Generative Test Input Generation
Machine Learning

[2602.15552] Latent Regularization in Generative Test Input Generation

This paper explores the effects of latent space regularization on the quality of generative test inputs for deep learning classifiers, de...

arXiv - Machine Learning · 3 min ·
[2602.15042] Combining scEEG and PPG for reliable sleep staging using lightweight wearables
Data Science

[2602.15042] Combining scEEG and PPG for reliable sleep staging using lightweight wearables

This article explores the fusion of single-channel EEG (scEEG) and photoplethysmography (PPG) for improved sleep staging in lightweight w...

arXiv - AI · 4 min ·
[2602.15538] Functional Central Limit Theorem for Stochastic Gradient Descent
Generative Ai

[2602.15538] Functional Central Limit Theorem for Stochastic Gradient Descent

This paper presents a functional central limit theorem for the trajectory of the stochastic gradient descent (SGD) algorithm applied to c...

arXiv - Machine Learning · 3 min ·
[2602.15039] GRACE: an Agentic AI for Particle Physics Experiment Design and Simulation
Robotics

[2602.15039] GRACE: an Agentic AI for Particle Physics Experiment Design and Simulation

The paper presents GRACE, an AI agent designed for autonomous experimental design in particle physics, utilizing simulations to optimize ...

arXiv - AI · 4 min ·
[2602.15036] Transforming Computational Lithography with AC and AI -- Faster, More Accurate, and Energy-efficient
Machine Learning

[2602.15036] Transforming Computational Lithography with AC and AI -- Faster, More Accurate, and Energy-efficient

This article discusses the integration of accelerated computing (AC) and artificial intelligence (AI) in computational lithography, highl...

arXiv - AI · 4 min ·
[2602.15470] The Skeletal Trap: Mapping Spatial Inequality and Ghost Stops in Ankara's Transit Network
Nlp

[2602.15470] The Skeletal Trap: Mapping Spatial Inequality and Ghost Stops in Ankara's Transit Network

This article explores Ankara's public transport crisis, attributing it to structural issues rather than mere inefficiencies. It highlight...

arXiv - Machine Learning · 3 min ·
[2602.13209] LemonadeBench: Evaluating the Economic Intuition of Large Language Models in Simple Markets
Llms

[2602.13209] LemonadeBench: Evaluating the Economic Intuition of Large Language Models in Simple Markets

The paper presents LemonadeBench, a benchmark for assessing the economic intuition of large language models (LLMs) through a simulated le...

arXiv - AI · 3 min ·
[2602.15816] Developing AI Agents with Simulated Data: Why, what, and how?
Machine Learning

[2602.15816] Developing AI Agents with Simulated Data: Why, what, and how?

This article discusses the significance of synthetic data generation through simulation for training AI agents, addressing challenges and...

arXiv - AI · 3 min ·
Previous Page 123 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime