Data Science

Data analysis, statistics, and data engineering

Top This Week

Machine Learning

What image/video training data is hardest to find right now? [R]

I'm building a crowdsourced photo collection platform (contributors take photos with smartphones, we auto-label with YOLO/CLIP + enrich w...

Reddit - Machine Learning · 1 min · about 2 hours ago

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 3 hours ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 3 hours ago

All Content

Machine Learning

[2602.14642] GenPANIS: A Latent-Variable Generative Framework for Forward and Inverse PDE Problems in Multiphase Media

GenPANIS introduces a generative framework for solving forward and inverse PDE problems in multiphase media, enhancing accuracy and effic...

arXiv - Machine Learning · 4 min · about 2 months ago

Data Science

[2602.14641] Quantum Reservoir Computing with Neutral Atoms on a Small, Complex, Medical Dataset

This paper explores Quantum Reservoir Computing (QRC) using neutral atoms to enhance predictions in medical datasets, demonstrating impro...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14607] A Bayesian Approach to Low-Discrepancy Subset Selection

This paper presents a Bayesian approach to low-discrepancy subset selection, addressing its NP-hardness and proposing a Bayesian Optimiza...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.14408] Feature Recalibration Based Olfactory-Visual Multimodal Model for Fine-Grained Rice Deterioration Detection

The paper presents a novel olfactory-visual multimodal model for detecting fine-grained rice deterioration, achieving high accuracy and s...

arXiv - AI · 3 min · about 2 months ago

Data Science

[2602.14406] TruthStance: An Annotated Dataset of Conversations on Truth Social

TruthStance introduces a comprehensive dataset of conversations from Truth Social, focusing on argument mining and stance detection, with...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.14478] Constrained and Composite Sampling via Proximal Sampler

This paper presents a novel approach to constrained and composite sampling using a proximal sampler, addressing challenges in enforcing f...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14472] Frequentist Regret Analysis of Gaussian Process Thompson Sampling via Fractional Posteriors

This paper presents a frequentist regret analysis of Gaussian Process Thompson Sampling (GP-TS) using fractional posteriors, offering a u...

arXiv - Machine Learning · 3 min · about 2 months ago

Computer Vision

[2602.14365] Image-based Joint-level Detection for Inflammation in Rheumatoid Arthritis from Small and Imbalanced Data

This paper presents a novel framework for detecting joint inflammation in rheumatoid arthritis using RGB images, addressing challenges li...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.14440] CAIRO: Decoupling Order from Scale in Regression

The paper presents CAIRO, a novel framework that separates the learning of ordering from scale in regression analysis, enhancing robustne...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.14342] High-accuracy log-concave sampling with stochastic queries

This paper presents a method for high-accuracy log-concave sampling using stochastic queries, achieving improved efficiency in query comp...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.14358] High Precision Audience Expansion via Extreme Classification in a Two-Sided Marketplace

This paper discusses a novel approach to audience expansion in a two-sided marketplace, focusing on high precision retrieval methods for ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.14280] Fast Compute for ML Optimization

The paper presents the Scale Mixture EM (SM-EM) algorithm for optimizing machine learning losses, demonstrating significant performance i...

arXiv - Machine Learning · 3 min · about 2 months ago

Data Science

[2602.14285] FMMD: A multimodal open peer review dataset based on F1000Research

The paper introduces FMMD, a multimodal open peer review dataset from F1000Research, addressing limitations in current datasets by integr...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.14257] AD-Bench: A Real-World, Trajectory-Aware Advertising Analytics Benchmark for LLM Agents

The paper introduces AD-Bench, a benchmark for evaluating Large Language Model (LLM) agents in real-world advertising analytics, highligh...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.14239] A Hybrid TGN-SEAL Model for Dynamic Graph Link Prediction

The paper presents a Hybrid TGN-SEAL model aimed at improving link prediction in dynamic graphs, particularly in sparse networks, by inte...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.14201] GeoEyes: On-Demand Visual Focusing for Evidence-Grounded Understanding of Ultra-High-Resolution Remote Sensing Imagery

GeoEyes introduces a novel framework for enhancing visual understanding in ultra-high-resolution remote sensing imagery, addressing limit...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.14177] Towards Spatial Transcriptomics-driven Pathology Foundation Models

This article presents Spatial Expression-Aligned Learning (SEAL), a framework that integrates spatial transcriptomics with pathology mode...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.14030] MC$^2$Mark: Distortion-Free Multi-Bit Watermarking for Long Messages

MC$^2$Mark introduces a novel watermarking framework that ensures reliable embedding of long messages in generated text while maintaining...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.14029] Why Self-Training Helps and Hurts: Denoising vs. Signal Forgetting

This paper investigates the dual effects of iterative self-training in machine learning, focusing on the balance between denoising and si...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14020] Computable Bernstein Certificates for Cross-Fitted Clipped Covariance Estimation

This article presents a novel approach to covariance estimation using computable Bernstein certificates, addressing challenges posed by h...

arXiv - Machine Learning · 3 min · about 2 months ago
