Data Science

Data analysis, statistics, and data engineering

Top This Week

Machine Learning

What image/video training data is hardest to find right now? [R]

I'm building a crowdsourced photo collection platform (contributors take photos with smartphones, we auto-label with YOLO/CLIP + enrich w...

Reddit - Machine Learning · 1 min
UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min
Accelerating science with AI and simulations
Machine Learning

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min

All Content

[2602.13784] Comparables XAI: Faithful Example-based AI Explanations with Counterfactual Trace Adjustments
AI Startups

The paper introduces Comparables XAI, a method for providing faithful, example-based AI explanations using counterfactual trace adjustmen...

arXiv - AI · 3 min
[2602.13325] Graph neural networks uncover structure and functions underlying the activity of simulated neural assemblies
Machine Learning

This article discusses how graph neural networks can effectively analyze and interpret the dynamics of simulated neural assemblies, revea...

arXiv - Machine Learning · 3 min
[2602.13322] Diagnostic Benchmarks for Invariant Learning Dynamics: Empirical Validation of the Eidos Architecture
Data Science

This paper presents the PolyShapes-Ideal (PSI) dataset and diagnostic benchmarks for evaluating topological invariance in machine learnin...

arXiv - Machine Learning · 3 min
[2602.13758] OmniScience: A Large-scale Multi-modal Dataset for Scientific Image Understanding
LLMs

The paper introduces OmniScience, a large-scale multi-modal dataset designed to enhance scientific image understanding in AI models, addr...

arXiv - AI · 4 min
[2602.13704] Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search
NLP

The paper presents Pailitao-VL, a multi-modal retrieval system designed for real-time industrial search, addressing key challenges in ret...

arXiv - AI · 4 min
[2602.13685] AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning
LLMs

AuTAgent introduces a reinforcement learning framework designed to enhance audio reasoning by effectively integrating external tools, imp...

arXiv - AI · 3 min
[2602.13297] Conditional Generative Models for High-Resolution Range Profiles: Capturing Geometry-Driven Trends in a Large-Scale Maritime Dataset
Machine Learning

This paper explores the use of conditional generative models to synthesize high-resolution range profiles (HRRPs) for maritime surveillan...

arXiv - Machine Learning · 3 min
[2602.13681] An Ensemble Learning Approach towards Waste Segmentation in Cluttered Environment
Computer Vision

This article presents an Ensemble Learning approach to enhance waste segmentation accuracy in cluttered environments, crucial for improvi...

arXiv - AI · 4 min
[2602.13296] MFN Decomposition and Related Metrics for High-Resolution Range Profiles Generative Models
Machine Learning

This paper presents a novel approach to evaluating high-resolution range profile (HRRP) data using MFN decomposition, addressing challeng...

arXiv - Machine Learning · 3 min
[2602.13288] Benchmarking Anomaly Detection Across Heterogeneous Cloud Telemetry Datasets
Machine Learning

This paper evaluates various deep learning models for anomaly detection across multiple cloud telemetry datasets, highlighting the import...

arXiv - Machine Learning · 4 min
[2602.13662] LeafNet: A Large-Scale Dataset and Comprehensive Benchmark for Foundational Vision-Language Understanding of Plant Diseases
LLMs

LeafNet introduces a large-scale dataset and benchmark for evaluating vision-language models in plant disease diagnosis, highlighting sig...

arXiv - AI · 4 min
[2602.13650] KorMedMCQA-V: A Multimodal Benchmark for Evaluating Vision-Language Models on the Korean Medical Licensing Examination
LLMs

The article presents KorMedMCQA-V, a benchmark dataset for evaluating vision-language models on the Korean Medical Licensing Examination,...

arXiv - AI · 4 min
[2602.13588] Two-Stream Interactive Joint Learning of Scene Parsing and Geometric Vision Tasks
Computer Vision

The paper presents TwInS, a novel framework for joint learning of scene parsing and geometric vision tasks, inspired by the human visual ...

arXiv - AI · 4 min
[2602.15029] Symmetry in language statistics shapes the geometry of model representations
LLMs

This article explores how symmetry in language statistics influences the geometric representation of models in machine learning, particul...

arXiv - Machine Learning · 4 min
[2602.15022] Rethinking Diffusion Models with Symmetries through Canonicalization with Applications to Molecular Graph Generation
Machine Learning

This paper explores a novel approach to diffusion models by emphasizing canonicalization to enhance molecular graph generation, demonstra...

arXiv - AI · 4 min
[2602.15008] Efficient Sampling with Discrete Diffusion Models: Sharp and Adaptive Guarantees
Machine Learning

This paper explores the efficiency of discrete diffusion models in sampling, establishing sharp convergence guarantees and improving exis...

arXiv - Machine Learning · 4 min
[2602.15004] PDE foundation models are skillful AI weather emulators for the Martian atmosphere
LLMs

This article presents a novel approach using AI foundation models to predict weather patterns in the Martian atmosphere, demonstrating si...

arXiv - Machine Learning · 4 min
[2602.14997] Spectral Convolution on Orbifolds for Geometric Deep Learning
Machine Learning

This paper introduces spectral convolution on orbifolds, expanding geometric deep learning (GDL) techniques to non-Euclidean data structu...

arXiv - AI · 3 min
[2602.14977] MacroGuide: Topological Guidance for Macrocycle Generation
Machine Learning

The paper introduces MacroGuide, a novel diffusion guidance mechanism that enhances the generation of macrocycles in molecular modeling, ...

arXiv - Machine Learning · 3 min
[2602.14983] Orthogonalized Multimodal Contrastive Learning with Asymmetric Masking for Structured Representations
Machine Learning

The paper presents COrAL, a novel framework for multimodal contrastive learning that effectively separates redundant, unique, and synergi...

arXiv - Machine Learning · 4 min