Data Science

Data analysis, statistics, and data engineering

Top This Week

Machine Learning

What image/video training data is hardest to find right now? [R]

I'm building a crowdsourced photo collection platform (contributors take photos with smartphones, we auto-label with YOLO/CLIP + enrich w...

Reddit - Machine Learning · 1 min · about 5 hours ago

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 7 hours ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 7 hours ago

All Content

AI Startups

[2602.13784] Comparables XAI: Faithful Example-based AI Explanations with Counterfactual Trace Adjustments

The paper introduces Comparables XAI, a method for providing faithful, example-based AI explanations using counterfactual trace adjustmen...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13325] Graph neural networks uncover structure and functions underlying the activity of simulated neural assemblies

This article discusses how graph neural networks can effectively analyze and interpret the dynamics of simulated neural assemblies, revea...

arXiv - Machine Learning · 3 min · about 2 months ago

Data Science

[2602.13322] Diagnostic Benchmarks for Invariant Learning Dynamics: Empirical Validation of the Eidos Architecture

This paper presents the PolyShapes-Ideal (PSI) dataset and diagnostic benchmarks for evaluating topological invariance in machine learnin...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.13758] OmniScience: A Large-scale Multi-modal Dataset for Scientific Image Understanding

The paper introduces OmniScience, a large-scale multi-modal dataset designed to enhance scientific image understanding in AI models, addr...

arXiv - AI · 4 min · about 2 months ago

NLP

[2602.13704] Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search

The paper presents Pailitao-VL, a multi-modal retrieval system designed for real-time industrial search, addressing key challenges in ret...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.13685] AuTAgent: A Reinforcement Learning Framework for Tool-Augmented Audio Reasoning

AuTAgent introduces a reinforcement learning framework designed to enhance audio reasoning by effectively integrating external tools, imp...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13297] Conditional Generative Models for High-Resolution Range Profiles: Capturing Geometry-Driven Trends in a Large-Scale Maritime Dataset

This paper explores the use of conditional generative models to synthesize high-resolution range profiles (HRRPs) for maritime surveillan...

arXiv - Machine Learning · 3 min · about 2 months ago

Computer Vision

[2602.13681] An Ensemble Learning Approach towards Waste Segmentation in Cluttered Environment

This article presents an Ensemble Learning approach to enhance waste segmentation accuracy in cluttered environments, crucial for improvi...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.13296] MFN Decomposition and Related Metrics for High-Resolution Range Profiles Generative Models

This paper presents a novel approach to evaluating high-resolution range profile (HRRP) data using MFN decomposition, addressing challeng...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13288] Benchmarking Anomaly Detection Across Heterogeneous Cloud Telemetry Datasets

This paper evaluates various deep learning models for anomaly detection across multiple cloud telemetry datasets, highlighting the import...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.13662] LeafNet: A Large-Scale Dataset and Comprehensive Benchmark for Foundational Vision-Language Understanding of Plant Diseases

LeafNet introduces a large-scale dataset and benchmark for evaluating vision-language models in plant disease diagnosis, highlighting sig...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.13650] KorMedMCQA-V: A Multimodal Benchmark for Evaluating Vision-Language Models on the Korean Medical Licensing Examination

The article presents KorMedMCQA-V, a benchmark dataset for evaluating vision-language models on the Korean Medical Licensing Examination,...

arXiv - AI · 4 min · about 2 months ago

Computer Vision

[2602.13588] Two-Stream Interactive Joint Learning of Scene Parsing and Geometric Vision Tasks

The paper presents TwInS, a novel framework for joint learning of scene parsing and geometric vision tasks, inspired by the human visual ...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.15029] Symmetry in language statistics shapes the geometry of model representations

This article explores how symmetry in language statistics influences the geometric representation of models in machine learning, particul...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15022] Rethinking Diffusion Models with Symmetries through Canonicalization with Applications to Molecular Graph Generation

This paper explores a novel approach to diffusion models by emphasizing canonicalization to enhance molecular graph generation, demonstra...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15008] Efficient Sampling with Discrete Diffusion Models: Sharp and Adaptive Guarantees

This paper explores the efficiency of discrete diffusion models in sampling, establishing sharp convergence guarantees and improving exis...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.15004] PDE foundation models are skillful AI weather emulators for the Martian atmosphere

This article presents a novel approach using AI foundation models to predict weather patterns in the Martian atmosphere, demonstrating si...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.14997] Spectral Convolution on Orbifolds for Geometric Deep Learning

This paper introduces spectral convolution on orbifolds, expanding geometric deep learning (GDL) techniques to non-Euclidean data structu...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.14977] MacroGuide: Topological Guidance for Macrocycle Generation

The paper introduces MacroGuide, a novel diffusion guidance mechanism that enhances the generation of macrocycles in molecular modeling, ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.14983] Orthogonalized Multimodal Contrastive Learning with Asymmetric Masking for Structured Representations

The paper presents COrAL, a novel framework for multimodal contrastive learning that effectively separates redundant, unique, and synergi...

arXiv - Machine Learning · 4 min · about 2 months ago

