Data Science

Data analysis, statistics, and data engineering

Top This Week

Machine Learning

Ml project user give dataset and I give best model [D] [P]

Tl,dr : suggest me a solution to create a ai ml project where user will give his dataset as input and the project should give best model ...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...

Reddit - Machine Learning · 1 min ·

All Content

[2508.14734] AFABench: A Generic Framework for Benchmarking Active Feature Acquisition
Machine Learning

[2508.14734] AFABench: A Generic Framework for Benchmarking Active Feature Acquisition

AFABench introduces a benchmark framework for Active Feature Acquisition (AFA), addressing the need for standardized evaluation of AFA me...

arXiv - AI · 4 min ·
[2508.06199] Benchmarking Pretrained Molecular Embedding Models For Molecular Representation Learning
Machine Learning

[2508.06199] Benchmarking Pretrained Molecular Embedding Models For Molecular Representation Learning

This article evaluates 25 pretrained molecular embedding models for molecular representation learning, revealing that most show little im...

arXiv - AI · 3 min ·
[2510.05761] Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis
Data Science

[2510.05761] Early Multimodal Prediction of Cross-Lingual Meme Virality on Reddit: A Time-Window Analysis

This article presents a novel approach to predicting the virality of memes on Reddit using a multimodal dataset and advanced machine lear...

arXiv - AI · 4 min ·
[2508.03111] GEDAN: Learning the Edit Costs for Graph Edit Distance
Machine Learning

[2508.03111] GEDAN: Learning the Edit Costs for Graph Edit Distance

The paper presents GEDAN, a novel Graph Neural Network framework that learns edit costs for Graph Edit Distance (GED), addressing limitat...

arXiv - AI · 4 min ·
[2510.04373] JEF-Hinter: Leveraging Offline Knowledge for Improving Web Agents Adaptation
Llms

[2510.04373] JEF-Hinter: Leveraging Offline Knowledge for Improving Web Agents Adaptation

The paper presents JEF-Hinter, a system designed to enhance the adaptation of web agents by leveraging offline knowledge, improving perfo...

arXiv - AI · 4 min ·
[2507.20997] Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition
Machine Learning

[2507.20997] Modular Delta Merging with Orthogonal Constraints: A Scalable Framework for Continual and Reversible Model Composition

The paper presents Modular Delta Merging with Orthogonal Constraints (MDM-OC), a framework for scalable and reversible model composition ...

arXiv - AI · 4 min ·
[2507.11732] Graph Neural Networks Powered by Encoder Embedding for Improved Node Learning
Machine Learning

[2507.11732] Graph Neural Networks Powered by Encoder Embedding for Improved Node Learning

This paper introduces a novel framework for Graph Neural Networks (GNNs) that utilizes a one-hot graph encoder embedding (GEE) to enhance...

arXiv - Machine Learning · 4 min ·
[2507.07390] Learning Collective Variables from BioEmu with Time-Lagged Generation
Machine Learning

[2507.07390] Learning Collective Variables from BioEmu with Time-Lagged Generation

This article presents a novel framework, BioEmu-CV, for automatically learning collective variables (CVs) from molecular dynamics simulat...

arXiv - Machine Learning · 4 min ·
[2509.21896] GenesisGeo: Technical Report
Llms

[2509.21896] GenesisGeo: Technical Report

GenesisGeo presents a novel approach to geometric reasoning by introducing a large-scale dataset and a multi-task training paradigm that ...

arXiv - AI · 3 min ·
[2507.01781] Symbolic Branch Networks: Tree-Inherited Neural Models for Interpretable Multiclass Classification
Machine Learning

[2507.01781] Symbolic Branch Networks: Tree-Inherited Neural Models for Interpretable Multiclass Classification

This article presents Symbolic Branch Networks (SBNs), a novel neural model that integrates decision tree structures for enhanced interpr...

arXiv - AI · 4 min ·
[2509.19800] Analysis of approximate linear programming solution to Markov decision problem with log barrier function
Llms

[2509.19800] Analysis of approximate linear programming solution to Markov decision problem with log barrier function

This paper explores a novel approach to solving Markov decision problems (MDPs) using approximate linear programming with a log barrier f...

arXiv - AI · 4 min ·
[2507.00390] MoNE: Replacing Redundant Experts with Lightweight Novices for Structured Pruning of MoE
Llms

[2507.00390] MoNE: Replacing Redundant Experts with Lightweight Novices for Structured Pruning of MoE

The paper introduces MoNE, a novel method for structured pruning of Mixture-of-Experts (MoE) models, replacing redundant experts with lig...

arXiv - Machine Learning · 4 min ·
[2509.03830] Decoding Tourist Perception in Historic Urban Quarters with Multimodal Social Media Data: An AI-Based Framework and Evidence from Shanghai
Data Science

[2509.03830] Decoding Tourist Perception in Historic Urban Quarters with Multimodal Social Media Data: An AI-Based Framework and Evidence from Shanghai

This study presents an AI-based framework to analyze tourist perceptions in historic urban quarters of Shanghai, utilizing multimodal soc...

arXiv - AI · 4 min ·
[2506.16824] Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs
Llms

[2506.16824] Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs

This article explores the use of large language models (LLMs) to identify new research directions in materials science by analyzing scien...

arXiv - Machine Learning · 4 min ·
[2506.08604] Physics vs Distributions: Pareto Optimal Flow Matching with Physics Constraints
Machine Learning

[2506.08604] Physics vs Distributions: Pareto Optimal Flow Matching with Physics Constraints

This article presents a novel method, Physics-Based Flow Matching (PBFM), which integrates physical constraints into generative modeling,...

arXiv - AI · 4 min ·
[2504.18880] Reshaping MOFs text mining with a dynamic multi-agents framework of large language model
Llms

[2504.18880] Reshaping MOFs text mining with a dynamic multi-agents framework of large language model

The paper presents MOFh6, a large language model system that enhances the extraction of synthesis conditions for metal-organic frameworks...

arXiv - AI · 4 min ·
[2505.19193] SuperMAN: Interpretable and Expressive Networks over Temporally Sparse Heterogeneous Data
Nlp

[2505.19193] SuperMAN: Interpretable and Expressive Networks over Temporally Sparse Heterogeneous Data

The paper presents SuperMAN, a framework designed for learning from temporally sparse and heterogeneous data, enhancing interpretability ...

arXiv - Machine Learning · 4 min ·
[2505.17517] The Spacetime of Diffusion Models: An Information Geometry Perspective
Machine Learning

[2505.17517] The Spacetime of Diffusion Models: An Information Geometry Perspective

This paper presents a novel geometric perspective on diffusion models, revealing flaws in traditional decoding methods and proposing a ne...

arXiv - Machine Learning · 4 min ·
[2505.11139] Covariance Density Neural Networks
Machine Learning

[2505.11139] Covariance Density Neural Networks

The paper introduces Covariance Density Neural Networks (CDNN), enhancing graph neural networks by using the sample covariance matrix as ...

arXiv - Machine Learning · 4 min ·
[2505.11111] FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation
Machine Learning

[2505.11111] FairSHAP: Preprocessing for Fairness Through Attribution-Based Data Augmentation

FairSHAP introduces a novel preprocessing framework that utilizes Shapley value attribution to enhance fairness in machine learning model...

arXiv - AI · 4 min ·
Previous Page 69 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime