Data Science

Data analysis, statistics, and data engineering

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 21 minutes ago

Data Science

AI Venture Capital Boom: 2025 Funding Shift & Investment Strategy - News and Statistics

Analysis of AI dominance in 2025 venture capital, its effects on market valuations, and strategic considerations for investors.

AI News - General · 7 min · about 2 hours ago

Machine Learning

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

Hey all, I recently built an end-to-end fraud detection project using a large banking dataset: Trained an XGBoost model Used Databricks f...

Reddit - Machine Learning · 1 min · about 6 hours ago

All Content

Llms

[2602.18481] AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models

The paper introduces AlphaForgeBench, a framework for evaluating trading strategies using Large Language Models (LLMs), addressing issues...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19345] Smooth Gate Functions for Soft Advantage Policy Optimization

This paper explores Smooth Gate Functions for Soft Advantage Policy Optimization, enhancing the stability of large language model trainin...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18478] ZUNA: Flexible EEG Superresolution with Position-Aware Diffusion Autoencoders

The paper presents ZUNA, a 380M-parameter masked diffusion autoencoder designed for EEG signal superresolution and channel infilling, dem...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.18479] AgentCAT: An LLM Agent for Extracting and Analyzing Catalytic Reaction Data from Chemical Engineering Literature

AgentCAT is a large language model designed to extract and analyze catalytic reaction data from chemical engineering literature, addressi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19331] Partial Soft-Matching Distance for Neural Representational Comparison with Partial Unit Correspondence

The paper introduces a Partial Soft-Matching Distance metric for neural representational comparison, enhancing robustness against noise a...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.19330] CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis

The paper introduces CTS-Bench, a benchmark suite for evaluating graph coarsening trade-offs in Graph Neural Networks (GNNs) for Clock Tr...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18476] BioLM-Score: Language-Prior Conditioned Probabilistic Geometric Potentials for Protein-Ligand Scoring

BioLM-Score introduces a novel protein-ligand scoring model that enhances efficiency and interpretability in drug design by integrating g...

arXiv - Machine Learning · 4 min · about 1 month ago

Generative Ai

[2602.19289] AdsorbFlow: energy-conditioned flow matching enables fast and realistic adsorbate placement

The paper introduces AdsorbFlow, a deterministic generative model that enhances the efficiency of adsorbate placement on catalytic surfac...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19237] Evaluating SAP RPT-1 for Enterprise Business Process Prediction: In-Context Learning vs. Traditional Machine Learning on Structured SAP Data

This article evaluates SAP's RPT-1 model for enterprise business process prediction, comparing its performance against traditional machin...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19253] Alternating Bi-Objective Optimization for Explainable Neuro-Fuzzy Systems

This article presents X-ANFIS, a novel optimization scheme for explainable neuro-fuzzy systems that balances accuracy and explainability ...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19215] Understanding Empirical Unlearning with Combinatorial Interpretability

This article explores the concept of empirical unlearning in machine learning, focusing on how knowledge can persist in models even after...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19208] How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization

This article presents DynaMO, a novel framework for optimizing reinforcement learning with verifiable rewards, addressing key challenges ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18462] Assessing the Reliability of Persona-Conditioned LLMs as Synthetic Survey Respondents

This article evaluates the reliability of persona-conditioned large language models (LLMs) as synthetic survey respondents, revealing tha...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19207] HybridFL: A Federated Learning Approach for Financial Crime Detection

The paper presents HybridFL, a federated learning approach designed for financial crime detection, which integrates horizontal and vertic...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19169] Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

The paper introduces Virtual Parameter Sharpening (VPS), a novel technique for enhancing inference-time reasoning in transformer models t...

arXiv - AI · 3 min · about 1 month ago

Robotics

[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research

The article presents a novel evaluation framework for mechanistic interpretability research, utilizing AI agents to enhance research rigo...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

This article examines the impact of AI-generated search summaries on website traffic, specifically analyzing how Google's AI Overviews af...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

The paper presents TICL, a novel method for causal structure learning from interventional data, enhancing generalization across diverse s...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.18453] LLM-Assisted Replication for Quantitative Social Science

The paper presents an LLM-based system designed to replicate statistical analyses in quantitative social science, addressing the replicat...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18451] Developing a Multi-Agent System to Generate Next Generation Science Assessments with Evidence-Centered Design

This article discusses the development of a Multi-Agent System (MAS) that automates the generation of science assessments aligned with th...

arXiv - AI · 4 min · about 1 month ago

Previous Page 78 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Data Science

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

AI Venture Capital Boom: 2025 Funding Shift & Investment Strategy - News and Statistics

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

All Content

[2602.18481] AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models

[2602.19345] Smooth Gate Functions for Soft Advantage Policy Optimization

[2602.18478] ZUNA: Flexible EEG Superresolution with Position-Aware Diffusion Autoencoders

[2602.18479] AgentCAT: An LLM Agent for Extracting and Analyzing Catalytic Reaction Data from Chemical Engineering Literature

[2602.19331] Partial Soft-Matching Distance for Neural Representational Comparison with Partial Unit Correspondence

[2602.19330] CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis

[2602.18476] BioLM-Score: Language-Prior Conditioned Probabilistic Geometric Potentials for Protein-Ligand Scoring

[2602.19289] AdsorbFlow: energy-conditioned flow matching enables fast and realistic adsorbate placement

[2602.19237] Evaluating SAP RPT-1 for Enterprise Business Process Prediction: In-Context Learning vs. Traditional Machine Learning on Structured SAP Data

[2602.19253] Alternating Bi-Objective Optimization for Explainable Neuro-Fuzzy Systems

[2602.19215] Understanding Empirical Unlearning with Combinatorial Interpretability

[2602.19208] How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization

[2602.18462] Assessing the Reliability of Persona-Conditioned LLMs as Synthetic Survey Respondents

[2602.19207] HybridFL: A Federated Learning Approach for Financial Crime Detection

[2602.19169] Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research

[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

[2602.18453] LLM-Assisted Replication for Quantitative Social Science

[2602.18451] Developing a Multi-Agent System to Generate Next Generation Science Assessments with Evidence-Centered Design

Related Topics

Stay updated with AI News