Data Science

Data analysis, statistics, and data engineering

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
AI Venture Capital Boom: 2025 Funding Shift & Investment Strategy - News and Statistics
Data Science

AI Venture Capital Boom: 2025 Funding Shift & Investment Strategy - News and Statistics

Analysis of AI dominance in 2025 venture capital, its effects on market valuations, and strategic considerations for investors.

AI News - General · 7 min ·
Machine Learning

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

Hey all, I recently built an end-to-end fraud detection project using a large banking dataset: Trained an XGBoost model Used Databricks f...

Reddit - Machine Learning · 1 min ·

All Content

[2602.18481] AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models
Llms

[2602.18481] AlphaForgeBench: Benchmarking End-to-End Trading Strategy Design with Large Language Models

The paper introduces AlphaForgeBench, a framework for evaluating trading strategies using Large Language Models (LLMs), addressing issues...

arXiv - AI · 4 min ·
[2602.19345] Smooth Gate Functions for Soft Advantage Policy Optimization
Llms

[2602.19345] Smooth Gate Functions for Soft Advantage Policy Optimization

This paper explores Smooth Gate Functions for Soft Advantage Policy Optimization, enhancing the stability of large language model trainin...

arXiv - Machine Learning · 3 min ·
[2602.18478] ZUNA: Flexible EEG Superresolution with Position-Aware Diffusion Autoencoders
Machine Learning

[2602.18478] ZUNA: Flexible EEG Superresolution with Position-Aware Diffusion Autoencoders

The paper presents ZUNA, a 380M-parameter masked diffusion autoencoder designed for EEG signal superresolution and channel infilling, dem...

arXiv - Machine Learning · 3 min ·
[2602.18479] AgentCAT: An LLM Agent for Extracting and Analyzing Catalytic Reaction Data from Chemical Engineering Literature
Llms

[2602.18479] AgentCAT: An LLM Agent for Extracting and Analyzing Catalytic Reaction Data from Chemical Engineering Literature

AgentCAT is a large language model designed to extract and analyze catalytic reaction data from chemical engineering literature, addressi...

arXiv - AI · 4 min ·
[2602.19331] Partial Soft-Matching Distance for Neural Representational Comparison with Partial Unit Correspondence
Machine Learning

[2602.19331] Partial Soft-Matching Distance for Neural Representational Comparison with Partial Unit Correspondence

The paper introduces a Partial Soft-Matching Distance metric for neural representational comparison, enhancing robustness against noise a...

arXiv - Machine Learning · 4 min ·
[2602.19330] CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis
Machine Learning

[2602.19330] CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis

The paper introduces CTS-Bench, a benchmark suite for evaluating graph coarsening trade-offs in Graph Neural Networks (GNNs) for Clock Tr...

arXiv - Machine Learning · 4 min ·
[2602.18476] BioLM-Score: Language-Prior Conditioned Probabilistic Geometric Potentials for Protein-Ligand Scoring
Machine Learning

[2602.18476] BioLM-Score: Language-Prior Conditioned Probabilistic Geometric Potentials for Protein-Ligand Scoring

BioLM-Score introduces a novel protein-ligand scoring model that enhances efficiency and interpretability in drug design by integrating g...

arXiv - Machine Learning · 4 min ·
[2602.19289] AdsorbFlow: energy-conditioned flow matching enables fast and realistic adsorbate placement
Generative Ai

[2602.19289] AdsorbFlow: energy-conditioned flow matching enables fast and realistic adsorbate placement

The paper introduces AdsorbFlow, a deterministic generative model that enhances the efficiency of adsorbate placement on catalytic surfac...

arXiv - Machine Learning · 4 min ·
[2602.19237] Evaluating SAP RPT-1 for Enterprise Business Process Prediction: In-Context Learning vs. Traditional Machine Learning on Structured SAP Data
Llms

[2602.19237] Evaluating SAP RPT-1 for Enterprise Business Process Prediction: In-Context Learning vs. Traditional Machine Learning on Structured SAP Data

This article evaluates SAP's RPT-1 model for enterprise business process prediction, comparing its performance against traditional machin...

arXiv - AI · 4 min ·
[2602.19253] Alternating Bi-Objective Optimization for Explainable Neuro-Fuzzy Systems
Machine Learning

[2602.19253] Alternating Bi-Objective Optimization for Explainable Neuro-Fuzzy Systems

This article presents X-ANFIS, a novel optimization scheme for explainable neuro-fuzzy systems that balances accuracy and explainability ...

arXiv - Machine Learning · 3 min ·
[2602.19215] Understanding Empirical Unlearning with Combinatorial Interpretability
Llms

[2602.19215] Understanding Empirical Unlearning with Combinatorial Interpretability

This article explores the concept of empirical unlearning in machine learning, focusing on how knowledge can persist in models even after...

arXiv - Machine Learning · 3 min ·
[2602.19208] How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization
Llms

[2602.19208] How to Allocate, How to Learn? Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization

This article presents DynaMO, a novel framework for optimizing reinforcement learning with verifiable rewards, addressing key challenges ...

arXiv - AI · 4 min ·
[2602.18462] Assessing the Reliability of Persona-Conditioned LLMs as Synthetic Survey Respondents
Llms

[2602.18462] Assessing the Reliability of Persona-Conditioned LLMs as Synthetic Survey Respondents

This article evaluates the reliability of persona-conditioned large language models (LLMs) as synthetic survey respondents, revealing tha...

arXiv - AI · 3 min ·
[2602.19207] HybridFL: A Federated Learning Approach for Financial Crime Detection
Machine Learning

[2602.19207] HybridFL: A Federated Learning Approach for Financial Crime Detection

The paper presents HybridFL, a federated learning approach designed for financial crime detection, which integrates horizontal and vertic...

arXiv - AI · 3 min ·
[2602.19169] Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement
Machine Learning

[2602.19169] Virtual Parameter Sharpening: Dynamic Low-Rank Perturbations for Inference-Time Reasoning Enhancement

The paper introduces Virtual Parameter Sharpening (VPS), a novel technique for enhancing inference-time reasoning in transformer models t...

arXiv - AI · 3 min ·
[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research
Robotics

[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research

The article presents a novel evaluation framework for mechanistic interpretability research, utilizing AI agents to enhance research rigo...

arXiv - Machine Learning · 3 min ·
[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia
Llms

[2602.18455] Impact of AI Search Summaries on Website Traffic: Evidence from Google AI Overviews and Wikipedia

This article examines the impact of AI-generated search summaries on website traffic, specifically analyzing how Google's AI Overviews af...

arXiv - AI · 4 min ·
[2602.19131] Test-Time Learning of Causal Structure from Interventional Data
Machine Learning

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

The paper presents TICL, a novel method for causal structure learning from interventional data, enhancing generalization across diverse s...

arXiv - AI · 3 min ·
[2602.18453] LLM-Assisted Replication for Quantitative Social Science
Llms

[2602.18453] LLM-Assisted Replication for Quantitative Social Science

The paper presents an LLM-based system designed to replicate statistical analyses in quantitative social science, addressing the replicat...

arXiv - AI · 3 min ·
[2602.18451] Developing a Multi-Agent System to Generate Next Generation Science Assessments with Evidence-Centered Design
Machine Learning

[2602.18451] Developing a Multi-Agent System to Generate Next Generation Science Assessments with Evidence-Centered Design

This article discusses the development of a Multi-Agent System (MAS) that automates the generation of science assessments aligned with th...

arXiv - AI · 4 min ·
Previous Page 78 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime