Data Science

Data analysis, statistics, and data engineering

Top This Week

Machine Learning

Free tool I built to score dataset quality (LQS) — feedback welcome [D]

We built a Label Quality Score (LQS) system for our dataset marketplace and opened it up as a free standalone tool. Upload a dataset → ge...

Reddit - Machine Learning · 1 min ·
Data Science

"OpenAI quietly removed the one safety mechanism that could shut the whole thing down — and nobody is talking about it"

OpenAI was founded as a nonprofit for one specific reason — to ensure AI development couldn't be hijacked by profit motives. Their origin...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[P] citracer: a small CLI tool to trace where a concept comes from in a citation graph

Hi all, I made a small tool that I've been using for my own literature reviews and figured I'd share in case it's useful to anyone else. ...

Reddit - Machine Learning · 1 min ·

All Content

[2602.16435] Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning
Robotics

[2602.16435] Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

The paper presents CAFE, a novel framework for automated feature engineering that combines causal discovery with multi-agent reinforcemen...

arXiv - Machine Learning · 4 min ·
[2602.16264] Prediction of Major Solar Flares Using Interpretable Class-dependent Reward Framework with Active Region Magnetograms and Domain Knowledge
Machine Learning

[2602.16264] Prediction of Major Solar Flares Using Interpretable Class-dependent Reward Framework with Active Region Magnetograms and Domain Knowledge

This article presents a novel supervised classification framework for predicting major solar flares using class-dependent rewards and dee...

arXiv - Machine Learning · 4 min ·
[2602.16236] Online Prediction of Stochastic Sequences with High Probability Regret Bounds
Machine Learning

[2602.16236] Online Prediction of Stochastic Sequences with High Probability Regret Bounds

This paper explores high-probability regret bounds for online prediction of stochastic sequences, proposing new bounds that improve upon ...

arXiv - Machine Learning · 3 min ·
[2602.16224] Amortized Predictability-aware Training Framework for Time Series Forecasting and Classification
Machine Learning

[2602.16224] Amortized Predictability-aware Training Framework for Time Series Forecasting and Classification

The paper presents an Amortized Predictability-aware Training Framework (APTF) designed to enhance time series forecasting and classifica...

arXiv - Machine Learning · 3 min ·
[2602.16220] SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting
Machine Learning

[2602.16220] SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting

The paper presents SEMixer, a novel multiscale model designed for long-term time series forecasting, addressing challenges in modeling te...

arXiv - Machine Learning · 3 min ·
[2602.16173] Learning Personalized Agents from Human Feedback
Machine Learning

[2602.16173] Learning Personalized Agents from Human Feedback

The paper presents a framework, Personalized Agents from Human Feedback (PAHF), which enables AI agents to adapt to individual user prefe...

arXiv - Machine Learning · 4 min ·
[2602.16105] GPSBench: Do Large Language Models Understand GPS Coordinates?
Llms

[2602.16105] GPSBench: Do Large Language Models Understand GPS Coordinates?

The paper introduces GPSBench, a dataset designed to evaluate the geospatial reasoning capabilities of large language models (LLMs) using...

arXiv - AI · 3 min ·
[2602.16218] Bayesian Quadrature: Gaussian Processes for Integration
Machine Learning

[2602.16218] Bayesian Quadrature: Gaussian Processes for Integration

This article presents a comprehensive survey on Bayesian quadrature, a probabilistic approach to numerical integration, detailing its mat...

arXiv - Machine Learning · 3 min ·
[2602.16217] Multi-Class Boundary Extraction from Implicit Representations
Machine Learning

[2602.16217] Multi-Class Boundary Extraction from Implicit Representations

This paper presents a novel algorithm for multi-class boundary extraction from implicit representations, emphasizing topological correctn...

arXiv - Machine Learning · 3 min ·
[2602.16050] Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination
Llms

[2602.16050] Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination

This article evaluates the performance of the January Mirror, an evidence-grounded clinical reasoning system, against leading large langu...

arXiv - AI · 4 min ·
[2602.16216] UCTECG-Net: Uncertainty-aware Convolution Transformer ECG Network for Arrhythmia Detection
Machine Learning

[2602.16216] UCTECG-Net: Uncertainty-aware Convolution Transformer ECG Network for Arrhythmia Detection

The paper presents UCTECG-Net, an innovative uncertainty-aware convolution transformer network for improved ECG classification, achieving...

arXiv - AI · 3 min ·
[2602.16213] Graph neural network for colliding particles with an application to sea ice floe modeling
Machine Learning

[2602.16213] Graph neural network for colliding particles with an application to sea ice floe modeling

This article presents a novel Graph Neural Network approach for modeling sea ice dynamics, focusing on particle collisions and data assim...

arXiv - AI · 3 min ·
[2602.16209] Geometric Neural Operators via Lie Group-Constrained Latent Dynamics
Machine Learning

[2602.16209] Geometric Neural Operators via Lie Group-Constrained Latent Dynamics

This paper presents a novel approach to neural operators, addressing instability in multi-layer iterations and long-horizon predictions b...

arXiv - AI · 4 min ·
[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection
Robotics

[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

This paper explores optimization instability in autonomous workflows for clinical symptom detection, revealing critical failure modes and...

arXiv - AI · 4 min ·
[2602.16204] Linked Data Classification using Neurochaos Learning
Machine Learning

[2602.16204] Linked Data Classification using Neurochaos Learning

This article explores the application of Neurochaos Learning (NL) to linked data classification, demonstrating its effectiveness on knowl...

arXiv - Machine Learning · 3 min ·
[2602.16198] Training-Free Adaptation of Diffusion Models via Doob's $h$-Transform
Machine Learning

[2602.16198] Training-Free Adaptation of Diffusion Models via Doob's $h$-Transform

This paper presents a novel training-free adaptation method for diffusion models, leveraging Doob's $h$-transform to enhance sampling eff...

arXiv - Machine Learning · 4 min ·
[2602.16197] ModalImmune: Immunity Driven Unlearning via Self Destructive Training
Machine Learning

[2602.16197] ModalImmune: Immunity Driven Unlearning via Self Destructive Training

The paper presents ModalImmune, a training framework designed to enhance the resilience of multimodal systems against input channel loss ...

arXiv - Machine Learning · 3 min ·
[2602.16188] Deep TPC: Temporal-Prior Conditioning for Time Series Forecasting
Llms

[2602.16188] Deep TPC: Temporal-Prior Conditioning for Time Series Forecasting

The paper introduces Temporal-Prior Conditioning (TPC) for time series forecasting, enhancing temporal reasoning by integrating time as a...

arXiv - Machine Learning · 3 min ·
[2602.16181] Towards Secure and Scalable Energy Theft Detection: A Federated Learning Approach for Resource-Constrained Smart Meters
Machine Learning

[2602.16181] Towards Secure and Scalable Energy Theft Detection: A Federated Learning Approach for Resource-Constrained Smart Meters

This article presents a federated learning framework for detecting energy theft in resource-constrained smart meters, addressing privacy ...

arXiv - Machine Learning · 4 min ·
[2602.16167] Muon with Spectral Guidance: Efficient Optimization for Scientific Machine Learning
Machine Learning

[2602.16167] Muon with Spectral Guidance: Efficient Optimization for Scientific Machine Learning

The paper introduces SpecMuon, a novel optimizer that enhances the Muon optimizer for scientific machine learning by addressing challenge...

arXiv - Machine Learning · 4 min ·
Previous Page 115 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime