Data Science

Data analysis, statistics, and data engineering

Top This Week

Data Science

White-collar workers are quietly rebelling against AI as 80% outright refuse adoption mandates

submitted by /u/Effective-Trick-5795 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

[R] Forced Depth Consideration Reduces Type II Errors in LLM Self-Classification: Evidence from an Exploration Prompting Ablation Study - (200 trap prompts, 4 models, 8 Step-0 variants) [R]

LLM-Based task classifier tend to misroute prompts that look simple at first glance, but require deeper understanding - I call it "Type I...

Reddit - Machine Learning · 1 min ·
Machine Learning

Anyone have an S3-compatible store that actually saturates H100s without the AWS egress tax? [R]

We’re training on a cluster in Lambda Labs, but our main dataset ( over 40TB) is sitting in AWS S3. The egress fees are high, so we tried...

Reddit - Machine Learning · 1 min ·

All Content

[2508.01423] 3DRot: Rediscovering the Missing Primitive for RGB-Based 3D Augmentation
Computer Vision

[2508.01423] 3DRot: Rediscovering the Missing Primitive for RGB-Based 3D Augmentation

The paper introduces 3DRot, a novel RGB-based 3D augmentation technique that enhances geometric consistency in 3D tasks by enabling effec...

arXiv - Machine Learning · 4 min ·
[2506.06964] Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization
Llms

[2506.06964] Offline RL by Reward-Weighted Fine-Tuning for Conversation Optimization

This article presents a novel approach to offline reinforcement learning (RL) using reward-weighted fine-tuning, enhancing conversation o...

arXiv - Machine Learning · 3 min ·
[2505.23522] OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data
Nlp

[2505.23522] OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data

The article introduces OmniEarth-Bench, a comprehensive benchmark for evaluating interactions across Earth's six spheres using multimodal...

arXiv - Machine Learning · 4 min ·
[2505.21723] Are Statistical Methods Obsolete in the Era of Deep Learning? A Study of ODE Inverse Problems
Machine Learning

[2505.21723] Are Statistical Methods Obsolete in the Era of Deep Learning? A Study of ODE Inverse Problems

This article examines the relevance of statistical methods in the age of deep learning, using ordinary differential equation (ODE) invers...

arXiv - Machine Learning · 4 min ·
[2505.20754] Stationary MMD Points
Machine Learning

[2505.20754] Stationary MMD Points

The paper discusses the concept of stationary MMD points in numerical integration, demonstrating their advantages over traditional method...

arXiv - Machine Learning · 3 min ·
[2504.18367] A Novel 4-D Dataset Paradigm for Studying Complete Ligand-Protein Dissociation Dynamics
Machine Learning

[2504.18367] A Novel 4-D Dataset Paradigm for Studying Complete Ligand-Protein Dissociation Dynamics

This article introduces a novel 4-D dataset paradigm for studying ligand-protein dissociation dynamics, presenting the DD-13M database th...

arXiv - Machine Learning · 3 min ·
[2509.22794] Differentially Private Two-Stage Gradient Descent for Instrumental Variable Regression
Machine Learning

[2509.22794] Differentially Private Two-Stage Gradient Descent for Instrumental Variable Regression

This paper presents a novel algorithm for instrumental variable regression that ensures differential privacy while maintaining statistica...

arXiv - Machine Learning · 4 min ·
[2504.09733] Epsilon-Neighborhood Decision-Boundary Governed Estimation (EDGE) of 2D Black Box Classifier Functions
Ai Startups

[2504.09733] Epsilon-Neighborhood Decision-Boundary Governed Estimation (EDGE) of 2D Black Box Classifier Functions

The paper presents the Epsilon-Neighborhood Decision-Boundary Governed Estimation (EDGE) algorithm for efficiently estimating decision bo...

arXiv - Machine Learning · 4 min ·
[2502.01713] Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool
Ai Safety

[2502.01713] Auditing a Dutch Public Sector Risk Profiling Algorithm Using an Unsupervised Bias Detection Tool

This article presents an audit of a Dutch public sector risk profiling algorithm, utilizing an unsupervised bias detection tool to identi...

arXiv - Machine Learning · 4 min ·
[2509.13550] Complexity Bounds for Smooth Multiobjective Optimization
Machine Learning

[2509.13550] Complexity Bounds for Smooth Multiobjective Optimization

This paper investigates the oracle complexity of finding ε-Pareto stationary points in smooth multiobjective optimization, presenting new...

arXiv - AI · 3 min ·
[2509.13229] Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation
Machine Learning

[2509.13229] Curriculum Multi-Task Self-Supervision Improves Lightweight Architectures for Onboard Satellite Hyperspectral Image Segmentation

This article presents a novel framework, Curriculum Multi-Task Self-Supervision Learning (CMTSSL), aimed at enhancing lightweight archite...

arXiv - Machine Learning · 4 min ·
[2501.01696] Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent
Machine Learning

[2501.01696] Guaranteed Nonconvex Low-Rank Tensor Estimation via Scaled Gradient Descent

This paper presents a novel Scaled Gradient Descent (ScaledGD) algorithm for low-rank tensor estimation, demonstrating linear convergence...

arXiv - Machine Learning · 4 min ·
[2509.12456] Reinforcement Learning-Based Market Making as a Stochastic Control on Non-Stationary Limit Order Book Dynamics
Machine Learning

[2509.12456] Reinforcement Learning-Based Market Making as a Stochastic Control on Non-Stationary Limit Order Book Dynamics

This paper explores the use of reinforcement learning for market making in non-stationary limit order book dynamics, presenting a practic...

arXiv - AI · 4 min ·
[2411.12159] Sensor-fusion based Prognostics for Deep-space Habitats Exhibiting Multiple Unlabeled Failure Modes
Robotics

[2411.12159] Sensor-fusion based Prognostics for Deep-space Habitats Exhibiting Multiple Unlabeled Failure Modes

This paper presents a novel unsupervised prognostics framework for deep-space habitats, addressing multiple unlabeled failure modes throu...

arXiv - Machine Learning · 4 min ·
[2411.01629] Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity
Machine Learning

[2411.01629] Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

This paper explores denoising diffusions using optimal transport, focusing on localization, curvature, and multi-scale complexity in gene...

arXiv - Machine Learning · 4 min ·
[2508.21285] A Financial Brain Scan of the LLM
Llms

[2508.21285] A Financial Brain Scan of the LLM

This article presents a novel approach to analyzing large language models (LLMs) in finance, enabling researchers to identify and manipul...

arXiv - AI · 3 min ·
[2410.22009] On uniqueness in structured model learning
Machine Learning

[2410.22009] On uniqueness in structured model learning

This paper explores the uniqueness in structured model learning for systems of partial differential equations (PDEs), proposing a framewo...

arXiv - Machine Learning · 4 min ·
[2508.19300] CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy
Computer Vision

[2508.19300] CellINR: Implicitly Overcoming Photo-induced Artifacts in 4D Live Fluorescence Microscopy

The paper presents CellINR, a novel framework designed to mitigate photo-induced artifacts in 4D live fluorescence microscopy, enhancing ...

arXiv - AI · 4 min ·
[2410.17587] Predicting Company Growth using Scaling Theory informed Machine Learning
Machine Learning

[2410.17587] Predicting Company Growth using Scaling Theory informed Machine Learning

The paper presents a novel Scaling-Theory-Informed Machine Learning (STIML) framework for predicting company growth by integrating struct...

arXiv - Machine Learning · 4 min ·
[2508.07514] Robust MultiSpecies Agricultural Segmentation Across Devices, Seasons, and Sensors Using Hierarchical DINOv2 Models
Machine Learning

[2508.07514] Robust MultiSpecies Agricultural Segmentation Across Devices, Seasons, and Sensors Using Hierarchical DINOv2 Models

This article presents a robust segmentation framework using Hierarchical DINOv2 models for reliable plant species and damage identificati...

arXiv - AI · 4 min ·
Previous Page 130 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime