Data Science

Data analysis, statistics, and data engineering

Top This Week

Data Science

White-collar workers are quietly rebelling against AI as 80% outright refuse adoption mandates

submitted by /u/Effective-Trick-5795 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

[R] Forced Depth Consideration Reduces Type II Errors in LLM Self-Classification: Evidence from an Exploration Prompting Ablation Study - (200 trap prompts, 4 models, 8 Step-0 variants) [R]

LLM-Based task classifier tend to misroute prompts that look simple at first glance, but require deeper understanding - I call it "Type I...

Reddit - Machine Learning · 1 min ·
Machine Learning

Anyone have an S3-compatible store that actually saturates H100s without the AWS egress tax? [R]

We’re training on a cluster in Lambda Labs, but our main dataset ( over 40TB) is sitting in AWS S3. The egress fees are high, so we tried...

Reddit - Machine Learning · 1 min ·

All Content

[2510.26046] Bias-Corrected Data Synthesis for Imbalanced Learning
Machine Learning

[2510.26046] Bias-Corrected Data Synthesis for Imbalanced Learning

This paper presents a method for bias-corrected data synthesis aimed at improving classification accuracy in imbalanced learning scenario...

arXiv - Machine Learning · 4 min ·
[2510.24215] What Can Be Recovered Under Sparse Adversarial Corruption? Assumption-Free Theory for Linear Measurements
Machine Learning

[2510.24215] What Can Be Recovered Under Sparse Adversarial Corruption? Assumption-Free Theory for Linear Measurements

This paper explores the recoverability of sparse adversarial vectors in linear measurements without relying on strong structural assumpti...

arXiv - Machine Learning · 4 min ·
[2601.02158] FormationEval, an open multiple-choice benchmark for petroleum geoscience
Llms

[2601.02158] FormationEval, an open multiple-choice benchmark for petroleum geoscience

FormationEval introduces a benchmark for evaluating language models in petroleum geoscience, featuring 505 questions across multiple doma...

arXiv - Machine Learning · 4 min ·
[2510.17734] Efficient Tensor Completion Algorithms for Highly Oscillatory Operators
Machine Learning

[2510.17734] Efficient Tensor Completion Algorithms for Highly Oscillatory Operators

This paper introduces efficient tensor completion algorithms designed for reconstructing highly oscillatory operators, demonstrating sign...

arXiv - Machine Learning · 4 min ·
[2510.11418] Forward-Forward Autoencoder Architectures for Energy-Efficient Wireless Communications
Machine Learning

[2510.11418] Forward-Forward Autoencoder Architectures for Energy-Efficient Wireless Communications

This article presents Forward-Forward Autoencoder architectures aimed at enhancing energy efficiency in wireless communications, demonstr...

arXiv - Machine Learning · 3 min ·
[2512.17979] Adaptive Agents in Spatial Double-Auction Markets: Modeling the Emergence of Industrial Symbiosis
Machine Learning

[2512.17979] Adaptive Agents in Spatial Double-Auction Markets: Modeling the Emergence of Industrial Symbiosis

This paper presents an agent-based model to explore how adaptive agents in spatial double-auction markets can foster industrial symbiosis...

arXiv - AI · 4 min ·
[2512.20352] Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Llms

[2512.20352] Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation

This paper presents a novel framework for validating qualitative research using multi-LLM thematic analysis, integrating Cohen's Kappa an...

arXiv - AI · 4 min ·
[2512.12206] ALERT Open Dataset and Input-Size-Agnostic Vision Transformer for Driver Activity Recognition using IR-UWB
Machine Learning

[2512.12206] ALERT Open Dataset and Input-Size-Agnostic Vision Transformer for Driver Activity Recognition using IR-UWB

The paper presents the ALERT dataset and an input-size-agnostic Vision Transformer (ISA-ViT) for driver activity recognition using IR-UWB...

arXiv - Machine Learning · 4 min ·
[2510.02983] Oracle-based Uniform Sampling from Convex Bodies
Machine Learning

[2510.02983] Oracle-based Uniform Sampling from Convex Bodies

This paper introduces new Markov chain Monte Carlo algorithms for uniform sampling from convex bodies, leveraging a restricted Gaussian o...

arXiv - Machine Learning · 3 min ·
[2509.26335] TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs
Machine Learning

[2509.26335] TrackCore-F: Deploying Transformer-Based Subatomic Particle Tracking on FPGAs

The paper discusses TrackCore-F, a methodology for deploying Transformer-based models for subatomic particle tracking on FPGAs, highlight...

arXiv - Machine Learning · 3 min ·
[2512.09185] Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation
Machine Learning

[2512.09185] Learning Patient-Specific Disease Dynamics with Latent Flow Matching for Longitudinal Imaging Generation

The paper presents a novel framework, $ ext{Δ}$-LFM, for modeling patient-specific disease dynamics using latent flow matching, enhancing...

arXiv - AI · 4 min ·
[2509.19665] Deep Learning for Clouds and Cloud Shadow Segmentation in Methane Satellite and Airborne Imaging Spectroscopy
Machine Learning

[2509.19665] Deep Learning for Clouds and Cloud Shadow Segmentation in Methane Satellite and Airborne Imaging Spectroscopy

This article presents a study on deep learning techniques for detecting clouds and cloud shadows in methane satellite and airborne imagin...

arXiv - Machine Learning · 4 min ·
[2509.20928] Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting
Machine Learning

[2509.20928] Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting

The paper introduces Conditionally Whitened Generative Models (CW-Gen) for probabilistic time series forecasting, addressing challenges l...

arXiv - Machine Learning · 4 min ·
[2511.11030] Algorithms Trained on Normal Chest X-rays Can Predict Health Insurance Types
Machine Learning

[2511.11030] Algorithms Trained on Normal Chest X-rays Can Predict Health Insurance Types

This study explores how deep learning algorithms trained on normal chest X-rays can predict patients' health insurance types, revealing h...

arXiv - AI · 4 min ·
[2509.02522] Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR
Llms

[2509.02522] Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

The paper presents PACS, a novel framework for Reinforcement Learning with Verifiable Rewards (RLVR), addressing challenges like sparse r...

arXiv - Machine Learning · 4 min ·
[2511.01228] Importance Ranking in Complex Networks via Influence-aware Causal Node Embedding
Nlp

[2511.01228] Importance Ranking in Complex Networks via Influence-aware Causal Node Embedding

This paper presents a novel framework for ranking node importance in complex networks using influence-aware causal node embedding, enhanc...

arXiv - AI · 4 min ·
[2508.17622] The Statistical Fairness-Accuracy Frontier
Machine Learning

[2508.17622] The Statistical Fairness-Accuracy Frontier

This article explores the trade-offs between fairness and accuracy in predictive modeling, introducing the fairness-accuracy (FA) Pareto ...

arXiv - Machine Learning · 3 min ·
[2508.15555] HEAS: Hierarchical Evolutionary Agent Simulation Framework for Cross-Scale Modeling and Multi-Objective Search
Machine Learning

[2508.15555] HEAS: Hierarchical Evolutionary Agent Simulation Framework for Cross-Scale Modeling and Multi-Objective Search

The HEAS framework integrates agent-based modeling with evolutionary optimization, enabling cross-scale modeling and multi-objective sear...

arXiv - Machine Learning · 4 min ·
[2508.10765] Memorisation and forgetting in a learning Hopfield neural network: bifurcation mechanisms, attractors and basins
Machine Learning

[2508.10765] Memorisation and forgetting in a learning Hopfield neural network: bifurcation mechanisms, attractors and basins

This article explores the mechanisms of memorization and forgetting in Hopfield neural networks, revealing how bifurcations affect memory...

arXiv - Machine Learning · 4 min ·
[2510.18259] Learning under Quantization for High-Dimensional Linear Regression
Machine Learning

[2510.18259] Learning under Quantization for High-Dimensional Linear Regression

This paper explores the impact of low-bit quantization on high-dimensional linear regression, providing a theoretical framework for under...

arXiv - Machine Learning · 4 min ·
Previous Page 129 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime