Data Science

Data analysis, statistics, and data engineering

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min

Accelerating science with AI and simulations
Machine Learning

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min

Data Science

~77% of all new "Success" self-help books on Amazon are likely written by AI, with 1 author, Noah Felix Bennett, publishing a stunning 74 books in mid-2025 alone, at a rate of >1 per day. Richard Trillion Mantey, who has published hundreds of books, was assessed to have used AI for every single book

"Ironically, one of the 844 books in this dataset is called 'How to Write for Humans in an AI World: Cutting Through Digital Noise and Re...

Reddit - Artificial Intelligence · 1 min

All Content

[2602.15376] A Unified Evaluation of Learning-Based Similarity Techniques for Malware Detection
AI Startups

This paper presents a systematic evaluation of learning-based similarity techniques for malware detection, comparing various methods unde...

arXiv - AI · 4 min

[2601.01016] Improving Variational Autoencoder using Random Fourier Transformation: An Aviation Safety Anomaly Detection Case-Study
Machine Learning

This study explores enhancements to Variational Autoencoders (VAEs) using Random Fourier Transformation (RFT) for anomaly detection in av...

arXiv - Machine Learning · 4 min

[2602.15373] Far Out: Evaluating Language Models on Slang in Australian and Indian English
LLMs

This paper evaluates the performance of language models on slang in Australian and Indian English, revealing significant gaps in understa...

arXiv - AI · 4 min

[2512.19057] Efficient Personalization of Generative Models via Optimal Experimental Design
Machine Learning

This paper presents a novel method for efficiently personalizing generative models using optimal experimental design to select preference...

arXiv - Machine Learning · 3 min

[2511.09763] Is nasty noise actually harder than malicious noise?
Machine Learning

This paper explores the complexities of learning Boolean functions in the presence of two noise models: malicious and nasty noise, highli...

arXiv - Machine Learning · 4 min

[2510.26792] Learning Pseudorandom Numbers with Transformers: Permuted Congruential Generators, Curricula, and Interpretability
Machine Learning

This article explores how Transformer models can learn sequences generated by Permuted Congruential Generators (PCGs), demonstrating thei...

arXiv - Machine Learning · 4 min

[2602.15339] Benchmarking Self-Supervised Models for Cardiac Ultrasound View Classification
Machine Learning

This article evaluates self-supervised learning models for cardiac ultrasound view classification, comparing USF-MAE and MoCo v3 using th...

arXiv - AI · 4 min

[2510.02625] TabImpute: Universal Zero-Shot Imputation for Tabular Data
Machine Learning

The paper presents TabImpute, a pre-trained transformer model designed for zero-shot imputation of missing data in tabular formats, signi...

arXiv - Machine Learning · 4 min

[2510.01510] Flock: A Knowledge Graph Foundation Model via Learning on Random Walks
LLMs

The paper presents Flock, a knowledge graph foundation model that enhances zero-shot link prediction by employing probabilistic node-rela...

arXiv - Machine Learning · 4 min

[2509.20936] GenFacts-Generative Counterfactual Explanations for Multi-Variate Time Series
Machine Learning

The paper introduces GenFacts, a generative framework for creating counterfactual explanations in multivariate time series, improving mod...

arXiv - Machine Learning · 3 min

[2509.18131] Randomness and signal propagation in physics-informed neural networks (PINNs): A neural PDE perspective
Machine Learning

This article investigates the randomness in weight matrices of physics-informed neural networks (PINNs) and its impact on signal propagat...

arXiv - Machine Learning · 4 min

[2509.00663] Morephy-Net: An Evolutionary Multi-objective Optimization for Replica-Exchange-based Physics-informed Neural Operator Learning Networks
Machine Learning

Morephy-Net introduces an evolutionary multi-objective optimization method for physics-informed neural operator learning networks, enhanc...

arXiv - Machine Learning · 4 min

[2508.16832] Out of Distribution Detection for Efficient Continual Learning in Quality Prediction for Arc Welding
Machine Learning

This article presents a novel approach to out-of-distribution detection in arc welding quality prediction, enhancing continual learning b...

arXiv - AI · 4 min

[2508.16237] A XAI-based Framework for Frequency Subband Characterization of Cough Spectrograms in Chronic Respiratory Disease
Machine Learning

This paper presents an explainable AI framework for analyzing cough sounds linked to chronic respiratory diseases, focusing on COPD. It u...

arXiv - AI · 4 min

[2508.11460] Calibrated and uncertain? Evaluating uncertainty estimates in binary classification models
Machine Learning

This article evaluates uncertainty estimates in binary classification models, comparing six probabilistic machine learning algorithms to ...

arXiv - Machine Learning · 4 min

[2505.11985] Variance-Optimal Arm Selection: Misallocation Minimization and Best Arm Identification
Machine Learning

This paper presents novel algorithms for selecting the arm with the highest variance among independent arms, focusing on misallocation mi...

arXiv - Machine Learning · 4 min

[2505.11695] Qronos: Correcting the Past by Shaping the Future... in Post-Training Quantization
Machine Learning

The paper introduces Qronos, a novel post-training quantization algorithm that enhances neural network performance by correcting quantiza...

arXiv - AI · 4 min

[2504.20823] Hybrid quantum recurrent neural network for remaining useful life prediction
Machine Learning

This article presents a Hybrid Quantum Recurrent Neural Network framework for predicting the remaining useful life of jet engines, showca...

arXiv - Machine Learning · 4 min

[2504.15206] How Global Calibration Strengthens Multiaccuracy
Machine Learning

This article explores how global calibration enhances multiaccuracy in machine learning, revealing its potential to improve predictive fa...

arXiv - Machine Learning · 4 min

[2502.13022] Efficient and Sharp Off-Policy Learning under Unobserved Confounding
AI Safety

This paper presents a novel method for off-policy learning that addresses unobserved confounding, enhancing the accuracy of policy learni...

arXiv - Machine Learning · 4 min