Data Science

Data analysis, statistics, and data engineering

Top This Week

Google quietly launched an AI dictation app that works offline
Machine Learning

Google quietly launched an AI dictation app that works offline

Google's new offline-first dictation app uses Gemma AI models to take on the apps like Wispr Flow.

TechCrunch - AI · 4 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

[D] Tested model routing on financial AI datasets — good savings and curious what benchmarks others use.

Ran a benchmark evaluating whether prompt complexity-based routing delivers meaningful savings. Used public HuggingFace datasets. Here's ...

Reddit - Machine Learning · 1 min ·

All Content

The Search Engine for OnlyFans Models Who Look Like Your Crush | WIRED
Machine Learning

The Search Engine for OnlyFans Models Who Look Like Your Crush | WIRED

Presearch's new tool, Doppelgänger, allows users to find OnlyFans models resembling celebrities, aiming to improve content discovery whil...

Wired - AI · 10 min ·
Ai Agents

Hybrid MARL + Linear Programming Architecture for Dynamic Vehicle Routing (Zero-Shot Generalization)

This article presents a hybrid architecture combining Multi-Agent Reinforcement Learning (MARL) and Linear Programming (LP) for optimizin...

Reddit - Machine Learning · 1 min ·
How artificial intelligence is reshaping geotechnical engineering skills
Machine Learning

How artificial intelligence is reshaping geotechnical engineering skills

The article discusses how AI is transforming geotechnical engineering by automating tasks, enhancing data analysis, and creating new skil...

AI News - General · 9 min ·
Machine Learning

[D] How should I fine-tune an ASR model for multilingual IPA transcription?

The article discusses how to fine-tune an ASR model for multilingual IPA transcription, seeking advice on model selection and training st...

Reddit - Machine Learning · 1 min ·
[2601.16174] Beyond Predictive Uncertainty: Reliable Representation Learning with Structural Constraints
Machine Learning

[2601.16174] Beyond Predictive Uncertainty: Reliable Representation Learning with Structural Constraints

This paper introduces a framework for reliable representation learning in machine learning, emphasizing the importance of representation-...

arXiv - Machine Learning · 3 min ·
[2509.22860] Ringleader ASGD: The First Asynchronous SGD with Optimal Time Complexity under Data Heterogeneity
Machine Learning

[2509.22860] Ringleader ASGD: The First Asynchronous SGD with Optimal Time Complexity under Data Heterogeneity

The paper introduces Ringleader ASGD, an asynchronous SGD algorithm that achieves optimal time complexity under data heterogeneity, addre...

arXiv - Machine Learning · 4 min ·
[2506.12819] Nonlinear Model Order Reduction of Dynamical Systems in Process Engineering: Review and Comparison
Machine Learning

[2506.12819] Nonlinear Model Order Reduction of Dynamical Systems in Process Engineering: Review and Comparison

This article reviews and compares nonlinear model order reduction methods for dynamical systems in process engineering, highlighting thei...

arXiv - Machine Learning · 4 min ·
[2505.00282] A Unifying Framework for Robust and Efficient Inference with Unstructured Data
Machine Learning

[2505.00282] A Unifying Framework for Robust and Efficient Inference with Unstructured Data

This paper presents a new framework, MAR-S, for robust and efficient inference with unstructured data, addressing biases in neural networ...

arXiv - Machine Learning · 4 min ·
[2505.10444] Inferring entropy production in many-body systems using nonequilibrium maximum entropy
Nlp

[2505.10444] Inferring entropy production in many-body systems using nonequilibrium maximum entropy

This article presents a novel method for inferring entropy production in many-body systems using a nonequilibrium maximum entropy approac...

arXiv - Machine Learning · 4 min ·
[2502.10361] Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
Llms

[2502.10361] Enhancing Multilingual LLM Pretraining with Model-Based Data Selection

This article presents a model-based data selection framework for enhancing multilingual LLM pretraining, demonstrating significant effici...

arXiv - Machine Learning · 4 min ·
[2411.02137] Finite-sample performance of the maximum likelihood estimator in logistic regression
Machine Learning

[2411.02137] Finite-sample performance of the maximum likelihood estimator in logistic regression

This article examines the finite-sample performance of the maximum likelihood estimator (MLE) in logistic regression, focusing on its exi...

arXiv - Machine Learning · 4 min ·
[2409.20250] Input-Label Correlation Governs a Linear-to-Nonlinear Transition in Random Features under Spiked Covariance
Machine Learning

[2409.20250] Input-Label Correlation Governs a Linear-to-Nonlinear Transition in Random Features under Spiked Covariance

This article explores how input-label correlation influences the performance of random feature models (RFMs) in machine learning, particu...

arXiv - Machine Learning · 4 min ·
[2305.01507] A Parameter-free Adaptive Resonance Theory-based Topological Clustering Algorithm Capable of Continual Learning
Machine Learning

[2305.01507] A Parameter-free Adaptive Resonance Theory-based Topological Clustering Algorithm Capable of Continual Learning

This article presents a novel parameter-free Adaptive Resonance Theory-based topological clustering algorithm that enhances clustering pe...

arXiv - Machine Learning · 4 min ·
[2602.11893] Universal Diffusion-Based Probabilistic Downscaling
Machine Learning

[2602.11893] Universal Diffusion-Based Probabilistic Downscaling

The paper presents a universal diffusion-based framework for downscaling weather forecasts, enhancing low-resolution predictions into hig...

arXiv - Machine Learning · 3 min ·
[2601.20775] Active Learning for Decision Trees with Provable Guarantees
Machine Learning

[2601.20775] Active Learning for Decision Trees with Provable Guarantees

This paper explores active learning for decision trees, presenting a new algorithm that achieves polylogarithmic label complexity with pr...

arXiv - Machine Learning · 4 min ·
[2601.14517] Learning PDE Solvers with Physics and Data: A Unifying View of Physics-Informed Neural Networks and Neural Operators
Machine Learning

[2601.14517] Learning PDE Solvers with Physics and Data: A Unifying View of Physics-Informed Neural Networks and Neural Operators

This paper presents a unified perspective on learning PDE solvers, integrating Physics-Informed Neural Networks and Neural Operators to e...

arXiv - Machine Learning · 4 min ·
[2601.10181] Reinforcement Learning to Discover a North-East Monsoon Index for Rainfall Prediction in Thailand
Machine Learning

[2601.10181] Reinforcement Learning to Discover a North-East Monsoon Index for Rainfall Prediction in Thailand

This article presents a novel North-East monsoon climate index for improving rainfall predictions in Thailand, utilizing reinforcement le...

arXiv - Machine Learning · 4 min ·
[2512.23405] On the Sample Complexity of Learning for Blind Inverse Problems
Machine Learning

[2512.23405] On the Sample Complexity of Learning for Blind Inverse Problems

This article explores the sample complexity of learning in blind inverse problems, providing theoretical insights and empirical validatio...

arXiv - Machine Learning · 4 min ·
[2510.14190] Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation
Machine Learning

[2510.14190] Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation

The paper presents Contrastive Diffusion Alignment (ConDA), a method that enhances the interpretability and control of diffusion models b...

arXiv - Machine Learning · 4 min ·
[2510.20220] Alternatives to the Laplacian for Scalable Spectral Clustering with Group Fairness Constraints
Ai Safety

[2510.20220] Alternatives to the Laplacian for Scalable Spectral Clustering with Group Fairness Constraints

This paper presents the Fair-SMW algorithm, an innovative approach to spectral clustering that enhances computational efficiency while en...

arXiv - Machine Learning · 4 min ·
Previous Page 93 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime