Data Science

Data analysis, statistics, and data engineering

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Data Science

"OpenAI quietly removed the one safety mechanism that could shut the whole thing down — and nobody is talking about it"

OpenAI was founded as a nonprofit for one specific reason — to ensure AI development couldn't be hijacked by profit motives. Their origin...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

[P] citracer: a small CLI tool to trace where a concept comes from in a citation graph

Hi all, I made a small tool that I've been using for my own literature reviews and figured I'd share in case it's useful to anyone else. ...

Reddit - Machine Learning · 1 min · about 2 hours ago

Data Science

What actually makes something the best AI meeting recorder?

I’ve been trying a few meeting tools lately and realized I care way less about flashy summaries than I thought. What I actually want is p...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

All Content

Machine Learning

[2602.16530] FEKAN: Feature-Enriched Kolmogorov-Arnold Networks

The paper introduces Feature-Enriched Kolmogorov-Arnold Networks (FEKAN), an advanced model that enhances computational efficiency and pr...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15861] CAST: Achieving Stable LLM-based Text Analysis for Data Analytics

The paper presents CAST, a framework designed to improve the stability of LLM-based text analysis in data analytics by enhancing output c...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16503] Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects

This paper introduces Conditionally Additive Local Models (CALMs), which enhance the interpretability of Generalized Additive Models (GAM...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16507] Small molecule retrieval from tandem mass spectrometry: what are we optimizing for?

This paper explores the optimization of loss functions in deep learning models for small molecule retrieval from tandem mass spectrometry...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16498] Fast and Scalable Analytical Diffusion

The paper presents GoldDiff, a novel framework for analytical diffusion that enhances scalability and speed in generative modeling by dyn...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.16456] Beyond SGD, Without SVD: Proximal Subspace Iteration LoRA with Diagonal Fractional K-FAC

This paper presents a novel approach called LoRSum for optimizing Low-Rank Adaptation (LoRA) in machine learning, enhancing efficiency in...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16468] HPMixer: Hierarchical Patching for Multivariate Time Series Forecasting

The paper presents HPMixer, a novel approach for multivariate time series forecasting that effectively models periodic patterns and resid...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16449] GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation

The paper presents GICDM, a method to mitigate hubness in distance-based evaluations of generative models, enhancing reliability and alig...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15851] Narrative Theory-Driven LLM Methods for Automatic Story Generation and Understanding: A Survey

This survey explores the intersection of narrative theory and large language models (LLMs) for automatic story generation and understandi...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.16436] Learning with Locally Private Examples by Inverse Weierstrass Private Stochastic Gradient Descent

This paper presents a novel method for correcting bias in binary classification tasks using locally private examples, leveraging the Inve...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.15844] Language Model Representations for Efficient Few-Shot Tabular Classification

This paper explores the use of language model representations for efficient few-shot classification of tabular data, proposing a new para...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.16400] Easy Data Unlearning Bench

The paper introduces the Easy Data Unlearning Bench, a unified benchmarking suite aimed at simplifying the evaluation of machine unlearni...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.15843] The Perplexity Paradox: Why Code Compresses Better Than Math in LLM Prompts

This article explores the 'perplexity paradox' in large language models (LLMs), demonstrating that code compresses better than mathematic...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16357] Optical Inversion and Spectral Unmixing of Spectroscopic Photoacoustic Images with Physics-Informed Neural Networks

This article presents the Spectroscopic Photoacoustic Optical Inversion Autoencoder (SPOI-AE), a novel approach using physics-informed ne...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15832] What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation

This article explores the identification of unknown user personas in simulations, introducing the PICQ dataset and evaluating leading LLM...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16341] Explainability for Fault Detection System in Chemical Processes

This article evaluates two explainability methods, Integrated Gradients and SHAP, for fault detection in chemical processes using an LSTM...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16327] Guide-Guard: Off-Target Predicting in CRISPR Applications

The paper presents Guide-Guard, a machine learning solution designed to predict off-target effects in CRISPR applications with 84% accura...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16316] A Graph Meta-Network for Learning on Kolmogorov-Arnold Networks

This paper introduces WS-KAN, a novel weight-space architecture for Kolmogorov-Arnold Networks (KANs), demonstrating its superior perform...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16481] Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approach

This article explores the use of large language models (LLMs) in causal discovery, proposing a constraint-based, argumentation-driven app...

arXiv - AI · 3 min · about 2 months ago

Robotics

[2602.16435] Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

The paper presents CAFE, a novel framework for automated feature engineering that combines causal discovery with multi-agent reinforcemen...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 114 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Data Science

Top This Week

"OpenAI quietly removed the one safety mechanism that could shut the whole thing down — and nobody is talking about it"

[P] citracer: a small CLI tool to trace where a concept comes from in a citation graph

What actually makes something the best AI meeting recorder?

All Content

[2602.16530] FEKAN: Feature-Enriched Kolmogorov-Arnold Networks

[2602.15861] CAST: Achieving Stable LLM-based Text Analysis for Data Analytics

[2602.16503] Interpretability-by-Design with Accurate Locally Additive Models and Conditional Feature Effects

[2602.16507] Small molecule retrieval from tandem mass spectrometry: what are we optimizing for?

[2602.16498] Fast and Scalable Analytical Diffusion

[2602.16456] Beyond SGD, Without SVD: Proximal Subspace Iteration LoRA with Diagonal Fractional K-FAC

[2602.16468] HPMixer: Hierarchical Patching for Multivariate Time Series Forecasting

[2602.16449] GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation

[2602.15851] Narrative Theory-Driven LLM Methods for Automatic Story Generation and Understanding: A Survey

[2602.16436] Learning with Locally Private Examples by Inverse Weierstrass Private Stochastic Gradient Descent

[2602.15844] Language Model Representations for Efficient Few-Shot Tabular Classification

[2602.16400] Easy Data Unlearning Bench

[2602.15843] The Perplexity Paradox: Why Code Compresses Better Than Math in LLM Prompts

[2602.16357] Optical Inversion and Spectral Unmixing of Spectroscopic Photoacoustic Images with Physics-Informed Neural Networks

[2602.15832] What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation

[2602.16341] Explainability for Fault Detection System in Chemical Processes

[2602.16327] Guide-Guard: Off-Target Predicting in CRISPR Applications

[2602.16316] A Graph Meta-Network for Learning on Kolmogorov-Arnold Networks

[2602.16481] Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approach

[2602.16435] Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

Related Topics

Stay updated with AI News