Data Science

Data analysis, statistics, and data engineering

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Data Science

Mantis Biotech is making 'digital twins' of humans to help solve medicine's data availability problem | TechCrunch

Mantis takes disparate sources of data to make synthetic datasets that can be used to build so-called "digital twins" of the human body, ...

TechCrunch - AI · 6 min · about 2 hours ago

Nlp

[P] Using YouTube as a data source (lessons from building a coffee domain dataset)

I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...

Reddit - Machine Learning · 1 min · about 4 hours ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 10 hours ago

All Content

Llms

[2505.00624] FineScope : SAE-guided Data Selection Enables Domain Specific LLM Pruning and Finetuning

Abstract page for arXiv paper 2505.00624: FineScope : SAE-guided Data Selection Enables Domain Specific LLM Pruning and Finetuning

arXiv - AI · 4 min · 28 days ago

Machine Learning

[2602.24047] Unsupervised Baseline Clustering and Incremental Adaptation for IoT Device Traffic Profiling

Abstract page for arXiv paper 2602.24047: Unsupervised Baseline Clustering and Incremental Adaptation for IoT Device Traffic Profiling

arXiv - Machine Learning · 3 min · 28 days ago

Data Science

[2404.00306] A blockchain-based intelligent recommender system framework for enhancing supply chain resilience

Abstract page for arXiv paper 2404.00306: A blockchain-based intelligent recommender system framework for enhancing supply chain resilience

arXiv - AI · 4 min · 28 days ago

Data Science

[2602.23666] Active Learning for Planet Habitability Classification under Extreme Class Imbalance

Abstract page for arXiv paper 2602.23666: Active Learning for Planet Habitability Classification under Extreme Class Imbalance

arXiv - Machine Learning · 4 min · 28 days ago

Llms

[2509.24159] RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment

Abstract page for arXiv paper 2509.24159: RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment

arXiv - AI · 4 min · 28 days ago

Machine Learning

[2602.23524] V-MORALS: Visual Morse Graph-Aided Estimation of Regions of Attraction in a Learned Latent Space

Abstract page for arXiv paper 2602.23524: V-MORALS: Visual Morse Graph-Aided Estimation of Regions of Attraction in a Learned Latent Space

arXiv - Machine Learning · 4 min · 28 days ago

Llms

[2602.24238] Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

Abstract page for arXiv paper 2602.24238: Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale ...

arXiv - Machine Learning · 3 min · 28 days ago

Llms

[2602.24060] Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

Abstract page for arXiv paper 2602.24060: Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

arXiv - AI · 4 min · 28 days ago

Llms

[2602.24009] Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

Abstract page for arXiv paper 2602.24009: Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

arXiv - Machine Learning · 4 min · 28 days ago

Data Science

[2602.23874] Exploring Robust Intrusion Detection: A Benchmark Study of Feature Transferability in IoT Botnet Attack Detection

Abstract page for arXiv paper 2602.23874: Exploring Robust Intrusion Detection: A Benchmark Study of Feature Transferability in IoT Botne...

arXiv - Machine Learning · 4 min · 28 days ago

Llms

[2602.23729] From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning

Abstract page for arXiv paper 2602.23729: From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating...

arXiv - Machine Learning · 4 min · 28 days ago

Llms

[2602.23649] AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

Abstract page for arXiv paper 2602.23649: AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

arXiv - AI · 3 min · 28 days ago

Llms

[2602.23610] LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

Abstract page for arXiv paper 2602.23610: LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

arXiv - AI · 4 min · 28 days ago

Llms

[2602.23603] LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering

Abstract page for arXiv paper 2602.23603: LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering

arXiv - AI · 3 min · 28 days ago

Machine Learning

[2602.23514] Modelling and Simulation of Neuromorphic Datasets for Anomaly Detection in Computer Vision

Abstract page for arXiv paper 2602.23514: Modelling and Simulation of Neuromorphic Datasets for Anomaly Detection in Computer Vision

arXiv - Machine Learning · 4 min · 28 days ago

Robotics

[2602.23499] TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

Abstract page for arXiv paper 2602.23499: TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

arXiv - AI · 4 min · 28 days ago

Machine Learning

[2602.23438] DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation

Abstract page for arXiv paper 2602.23438: DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Genera...

arXiv - AI · 4 min · 28 days ago

Nlp

[2602.23388] Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

Abstract page for arXiv paper 2602.23388: Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

arXiv - AI · 4 min · 28 days ago

Llms

[2602.24288] DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

Abstract page for arXiv paper 2602.24288: DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

arXiv - AI · 4 min · 28 days ago

Data Science

[2602.23974] Pessimistic Auxiliary Policy for Offline Reinforcement Learning

Abstract page for arXiv paper 2602.23974: Pessimistic Auxiliary Policy for Offline Reinforcement Learning

arXiv - AI · 3 min · 28 days ago

Previous Page 26 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Data Science

Top This Week

Mantis Biotech is making 'digital twins' of humans to help solve medicine's data availability problem | TechCrunch

[P] Using YouTube as a data source (lessons from building a coffee domain dataset)

UMKC Announces New Master of Science in Artificial Intelligence

All Content

[2505.00624] FineScope : SAE-guided Data Selection Enables Domain Specific LLM Pruning and Finetuning

[2602.24047] Unsupervised Baseline Clustering and Incremental Adaptation for IoT Device Traffic Profiling

[2404.00306] A blockchain-based intelligent recommender system framework for enhancing supply chain resilience

[2602.23666] Active Learning for Planet Habitability Classification under Extreme Class Imbalance

[2509.24159] RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment

[2602.23524] V-MORALS: Visual Morse Graph-Aided Estimation of Regions of Attraction in a Learned Latent Space

[2602.24238] Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

[2602.24060] Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

[2602.24009] Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

[2602.23874] Exploring Robust Intrusion Detection: A Benchmark Study of Feature Transferability in IoT Botnet Attack Detection

[2602.23729] From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning

[2602.23649] AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

[2602.23610] LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

[2602.23603] LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering

[2602.23514] Modelling and Simulation of Neuromorphic Datasets for Anomaly Detection in Computer Vision

[2602.23499] TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

[2602.23438] DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation

[2602.23388] Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

[2602.24288] DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

[2602.23974] Pessimistic Auxiliary Policy for Offline Reinforcement Learning

Related Topics

Stay updated with AI News