Data Science

Data analysis, statistics, and data engineering

Top This Week

Mantis Biotech is making 'digital twins' of humans to help solve medicine's data availability problem | TechCrunch
Data Science

Mantis Biotech is making 'digital twins' of humans to help solve medicine's data availability problem | TechCrunch

Mantis takes disparate sources of data to make synthetic datasets that can be used to build so-called "digital twins" of the human body, ...

TechCrunch - AI · 6 min ·
Nlp

[P] Using YouTube as a data source (lessons from building a coffee domain dataset)

I started working on a small coffee coaching app recently - something that could answer questions around brew methods, grind size, extrac...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·

All Content

[2505.00624] FineScope : SAE-guided Data Selection Enables Domain Specific LLM Pruning and Finetuning
Llms

[2505.00624] FineScope : SAE-guided Data Selection Enables Domain Specific LLM Pruning and Finetuning

Abstract page for arXiv paper 2505.00624: FineScope : SAE-guided Data Selection Enables Domain Specific LLM Pruning and Finetuning

arXiv - AI · 4 min ·
[2602.24047] Unsupervised Baseline Clustering and Incremental Adaptation for IoT Device Traffic Profiling
Machine Learning

[2602.24047] Unsupervised Baseline Clustering and Incremental Adaptation for IoT Device Traffic Profiling

Abstract page for arXiv paper 2602.24047: Unsupervised Baseline Clustering and Incremental Adaptation for IoT Device Traffic Profiling

arXiv - Machine Learning · 3 min ·
[2404.00306] A blockchain-based intelligent recommender system framework for enhancing supply chain resilience
Data Science

[2404.00306] A blockchain-based intelligent recommender system framework for enhancing supply chain resilience

Abstract page for arXiv paper 2404.00306: A blockchain-based intelligent recommender system framework for enhancing supply chain resilience

arXiv - AI · 4 min ·
[2602.23666] Active Learning for Planet Habitability Classification under Extreme Class Imbalance
Data Science

[2602.23666] Active Learning for Planet Habitability Classification under Extreme Class Imbalance

Abstract page for arXiv paper 2602.23666: Active Learning for Planet Habitability Classification under Extreme Class Imbalance

arXiv - Machine Learning · 4 min ·
[2509.24159] RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment
Llms

[2509.24159] RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment

Abstract page for arXiv paper 2509.24159: RE-PO: Robust Enhanced Policy Optimization as a General Framework for LLM Alignment

arXiv - AI · 4 min ·
[2602.23524] V-MORALS: Visual Morse Graph-Aided Estimation of Regions of Attraction in a Learned Latent Space
Machine Learning

[2602.23524] V-MORALS: Visual Morse Graph-Aided Estimation of Regions of Attraction in a Learned Latent Space

Abstract page for arXiv paper 2602.23524: V-MORALS: Visual Morse Graph-Aided Estimation of Regions of Attraction in a Learned Latent Space

arXiv - Machine Learning · 4 min ·
[2602.24238] Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis
Llms

[2602.24238] Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale Benchmark Analysis

Abstract page for arXiv paper 2602.24238: Time Series Foundation Models as Strong Baselines in Transportation Forecasting: A Large-Scale ...

arXiv - Machine Learning · 3 min ·
[2602.24060] Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis
Llms

[2602.24060] Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

Abstract page for arXiv paper 2602.24060: Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis

arXiv - AI · 4 min ·
[2602.24009] Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking
Llms

[2602.24009] Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

Abstract page for arXiv paper 2602.24009: Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

arXiv - Machine Learning · 4 min ·
[2602.23874] Exploring Robust Intrusion Detection: A Benchmark Study of Feature Transferability in IoT Botnet Attack Detection
Data Science

[2602.23874] Exploring Robust Intrusion Detection: A Benchmark Study of Feature Transferability in IoT Botnet Attack Detection

Abstract page for arXiv paper 2602.23874: Exploring Robust Intrusion Detection: A Benchmark Study of Feature Transferability in IoT Botne...

arXiv - Machine Learning · 4 min ·
[2602.23729] From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning
Llms

[2602.23729] From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning

Abstract page for arXiv paper 2602.23729: From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating...

arXiv - Machine Learning · 4 min ·
[2602.23649] AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech
Llms

[2602.23649] AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

Abstract page for arXiv paper 2602.23649: AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

arXiv - AI · 3 min ·
[2602.23610] LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning
Llms

[2602.23610] LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

Abstract page for arXiv paper 2602.23610: LLM-Driven Multi-Turn Task-Oriented Dialogue Synthesis for Realistic Reasoning

arXiv - AI · 4 min ·
[2602.23603] LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering
Llms

[2602.23603] LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering

Abstract page for arXiv paper 2602.23603: LFQA-HP-1M: A Large-Scale Human Preference Dataset for Long-Form Question Answering

arXiv - AI · 3 min ·
[2602.23514] Modelling and Simulation of Neuromorphic Datasets for Anomaly Detection in Computer Vision
Machine Learning

[2602.23514] Modelling and Simulation of Neuromorphic Datasets for Anomaly Detection in Computer Vision

Abstract page for arXiv paper 2602.23514: Modelling and Simulation of Neuromorphic Datasets for Anomaly Detection in Computer Vision

arXiv - Machine Learning · 4 min ·
[2602.23499] TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving
Robotics

[2602.23499] TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

Abstract page for arXiv paper 2602.23499: TaCarla: A comprehensive benchmarking dataset for end-to-end autonomous driving

arXiv - AI · 4 min ·
[2602.23438] DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation
Machine Learning

[2602.23438] DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Generation

Abstract page for arXiv paper 2602.23438: DesignSense: A Human Preference Dataset and Reward Modeling Framework for Graphic Layout Genera...

arXiv - AI · 4 min ·
[2602.23388] Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages
Nlp

[2602.23388] Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

Abstract page for arXiv paper 2602.23388: Task-Lens: Cross-Task Utility Based Speech Dataset Profiling for Low-Resource Indian Languages

arXiv - AI · 4 min ·
[2602.24288] DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science
Llms

[2602.24288] DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

Abstract page for arXiv paper 2602.24288: DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

arXiv - AI · 4 min ·
[2602.23974] Pessimistic Auxiliary Policy for Offline Reinforcement Learning
Data Science

[2602.23974] Pessimistic Auxiliary Policy for Offline Reinforcement Learning

Abstract page for arXiv paper 2602.23974: Pessimistic Auxiliary Policy for Offline Reinforcement Learning

arXiv - AI · 3 min ·
Previous Page 26 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime