Top Data Science This Week

The most engaging data science content from this week, curated by AI News.

  1. 1

    [2605.02318] Can Causal Discovery Algorithms Help in Generating Legal Arguments?

    Abstract page for arXiv paper 2605.02318: Can Causal Discovery Algorithms Help in Generating Legal Arguments?

    arXiv - AI · 6 days ago
  2. 2

    [2605.07444] Accelerated and data-efficient flow prediction in stirred tanks via physics-informed learning

    Abstract page for arXiv paper 2605.07444: Accelerated and data-efficient flow prediction in stirred tanks via physics-informed learning

    arXiv - AI · about 9 hours ago
  3. 3

    [2605.07462] The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

    Abstract page for arXiv paper 2605.07462: The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

    arXiv - AI · about 9 hours ago
  4. 4

    [2511.09907] Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis

    Abstract page for arXiv paper 2511.09907: Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis

    arXiv - AI · about 9 hours ago
  5. 5

    [P] QLoRA Fine-Tuning of Qwen2.5-1.5B for CEFR English Proficiency Classification (A1–C2) [P]

    I fine-tuned Qwen2.5-1.5B for multi-class CEFR English proficiency classification using QLoRA (4-bit NF4). The goal was to classify English text into one of the 6 CEFR levels (A1 → C2), which can b...

    Reddit - Machine Learning · 7 days ago
  6. 6

    [2407.04183] Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms

    Abstract page for arXiv paper 2407.04183: Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms

    arXiv - AI · about 9 hours ago
  7. 7

    [2602.02320] A Large-Scale Dataset for Molecular Structure-Language Description via a Rule-Regularized Method

    Abstract page for arXiv paper 2602.02320: A Large-Scale Dataset for Molecular Structure-Language Description via a Rule-Regularized Method

    arXiv - AI · about 9 hours ago
  8. 8

    [2605.01676] Missingness-aware Data Imputation via AI-powered Bayesian Generative Modeling

    Abstract page for arXiv paper 2605.01676: Missingness-aware Data Imputation via AI-powered Bayesian Generative Modeling

    arXiv - AI · 6 days ago
  9. 9

    How do you experiment with a (very) large model architecture? [D]

    Im trying to reproduce a paper (a very particular kind of diffusion model), and their training regime is incredibly compute heavy. In general, how are quick experiments performed to validate hypoth...

    Reddit - Machine Learning · 7 days ago
  10. 10

    [2602.05817] Interpreting Manifolds and Graph Neural Embeddings from Internet of Things Traffic Flows

    Abstract page for arXiv paper 2602.05817: Interpreting Manifolds and Graph Neural Embeddings from Internet of Things Traffic Flows

    arXiv - AI · 4 days ago
  11. 11

    [2605.00764] Modeling Subjective Urban Perception with Human Gaze

    Abstract page for arXiv paper 2605.00764: Modeling Subjective Urban Perception with Human Gaze

    arXiv - AI · 4 days ago
  12. 12

    [2605.04062] EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation

    Abstract page for arXiv paper 2605.04062: EdgeRazor: A Lightweight Framework for Large Language Models via Mixed-Precision Quantization-Aware Distillation

    arXiv - AI · 4 days ago
  13. 13

    [2605.04098] Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology

    Abstract page for arXiv paper 2605.04098: Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology

    arXiv - AI · 4 days ago
  14. 14

    [2605.04180] MedFabric and EtHER: A Data-Centric Framework for Word-Level Fabrication Generation and Detection in Medical LLMs

    Abstract page for arXiv paper 2605.04180: MedFabric and EtHER: A Data-Centric Framework for Word-Level Fabrication Generation and Detection in Medical LLMs

    arXiv - AI · 4 days ago
  15. 15

    [2605.04501] Example-Based Object Detection

    Abstract page for arXiv paper 2605.04501: Example-Based Object Detection

    arXiv - AI · 4 days ago
  16. 16

    [2605.04729] AISSA: Implementation and Deployment of an AI-based Student Slides Analysis tool for Academic Presentations

    Abstract page for arXiv paper 2605.04729: AISSA: Implementation and Deployment of an AI-based Student Slides Analysis tool for Academic Presentations

    arXiv - AI · 4 days ago
  17. 17

    [2605.04857] Assessing Cognitive Effort in L2 Idiomatic Processing: An Eye-Tracking Dataset

    Abstract page for arXiv paper 2605.04857: Assessing Cognitive Effort in L2 Idiomatic Processing: An Eye-Tracking Dataset

    arXiv - AI · 4 days ago
  18. 18

    [2605.00116] ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

    Abstract page for arXiv paper 2605.00116: ViLegalNLI: Natural Language Inference for Vietnamese Legal Texts

    arXiv - AI · 7 days ago
  19. 19

    [2605.03410] Geometry over Density: Few-Shot Cross-Domain OOD Detection

    Abstract page for arXiv paper 2605.03410: Geometry over Density: Few-Shot Cross-Domain OOD Detection

    arXiv - AI · 4 days ago
  20. 20

    [2605.08019] Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners

    Abstract page for arXiv paper 2605.08019: Reason to Play: Behavioral and Brain Alignment Between Frontier LRMs and Human Game Learners

    arXiv - AI · about 9 hours ago

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime