[D] Data curation and targeted replacement as a pre-training alignment and controllability method
Hi, r/MachineLearning: has much research been done in large-scale training scenarios where undesirable data has been replaced before trai...
Alignment, bias, regulation, and responsible AI
Hi, r/MachineLearning: has much research been done in large-scale training scenarios where undesirable data has been replaced before trai...
I’ve written an essay exploring what I’m calling the Super-Intelligent Octopus Problem—a thought experiment designed to surface a paradox...
AI bias is an anomaly in the output of ML algorithms due to prejudiced assumptions. Explore types of AI bias, examples, how to reduce bia...
Abstract page for arXiv paper 2603.19308: GT-Space: Enhancing Heterogeneous Collaborative Perception with Ground Truth Feature Space
Abstract page for arXiv paper 2603.19302: Parameter-Efficient Token Embedding Editing for Clinical Class-Level Unlearning
Abstract page for arXiv paper 2603.19273: LSR: Linguistic Safety Robustness Benchmark for Low-Resource West African Languages
An anonymous Substack post accuses compliance startup Delve of “falsely” convincing “hundreds of customers they were compliant” with priv...
An anonymous Substack post accuses compliance startup Delve of “falsely” convincing “hundreds of customers they were compliant” with priv...
NEXUS is an open-source market analysis AI that runs 3 automated sessions per day. It analyzes 45 financial instruments, generates trade ...
I have been experimenting with Heuristic-based Deliverability Intelligence to solve the "Month 2 Tanking" problem. The Data Science Chall...
Abstract page for arXiv paper 2510.18120: Generalization Below the Edge of Stability: The Role of Data Geometry
Abstract page for arXiv paper 2509.05609: New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Tr...
Abstract page for arXiv paper 2508.18088: How Quantization Shapes Bias in Large Language Models
Abstract page for arXiv paper 2510.17276: Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems
Abstract page for arXiv paper 2509.25762: OPPO: Accelerating PPO-based RLHF via Pipeline Overlap
Abstract page for arXiv paper 2508.04899: Honest and Reliable Evaluation and Expert Equivalence Testing of Automated Neonatal Seizure Det...
Abstract page for arXiv paper 2412.20298: An Experimental Study on Fairness-aware Machine Learning for Credit Scoring Problems
Abstract page for arXiv paper 2603.05226: Learning Optimal Individualized Decision Rules with Conditional Demographic Parity
Abstract page for arXiv paper 2603.05157: The Impact of Preprocessing Methods on Racial Encoding and Model Robustness in CXR Diagnosis
Abstract page for arXiv paper 2603.04895: How Does the ReLU Activation Affect the Implicit Bias of Gradient Descent on High-dimensional N...
Abstract page for arXiv paper 2603.04807: The Inductive Bias of Convolutional Neural Networks: Locality and Weight Sharing Reshape Implic...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime