AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

Washington needs AI guardrails — now | Opinion
AI Safety

We need legislation that draws clear lines on what AI systems may and may not do on behalf of the United States government.

AI Tools & Products · 3 min ·
[2601.12910] SciCoQA: Quality Assurance for Scientific Paper--Code Alignment
AI Safety

arXiv - AI · 3 min ·
[2509.21385] Debugging Concept Bottleneck Models through Removal and Retraining
Machine Learning

arXiv - Machine Learning · 4 min ·

All Content

[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal
Machine Learning

arXiv - Machine Learning · 4 min ·
[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN
Machine Learning

arXiv - Machine Learning · 4 min ·
[2411.15087] Phrase-Instance Alignment for Generalized Referring Segmentation
Machine Learning

arXiv - Machine Learning · 3 min ·
[2511.18178] Bayesian Calibration of Engine-out NOx Models for Engine-to-Engine Transferability
Machine Learning

arXiv - Machine Learning · 4 min ·
[2511.04854] SigmaDock: Untwisting Molecular Docking With Fragment-Based SE(3) Diffusion
Generative AI

arXiv - Machine Learning · 4 min ·
[2510.06020] RamPINN: Recovering Raman Spectra From Coherent Anti-Stokes Spectra Using Embedded Physics
Machine Learning

arXiv - Machine Learning · 4 min ·
[2510.00430] PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
Machine Learning

arXiv - Machine Learning · 4 min ·
[2509.14181] Bridging Past and Future: Distribution-Aware Alignment for Time Series Forecasting
NLP

arXiv - Machine Learning · 4 min ·
[2407.01111] Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation
AI Safety

arXiv - Machine Learning · 4 min ·
[2210.11039] Entire Space Counterfactual Learning for Reliable Content Recommendations
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.24580] Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA
NLP

arXiv - Machine Learning · 4 min ·
[2603.24209] HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.23835] Beyond Consistency: Inference for the Relative risk functional in Deep Nonparametric Cox Models
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.23583] ZeroFold: Protein-RNA Binding Affinity Predictions from Pre-Structural Embeddings
NLP

arXiv - Machine Learning · 4 min ·
[2603.24384] On the Use of Bagging for Local Intrinsic Dimensionality Estimation
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.24275] Language-Assisted Image Clustering Guided by Discriminative Relational Signals and Adaptive Semantic Centers
LLMs

arXiv - Machine Learning · 3 min ·
[2603.24265] DeepDTF: Dual-Branch Transformer Fusion for Multi-Omics Anticancer Drug Response Prediction
Machine Learning

arXiv - Machine Learning · 4 min ·
[2603.24124] The Alignment Tax: Response Homogenization in Aligned LLMs and Its Implications for Uncertainty Estimation
LLMs

arXiv - Machine Learning · 4 min ·
[2603.23889] Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration
AI Safety

arXiv - Machine Learning · 3 min ·
[2603.23783] Probabilistic Geometric Alignment via Bayesian Latent Transport for Domain-Adaptive Foundation Models
LLMs

arXiv - Machine Learning · 4 min ·