India advancing AI-ready public data infrastructure for smarter governance
India has reported significant progress in developing artificial intelligence (AI)-ready public data infrastructure, with a range of digi...
Alignment, bias, regulation, and responsible AI
I've been working on an alternative to attention-based sequence modeling that I'm calling Geometric Flow Networks (GFN). The core idea: i...
Hi, r/MachineLearning: has much research been done in large-scale training scenarios where undesirable data has been replaced before trai...
Abstract page for arXiv paper 2603.04986: Debiasing Sequential Recommendation with Time-aware Inverse Propensity Scoring
Abstract page for arXiv paper 2603.04976: 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Abstract page for arXiv paper 2603.04968: When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
Abstract page for arXiv paper 2603.04905: Deterministic Preprocessing and Interpretable Fuzzy Banding for Cost-per-Student Reporting from...
Abstract page for arXiv paper 2603.04676: Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks
Abstract page for arXiv paper 2603.04421: Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?
Abstract page for arXiv paper 2603.04410: SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models
Abstract page for arXiv paper 2603.04407: Semantic Containment as a Fundamental Property of Emergent Misalignment
Abstract page for arXiv paper 2603.05485: Towards Provably Unbiased LLM Judges via Bias-Bounded Evaluation
Abstract page for arXiv paper 2603.05295: WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces
Abstract page for arXiv paper 2603.05040: Enhancing Zero-shot Commonsense Reasoning by Integrating Visual Knowledge via Machine Imagination
Abstract page for arXiv paper 2603.05027: S5-SHB Agent: Society 5.0 enabled Multi-model Agentic Blockchain Framework for Smart Home
Abstract page for arXiv paper 2603.04904: Alignment Backfire: Language-Dependent Reversal of Safety Interventions Across 16 Languages in ...
Abstract page for arXiv paper 2603.04837: Design Behaviour Codes (DBCs): A Taxonomy-Driven Layered Governance Benchmark for Large Languag...
Abstract page for arXiv paper 2603.04822: VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment
Abstract page for arXiv paper 2603.04746: Visioning Human-Agentic AI Teaming: Continuity, Tension, and Future Research
Abstract page for arXiv paper 2603.04631: Towards automated data analysis: A guided framework for LLM-based risk estimation
Abstract page for arXiv paper 2603.04582: Self-Attribution Bias: When AI Monitors Go Easy on Themselves
Abstract page for arXiv paper 2603.04514: Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding