[D] MXFP8 GEMM: Up to 99% of cuBLAS performance using CUDA + PTX
New blog post by Daniel Vega-Myhre (Meta/PyTorch) illustrating GEMM design for FP8, including deep-dives into all the constraints and des...
ML algorithms, training, and inference
News: The Continuing Education Programme (CEP) at IIT Delhi has announced the launch of the 8th batch of its Advanced Certificate Pr...
Chamco Digital, a recognized Microsoft AI and Cloud Technology Partner, announced the launch of its globally accessible Microsoft AI and ...
Abstract page for arXiv paper 2603.25268: CRAFT: Grounded Multi-Agent Coordination Under Partial Information
Abstract page for arXiv paper 2603.25403: Shape and Substance: Dual-Layer Side-Channel Attacks on Local Vision-Language Models
Abstract page for arXiv paper 2603.25253: MolQuest: A Benchmark for Agentic Evaluation of Abductive Reasoning in Chemical Structure Eluci...
Abstract page for arXiv paper 2603.25397: A Causal Framework for Evaluating ICU Discharge Strategies
Abstract page for arXiv paper 2603.25374: Supercharging Federated Intelligence Retrieval
Abstract page for arXiv paper 2603.25247: FEAST: Fully Connected Expressive Attention for Spatial Transcriptomics
Abstract page for arXiv paper 2603.25243: FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA
Abstract page for arXiv paper 2603.25311: Practical Efficient Global Optimization is No-regret
Abstract page for arXiv paper 2603.25226: WebTestBench: Evaluating Computer-Use Agents towards End-to-End Automated Web Testing
Abstract page for arXiv paper 2603.25216: A Wireless World Model for AI-Native 6G Networks
Abstract page for arXiv paper 2603.25257: Mitigating Evasion Attacks in Fog Computing Resource Provisioning Through Proactive Hardening
Abstract page for arXiv paper 2603.25209: Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
Abstract page for arXiv paper 2603.25196: A Decade-Scale Benchmark Evaluating LLMs' Clinical Practice Guidelines Detection and Adherence ...
Abstract page for arXiv paper 2603.25251: Does Explanation Correctness Matter? Linking Computational XAI Evaluation to Human Understanding
Abstract page for arXiv paper 2603.25187: Probing the Lack of Stable Internal Beliefs in LLMs
Abstract page for arXiv paper 2603.25229: An Image Dataset of Common Skin Diseases of Bangladesh and Benchmarking Performance with Machin...
Abstract page for arXiv paper 2603.25250: Activation Matters: Test-time Activated Negative Labels for OOD Detection with Vision-Language ...
Abstract page for arXiv paper 2603.25170: Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling
Abstract page for arXiv paper 2603.25164: PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented ...
Abstract page for arXiv paper 2603.25145: Learning to Rank Caption Chains for Video-Text Alignment