[D] thoughts on the controversy about Google's new paper?
Openreview: https://openreview.net/forum?id=tO3ASKZlok It's sad to see almost no one mention this on Reddit and people are being mean to ...
GPUs, training clusters, MLOps, and deployment
Openreview: https://openreview.net/forum?id=tO3ASKZlok It's sad to see almost no one mention this on Reddit and people are being mean to ...
New blog post by Daniel Vega-Myhre (Meta/PyTorch) illustrating GEMM design for FP8, including deep-dives into all the constraints and des...
UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...
Abstract page for arXiv paper 2603.19375: Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents
Abstract page for arXiv paper 2603.19285: Beam-aware Kernelized Contextual Bandits for User Association and Beamforming in mmWave Vehicul...
Abstract page for arXiv paper 2603.19277: MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering
Abstract page for arXiv paper 2603.19261: Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword ...
Abstract page for arXiv paper 2603.20037: Federated Hyperdimensional Computing for Resource-Constrained Industrial IoT
Abstract page for arXiv paper 2603.20036: Continual Learning as Shared-Manifold Continuation Under Compatible Shift
Abstract page for arXiv paper 2603.20014: AgenticRS-EnsNAS: Ensemble-Decoupled Self-Evolving Architecture Search
Abstract page for arXiv paper 2603.20009: A Super Fast K-means for Indexing Vector Embeddings
Abstract page for arXiv paper 2603.19864: NASimJax: GPU-Accelerated Policy Learning Framework for Penetration Testing
Abstract page for arXiv paper 2603.19742: Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target ...
Abstract page for arXiv paper 2603.19611: Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL
Abstract page for arXiv paper 2603.19360: Warm-Start Flow Matching for Guaranteed Fast Text/Image Generation
Abstract page for arXiv paper 2603.19338: DAPA: Distribution Aware Piecewise Activation Functions for On-Device Transformer Inference and...
Abstract page for arXiv paper 2603.19331: FalconBC: Flow matching for Amortized inference of Latent-CONditioned physiologic Boundary Cond...
Abstract page for arXiv paper 2603.19296: TTQ: Activation-Aware Test-Time Quantization to Accelerate LLM Inference On The Fly
Abstract page for arXiv paper 2603.18377: PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents
Abstract page for arXiv paper 2603.18062: S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition
Abstract page for arXiv paper 2504.09775: Understanding and Optimizing Multi-Stage AI Inference Pipelines
Abstract page for arXiv paper 2502.19095: Cross-site scripting adversarial attacks based on deep reinforcement learning: Evaluation and e...
Abstract page for arXiv paper 1709.09051: Exact MAP inference in general higher-order graphical models using linear programming
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime