Diffusion for generating/editing ASTs? [D]
I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitati...
GPT, Claude, Gemini, and other LLMs
I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitati...
OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assign an emergency contact for mental health and s...
what I mean is that every time I use Claude, or Grok or any of the AI platforms and tools, I realize how far this technology is from repl...
Abstract page for arXiv paper 2510.15863: PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction
Abstract page for arXiv paper 2510.20264: Optimistic Task Inference for Behavior Foundation Models
Abstract page for arXiv paper 2510.21314: A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization
Abstract page for arXiv paper 2510.13888: Reliable Fine-Grained Evaluation of Natural Language Math Proofs
Abstract page for arXiv paper 2510.17206: Soft-Masked Diffusion Language Models
Abstract page for arXiv paper 2510.18245: Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
Abstract page for arXiv paper 2510.10066: OBsmith: LLM-Powered JavaScript Obfuscator Testing
Abstract page for arXiv paper 2510.09462: Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Abstract page for arXiv paper 2510.13117: On the Reasoning Abilities of Masked Diffusion Language Models
Abstract page for arXiv paper 2510.07940: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
Abstract page for arXiv paper 2510.06377: Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
Abstract page for arXiv paper 2510.06292: ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations
Abstract page for arXiv paper 2510.05064: Boomerang Distillation Enables Zero-Shot Model Size Interpolation
Abstract page for arXiv paper 2510.05174: Emergent Coordination in Multi-Agent Language Models
Abstract page for arXiv paper 2510.05132: Training Large Language Models To Reason In Parallel With Global Forking Tokens
Abstract page for arXiv paper 2510.05069: SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs
Abstract page for arXiv paper 2510.05109: Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on B...
Abstract page for arXiv paper 2510.04682: TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Abstract page for arXiv paper 2510.04067: What Scales in Cross-Entropy Scaling Law?
Abstract page for arXiv paper 2510.02209: StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime