Phone screen: Microsoft AI Principal MLE
submitted by /u/sustain-able-tea [link] [comments]
ML algorithms, training, and inference
submitted by /u/sustain-able-tea [link] [comments]
We've been building Caliber to solve AI agent configuration management and released our full setup as open source. The response has been ...
Over the past year our team and community have been building an open-source collection of AI agent configs: production-ready system promp...
Abstract page for arXiv paper 2604.04157: Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents
Abstract page for arXiv paper 2604.04145: Solar-VLM: Multimodal Vision-Language Models for Augmented Solar Power Forecasting
Abstract page for arXiv paper 2604.04131: Profile-Then-Reason: Bounded Semantic Complexity for Tool-Augmented Language Agents
Abstract page for arXiv paper 2604.04106: InsTraj: Instructing Diffusion Models with Travel Intentions to Generate Real-world Trajectories
Abstract page for arXiv paper 2604.03976: Quantifying Trust: Financial Risk Management for Trustworthy AI Agents
Abstract page for arXiv paper 2604.03898: LLM-Agent-based Social Simulation for Attitude Diffusion
Abstract page for arXiv paper 2604.03888: PolySwarm: A Multi-Agent Large Language Model Framework for Prediction Market Trading and Laten...
Abstract page for arXiv paper 2604.03893: FeynmanBench: Benchmarking Multimodal LLMs on Diagrammatic Physics Reasoning
Abstract page for arXiv paper 2604.03820: Affording Process Auditability with QualAnalyzer: An Atomistic LLM Analysis Tool for Qualitativ...
Abstract page for arXiv paper 2604.03742: Structured Multi-Criteria Evaluation of Large Language Models with Fuzzy Analytic Hierarchy Pro...
Abstract page for arXiv paper 2604.03675: PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training
Abstract page for arXiv paper 2604.03660: TableVision: A Large-Scale Benchmark for Spatially Grounded Reasoning over Complex Hierarchical...
Abstract page for arXiv paper 2604.03656: Beyond Retrieval: Modeling Confidence Decay and Deterministic Agentic Platforms in Generative E...
Abstract page for arXiv paper 2604.03631: Single-agent vs. Multi-agents for Automated Video Analysis of On-Screen Collaborative Learning ...
Abstract page for arXiv paper 2604.03630: A Multimodal Foundation Model of Spatial Transcriptomics and Histology for Biological Discovery...
Abstract page for arXiv paper 2604.03589: Entropy and Attention Dynamics in Small Language Models: A Trace-Level Structural Analysis on t...
Abstract page for arXiv paper 2604.03571: Selective Forgetting for Large Reasoning Models
Abstract page for arXiv paper 2604.03557: When Do Hallucinations Arise? A Graph Perspective on the Evolution of Path Reuse and Path Compr...
Abstract page for arXiv paper 2604.03527: Explainable Model Routing for Agentic Workflows
Abstract page for arXiv paper 2604.03524: Structural Rigidity and the 57-Token Predictive Window: A Physical Framework for Inference-Laye...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime