[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation
Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation
GPUs, training clusters, MLOps, and deployment
Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation
Abstract page for arXiv paper 2602.07374: TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Lay...
Abstract page for arXiv paper 2512.11798: Particulate: Feed-Forward 3D Object Articulation
Abstract page for arXiv paper 2603.21720: SemEval-2026 Task 12: Abductive Event Reasoning: Towards Real-World Event Causal Inference for ...
Abstract page for arXiv paper 2603.21701: Rethinking Token Reduction for Large Vision-Language Models
Abstract page for arXiv paper 2603.21661: Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-...
Abstract page for arXiv paper 2603.21610: Rule-State Inference (RSI): A Bayesian Framework for Compliance Monitoring in Rule-Governed Dom...
Abstract page for arXiv paper 2603.21576: PRISM: Breaking the O(n) Memory Wall in Long-Context LLM Inference via O(1) Photonic Block Sele...
Abstract page for arXiv paper 2603.21508: Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences
Abstract page for arXiv paper 2603.21461: DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment
Abstract page for arXiv paper 2603.21301: enhancing reasoning accuracy in large language models during inference time
Abstract page for arXiv paper 2603.21280: WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making
Abstract page for arXiv paper 2603.21175: Reward Sharpness-Aware Fine-Tuning for Diffusion Models
Abstract page for arXiv paper 2603.21135: One Pool Is Not Enough: Multi-Cluster Memory for Practical Test-Time Adaptation
Abstract page for arXiv paper 2603.21095: Representation-Level Adversarial Regularization for Clinically Aligned Multitask Thyroid Ultras...
Abstract page for arXiv paper 2603.21084: ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural...
Abstract page for arXiv paper 2603.21045: LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction
Abstract page for arXiv paper 2603.21016: Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO
Abstract page for arXiv paper 2603.20980: From Causal Discovery to Dynamic Causal Inference in Neural Time Series
Abstract page for arXiv paper 2603.20957: Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Lan...
Abstract page for arXiv paper 2603.20920: Democratizing AI: A Comparative Study in Deep Learning Efficiency and Future Trends in Computat...
Abstract page for arXiv paper 2603.20899: Mitigating Shortcut Reasoning in Language Models: A Gradient-Aware Training Approach
Abstract page for arXiv paper 2603.20882: RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime