2026 Advanced Deep Learning Projects
As a hiring manager who’s been deep in the 2026 market, I wanted to share some real insights + a video I found that the community might f...
Text understanding and language tasks
As a hiring manager who’s been deep in the 2026 market, I wanted to share some real insights + a video I found that the community might f...
've been working on AI memory infrastructure and recently spent a few weeks reading through the source code of an open-source context-win...
Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instea...
The paper presents ULTRA, a transformer-based recommendation architecture tailored for the Urdu language, addressing challenges in semant...
This paper explores the use of Small Language Models (SLMs) for translating natural language queries into Kusto Query Language (KQL) in S...
The Tool Decathlon introduces a benchmark for evaluating language agents on diverse, realistic, and complex tasks, highlighting significa...
The paper presents RELOOP, a novel framework for recursive retrieval in heterogeneous question answering (QA) that enhances efficiency an...
AgentHub proposes a registry for AI agents that enhances discoverability, verifiability, and reproducibility, addressing gaps in current ...
The paper presents RepGen, an intelligent agent designed to automate the reproduction of deep learning bugs, achieving an 80.19% success ...
The paper introduces Temporal Sparse Autoencoders (T-SAEs), enhancing interpretability in language models by leveraging the sequential na...
This paper explores how language models (LMs) categorize event plausibility, revealing that LMs can reliably discern modal categories, wh...
The paper introduces DeepMartingale, a deep-learning framework addressing the dual formulation of optimal stopping problems, enhancing sc...
This article presents PathVis, a mixed-reality platform designed to enhance digital pathology workflows by integrating multimodal AI and ...
The paper explores how vision-language models can simulate dyslexia by disrupting word processing mechanisms, providing insights into rea...
This article evaluates the diversity and quality of content generated by large language models (LLMs), highlighting the trade-offs betwee...
The paper introduces ViT-Linearizer, a framework that distills knowledge from Vision Transformers (ViTs) into efficient linear-time model...
This paper presents MixCache, a novel caching framework designed to enhance the efficiency of text-to-video diffusion models, significant...
This article presents PoET-2, a multimodal retrieval-augmented protein foundation model that enhances protein function prediction and var...
This paper analyzes the interpolation error of nonlinear attention mechanisms compared to linear regression, revealing insights into thei...
The paper presents MomentMix, a novel augmentation technique using Length-Aware DETR to enhance video moment retrieval, particularly for ...
The paper presents VALTEST, a framework for validating test cases generated by large language models (LLMs) using semantic entropy, impro...
The G-reasoner paper introduces a unified framework that enhances reasoning over graph-structured knowledge using a new graph foundation ...
The paper introduces Versor, a novel geometric sequence architecture that leverages Conformal Geometric Algebra for enhanced performance ...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime