Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min ·
Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.16876] ML-driven detection and reduction of ballast information in multi-modal datasets
Machine Learning

[2602.16876] ML-driven detection and reduction of ballast information in multi-modal datasets

This paper presents a framework for detecting and reducing ballast information in multi-modal datasets, enhancing machine learning effici...

arXiv - Machine Learning · 3 min ·
[2602.16837] A Residual-Aware Theory of Position Bias in Transformers
Machine Learning

[2602.16837] A Residual-Aware Theory of Position Bias in Transformers

This paper presents a residual-aware theory explaining the position bias in Transformers, revealing how residual connections prevent atte...

arXiv - Machine Learning · 3 min ·
[2602.16823] Formal Mechanistic Interpretability: Automated Circuit Discovery with Provable Guarantees
Machine Learning

[2602.16823] Formal Mechanistic Interpretability: Automated Circuit Discovery with Provable Guarantees

This article presents a novel approach to automated circuit discovery in neural networks, emphasizing provable guarantees for robustness ...

arXiv - Machine Learning · 4 min ·
[2602.16793] Escaping the Cognitive Well: Efficient Competition Math with Off-the-Shelf Models
Machine Learning

[2602.16793] Escaping the Cognitive Well: Efficient Competition Math with Off-the-Shelf Models

The paper presents a novel inference pipeline that leverages off-the-shelf models to solve International Mathematical Olympiad problems e...

arXiv - Machine Learning · 4 min ·
[2602.16784] Omitted Variable Bias in Language Models Under Distribution Shift
Llms

[2602.16784] Omitted Variable Bias in Language Models Under Distribution Shift

This paper explores omitted variable bias in language models under distribution shifts, proposing a framework to evaluate and optimize pe...

arXiv - Machine Learning · 3 min ·
[2602.16764] Machine Learning Argument of Latitude Error Model for LEO Satellite Orbit and Covariance Correction
Machine Learning

[2602.16764] Machine Learning Argument of Latitude Error Model for LEO Satellite Orbit and Covariance Correction

This article presents a machine learning model designed to correct latitude error in Low Earth Orbit (LEO) satellite propagation, enhanci...

arXiv - Machine Learning · 4 min ·
[2602.10993] LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules
Machine Learning

[2602.10993] LoRA-Squeeze: Simple and Effective Post-Tuning and In-Tuning Compression of LoRA Modules

The paper introduces LoRA-Squeeze, a method for improving Low-Rank Adaptation (LoRA) by allowing dynamic rank adjustments during training...

arXiv - AI · 4 min ·
[2602.07666] SoK: DARPA's AI Cyber Challenge (AIxCC): Competition Design, Architectures, and Lessons Learned
Llms

[2602.07666] SoK: DARPA's AI Cyber Challenge (AIxCC): Competition Design, Architectures, and Lessons Learned

This paper analyzes DARPA's AI Cyber Challenge (AIxCC), focusing on competition design, architectural approaches of finalists, and key le...

arXiv - AI · 4 min ·
[2601.06932] Symphonym: Universal Phonetic Embeddings for Cross-Script Name Matching
Nlp

[2601.06932] Symphonym: Universal Phonetic Embeddings for Cross-Script Name Matching

The paper presents Symphonym, a neural embedding system designed for cross-script name matching, mapping names into a unified phonetic sp...

arXiv - AI · 4 min ·
[2512.22213] On the Existence and Behavior of Secondary Attention Sinks
Machine Learning

[2512.22213] On the Existence and Behavior of Secondary Attention Sinks

This paper explores the concept of secondary attention sinks in machine learning models, highlighting their distinct properties and behav...

arXiv - AI · 4 min ·
[2512.11108] Explanation Bias is a Product: Revealing the Hidden Lexical and Position Preferences in Post-Hoc Feature Attribution
Llms

[2512.11108] Explanation Bias is a Product: Revealing the Hidden Lexical and Position Preferences in Post-Hoc Feature Attribution

This article explores the biases inherent in post-hoc feature attribution methods used in language models, revealing how lexical and posi...

arXiv - AI · 4 min ·
[2511.07989] State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?
Llms

[2511.07989] State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

This article evaluates the performance of language models in text classification tasks for South Slavic languages, comparing fine-tuned B...

arXiv - AI · 4 min ·
[2511.00794] Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
Llms

[2511.00794] Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration

This paper presents PREPO, a novel approach to enhance data efficiency in reinforcement learning for large language models by leveraging ...

arXiv - AI · 4 min ·
[2510.09201] Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
Llms

[2510.09201] Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

This article introduces the concept of multimodal prompt optimization for Multimodal Large Language Models (MLLMs), proposing a new frame...

arXiv - AI · 4 min ·
[2507.19634] MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
Llms

[2507.19634] MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks

The MCIF benchmark introduces a novel framework for evaluating multimodal crosslingual instruction-following capabilities in large langua...

arXiv - AI · 4 min ·
[2505.20650] FinTagging: Benchmarking LLMs for Extracting and Structuring Financial Information
Llms

[2505.20650] FinTagging: Benchmarking LLMs for Extracting and Structuring Financial Information

The paper introduces FinTagging, a benchmark for evaluating LLMs in extracting and structuring financial information, addressing limitati...

arXiv - AI · 4 min ·
[2601.15599] Autonomous Business System via Neuro-symbolic AI
Llms

[2601.15599] Autonomous Business System via Neuro-symbolic AI

The paper presents AUTOBUS, an Autonomous Business System that integrates LLM-based AI agents with predicate-logic programming to enhance...

arXiv - AI · 4 min ·
[2511.17673] Bridging Symbolic Control and Neural Reasoning in LLM Agents: Structured Cognitive Loop with a Governance Layer
Llms

[2511.17673] Bridging Symbolic Control and Neural Reasoning in LLM Agents: Structured Cognitive Loop with a Governance Layer

This article introduces the Structured Cognitive Loop (SCL) architecture for large language model (LLM) agents, addressing key architectu...

arXiv - AI · 4 min ·
[2508.12026] Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems
Machine Learning

[2508.12026] Bongard-RWR+: Real-World Representations of Fine-Grained Concepts in Bongard Problems

The paper presents Bongard-RWR+, a dataset designed to enhance fine-grained visual reasoning in Bongard Problems using real-world images ...

arXiv - Machine Learning · 4 min ·
[2505.08021] The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order Logic
Machine Learning

[2505.08021] The Correspondence Between Bounded Graph Neural Networks and Fragments of First-Order Logic

This paper explores the relationship between Bounded Graph Neural Networks (GNNs) and fragments of first-order logic, providing insights ...

arXiv - AI · 3 min ·
Previous Page 100 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest β€’ Unsubscribe anytime