[2602.16957] When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English

[2602.16957] When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English

arXiv - AI 3 min read Article

Summary

This study explores the challenges of cross-lingual euphemism transfer between Turkish and English, highlighting the limitations of semantic overlap in multilingual euphemism detection.

Why It Matters

Understanding euphemism transfer is crucial for improving natural language processing (NLP) models, especially in low-resource languages like Turkish. This research provides insights into how cultural and pragmatic contexts affect language processing, which can enhance multilingual applications and AI systems.

Key Takeaways

  • Semantic overlap does not guarantee effective euphemism transfer between languages.
  • The study categorizes euphemistic terms into overlapping and non-overlapping subsets.
  • Performance can degrade in low-resource language pairs, particularly Turkish to English.
  • Counterintuitive results suggest that training on non-overlapping euphemisms may improve outcomes.
  • Domain-specific alignment plays a role in euphemism transfer effectiveness.

Computer Science > Computation and Language arXiv:2602.16957 (cs) [Submitted on 18 Feb 2026] Title:When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English Authors:Hasan Can Biyik, Libby Barak, Jing Peng, Anna Feldman View a PDF of the paper titled When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English, by Hasan Can Biyik and Libby Barak and Jing Peng and Anna Feldman View PDF HTML (experimental) Abstract:Euphemisms substitute socially sensitive expressions, often softening or reframing meaning, and their reliance on cultural and pragmatic context complicates modeling across languages. In this study, we investigate how cross-lingual equivalence influences transfer in multilingual euphemism detection. We categorize Potentially Euphemistic Terms (PETs) in Turkish and English into Overlapping (OPETs) and Non-Overlapping (NOPETs) subsets based on their functional, pragmatic, and semantic alignment. Our findings reveal a transfer asymmetry: semantic overlap is insufficient to guarantee positive transfer, particularly in low-resource Turkish-to-English direction, where performance can degrade even for overlapping euphemisms, and in some cases, improve under NOPET-based training. Differences in label distribution help explain these counterintuitive results. Category-level analysis suggests that transfer may be influenced by domain-specific alignment, though evidence is limited by sparsity. Subject...

Related Articles

Machine Learning

[D] ICML Reviewer Acknowledgement

Hi, I'm a little confused about ICML discussion period Does the period for reviewer acknowledging responses have already ended? One of th...

Reddit - Machine Learning · 1 min ·
Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] ICML reviewer making up false claim in acknowledgement, what to do?

In a rebuttal acknowledgement we received, the reviewer made up a claim that our method performs worse than baselines with some hyperpara...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime