[2604.06191] Harf-Speech: A Clinically Aligned Framework for Arabic

[2604.06191] Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment

arXiv - AI April 09, 2026 3 min read

About this article

Abstract page for arXiv paper 2604.06191: Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment

Electrical Engineering and Systems Science > Audio and Speech Processing arXiv:2604.06191 (eess) [Submitted on 11 Mar 2026] Title:Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment Authors:Asif Azad, MD Sadik Hossain Shanto, Mohammad Sadat Hossain, Bdour Alwuqaysi, Sabri Boughorbel, Yahya Bokhari, Abdulrhman Aljouie, Ayah Othman Sindi, Ehsan Hoque View a PDF of the paper titled Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment, by Asif Azad and 8 other authors View PDF HTML (experimental) Abstract:Automated phoneme-level pronunciation assessment is vital for scalable speech therapy and language learning, yet validated tools for Arabic remain scarce. We present Harf-Speech, a modular system scoring Arabic pronunciation at the phoneme level on a clinical scale. It combines an MSA phonetizer, a fine-tuned speech-to-phoneme model, Levenshtein alignment, and a blended scorer using longest common subsequence and edit-distance metrics. We fine-tune three ASR architectures on Arabic phoneme data and benchmark them with zero-shot multimodal models; the best, OmniASR-CTC-1B-v2, achieves 8.92\% phoneme error rate. Three certified speech-language pathologists independently scored 40 utterances for clinical validation. Harf-Speech attains a Pearson correlation of 0.791 and ICC(2,1) of 0.659 with mean expert scores, outperforming existing end-to-end assessment frameworks. These results show Harf-Speech yields clini...

Originally published on April 09, 2026. Curated by AI News.

Machine Learning

Quantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]

Hi all, Stjepan from Manning here. The mods said it's fine if I post this here. I wanted to share a new MEAP (early access) release we th...

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”

What is the “personality” of an LLM? What actually differentiates models psychometrically? Since LLMs entered public use, researchers hav...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained | WIRED

Today on “Uncanny Valley,” we’re diving into recent reports that the Trump administration is considering an executive order that would es...

Wired - AI · 29 min · about 1 hour ago

Machine Learning

Feels like AI is entering its “infrastructure matters” phase

A year ago, most discussions were about which model was smartest. Now it increasingly feels like the bigger differentiators are becoming:...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

[2604.06191] Harf-Speech: A Clinically Aligned Framework for Arabic Phoneme-Level Speech Assessment

About this article

Related Articles

Quantization and Fast Inference (MEAP) - How much performance are you actually getting from quantization in production? [D]

We gave 45 psychological questionnaires to 50 LLMs. What we found was not “personality.”

Trump Pivots on AI Regulation, Worker Ousted by DOGE Runs for Office, and Hantavirus Explained | WIRED

Feels like AI is entering its “infrastructure matters” phase

No comments

Stay updated with AI News