Thread:User talk:CodeCat/Script recognition module/reply (5)

You're right. A letter like "C" probably belongs to both Latn and Latinx. The same problem would probably happen with pa-Arab, ota-Arab, etc. if we had similar categories for the Arabic script.

Maybe it's not feasible, but can the module iterate over all scripts while giving priority to the 4-letter script codes? If it finds a match in Latn or Arab, it stops the search and never gets to Latinx or fa-Arab.
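
To make the idea concrete, here is a rough sketch in Python (not the module's actual code). The `SCRIPT_CHARS` table and the `find_script` function are made up for illustration, and the character sets are tiny placeholders; I'm only assuming that each script code maps to some set of characters.

```python
import string

# Hypothetical character tables, for illustration only.
# The real module's script data is not reproduced here.
SCRIPT_CHARS = {
    "Latn": set(string.ascii_letters),
    "Latinx": set(string.ascii_letters) | {"\u01bf", "\u021d"},  # adds wynn, yogh
    "Arab": {"\u0627", "\u0628", "\u062a"},  # alef, beh, teh
    "fa-Arab": {"\u0627", "\u0628", "\u062a", "\u067e", "\u06af"},  # adds peh, gaf
}


def find_script(text, candidates):
    """Return the first candidate script containing a character of text.

    Candidates with 4-letter codes (Latn, Arab, ...) are tried first.
    sorted() is stable, so the original order is kept within each group.
    """
    ordered = sorted(candidates, key=lambda code: len(code) != 4)
    for code in ordered:
        chars = SCRIPT_CHARS[code]
        if any(ch in chars for ch in text):
            return code  # stop at the first match; later scripts are skipped
    return None
```

With this ordering, "C" resolves to Latn even if Latinx comes first in the candidate list, while a character found only in Latinx or fa-Arab still falls through to those.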

Or maybe just give priority to Latn over Latinx and forget Arab and the others unless they become a problem at some point.