What steps will reproduce the problem?

  1. Assign two imaginary names to two streets: first ["name"="نهج الشمس"] (Sun St.) and second ["name"="نهج القمر"] (Moon St.)
  2. Run validation

What is the expected result?

No warning, as the names are completely different, at least if you read Arabic :)

What happens instead?

I get a warning about two similarly named ways: "نهج الشمس", "نهج القمر"

Please provide any additional information below. Attach a screenshot if possible.

comment:1 by skyper, 4 years ago

Keywords: Arabic added
Summary: changed from "The SimilarNamedWays plugins reports false positive on Arabic street names" to "The SimilarNamedWays test reports false positive on Arabic street names"

comment:2 by Don-vip, 4 years ago

Keywords: i18n added

by Don-vip, 4 years ago

Attachment: 20916_worksforme.osm added

comment:3 by Don-vip, 4 years ago

Owner: changed from team to selimachour@…
Status: changed from new to needinfo

Can't reproduce with the dataset I created based on your examples. Can you please provide a small sample showing the problem?

by selimachour@…, 4 years ago

Attachment: similar_named_ways.osm added

comment:4 by anonymous, 4 years ago

Strangely, starting from a blank layer and adding two streets with the example names didn't produce the bug, but downloading some OSM data still leads to the same problem.
I attached the .osm file for which my version 17919 still warns about similarly named ways.

comment:5 by anonymous, 4 years ago

Owner: changed from selimachour@… to team
Status: changed from needinfo to new

comment:6 by Don-vip, 4 years ago

Milestone: 21.07
Owner: changed from team to Don-vip
Status: changed from new to assigned

comment:7 by Don-vip, 4 years ago

Milestone: 21.07
Owner: changed from Don-vip to team
Status: changed from assigned to new

Thanks, I can reproduce, but unfortunately I have no idea how to fix that. This is not linked to the names being in Arabic; the test works as expected.

1st name: U+0627 U+0644 U+0634 U+0645 U+0633
2nd name: U+0627 U+0644 U+0642 U+0645 U+0631

The Levenshtein distance is only 2, which is the threshold used to detect a similarity.
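
For anyone who wants to check the distance independently, here is a minimal sketch in Python. It is a textbook dynamic-programming Levenshtein implementation, not the validator's actual code, and the function name `levenshtein` is just an illustrative choice. The two strings are built from the code points listed above.

```python
def levenshtein(a: str, b: str) -> int:
    """Textbook edit distance, counted over Unicode code points."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution (or match)
            ))
        prev = curr
    return prev[-1]


# Code points copied from the comment above.
first = "\u0627\u0644\u0634\u0645\u0633"   # 1st name
second = "\u0627\u0644\u0642\u0645\u0631"  # 2nd name

print(levenshtein(first, second))  # 2
```

The two strings have the same length and differ at the third and fifth code points, so two substitutions are enough. With words this short, two differing letters can make them completely different to a reader while still meeting the threshold.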

by Don-vip, 4 years ago

Attachment: 20916.osm added

minimal test data to reproduce

