Opened 17 years ago
Closed 11 years ago
#518 closed defect (fixed)
[PATCH] Unicode normalization
Reported by: | Owned by: | framm | |
---|---|---|---|
Priority: | minor | Milestone: | 14.01 |
Component: | Core | Version: | latest |
Keywords: | Cc: |
Description (last modified by )
JOSM should normalize strings at input. They should also be normalized when searching.
For example inputing name="Rue de l'École" should end up the same as name="Rue de l'École". The first has "É" as U+0045 LATIN CAPITAL LETTER E + U+0301 COMBINING ACUTE ACCENT while the second has "É" U+00C9 LATIN CAPITAL LETTER E WITH ACUTE.
Searching for one should match the other.
See http://unicode.org/faq/normalization.html for more info.
java.text.Normalizer.normalize(string, java.text.Normalizer.Form.NFC) can be used when required.
NFC is probably better because it's better supported than NFD due to legacy.
Attachments (1)
Change History (5)
by , 17 years ago
Attachment: | josm-normalization.patch added |
---|
comment:1 by , 16 years ago
Summary: | Unicode normalizatin → [PATCH] Unicode normalization |
---|
comment:3 by , 11 years ago
Description: | modified (diff) |
---|---|
Milestone: | → 14.01 |
Priority: | trivial → minor |
Resolution: | fixed |
Status: | closed → reopened |
normalizing strings before comparison in SearchCompiler, and value in PropertiesDialog