#10393 closed enhancement (fixed)
LRM character on URLS
Reported by: | jgpacker | Owned by: | team |
---|---|---|---|
Priority: | normal | Milestone: | 14.09 |
Component: | Core validator | Version: | |
Keywords: | unicode lrm character url email | Cc: |
Description (last modified by )
If possible, JOSM should automatically remove the LRM character (an invisible unicode character used in some languages) from certain keys.
This character can be accidentally introduced by mappers that copy and paste information from other sources.
Usually this character is no big deal, but it can be nasty in tags like website=*
, contact:website=*
and similar ones, because it can corrupt their address.
Examples of such tags can be seen here.
Note that recently I removed this character from URLs across the globe (around 1200), and yet it is growing again (around 60 right now).
Fortunately it seems this character only appears in the end (and maybe at the beginning) of an URL.
I opened a similar issue for the iD editor: https://github.com/openstreetmap/iD/issues/2323
Alternatively, this job could be delegated to JOSM Validator
Attachments (0)
Change History (9)
comment:1 by , 10 years ago
comment:2 by , 10 years ago
Description: | modified (diff) |
---|
comment:4 by , 10 years ago
#9960 is related, although I was never able to reproduce the problem.
I approve removing these characters from URLs and e-mail addresses.
comment:5 by , 10 years ago
I said before that I only saw this invisible character on either the beginning or the end of the tag's value, but I found a counter-example.
This node have an email=*
tag with a LRM character both before and after the @
symbol.
comment:6 by , 10 years ago
Keywords: | unicode lrm character added |
---|
comment:7 by , 10 years ago
Component: | Core → Core validator |
---|---|
Milestone: | → 14.09 |
comment:9 by , 10 years ago
Keywords: | url email added |
---|
http://overpass-turbo.eu/s/3eJm yields 404 page