Incorrect character translation on download

hughprior

I have a problem with all accented characters not downloading correctly in the structure.rdf.u8 dump.

Below are some of the affected entries (assuming the characters display here the way they do for me). For example, "Français" shows up as "FranÃ§ais".

<narrow r:resource="Top/World/Español"/>
<narrow r:resource="Top/World/Français"/>
<narrow r:resource="Top/World/Românã"/>
<narrow r:resource="Top/World/Català"/>

Any help would be appreciated. Thanks!
 

sfromis

Member, joined Mar 25, 2002, 202 messages
Try setting the encoding to UTF-8 in your browser; then you'll see that the characters are correct. If you view the text in ISO-8859-1 (or any other encoding that is not UTF-8), you'll see unreadable characters instead.

As specified in the description of the RDF dumps, the data is in Unicode UTF-8 format. If you handle the data with tools that are not Unicode-aware, the accented characters will look garbled.
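
To illustrate, here is a minimal Python sketch of the same effect. The sample line is copied from the question rather than read from the real dump, and `read_dump_lines` is a hypothetical helper name, so treat this as an assumption-laden example rather than the official way to process the file.

```python
# One line as it would be stored in a UTF-8 file (sample taken from the question).
raw = '<narrow r:resource="Top/World/Français"/>'.encode("utf-8")

# Decoding with the right encoding gives the expected text.
correct = raw.decode("utf-8")

# Decoding the same bytes as ISO-8859-1 turns each multi-byte
# character into two unrelated characters.
garbled = raw.decode("iso-8859-1")

print(correct)
print(garbled)

# Text that was garbled this way can usually be repaired by reversing the
# wrong decode, as long as no bytes were lost in between.
repaired = garbled.encode("iso-8859-1").decode("utf-8")


def read_dump_lines(path):
    """Yield lines from a dump file, decoding it explicitly as UTF-8.

    Hypothetical helper: the path is whatever local copy of the dump you have.
    """
    with open(path, encoding="utf-8") as f:
        for line in f:
            yield line.rstrip("\n")
```

If your own tool lets you choose the input encoding, setting it to UTF-8 explicitly (as `open(..., encoding="utf-8")` does here) should avoid the problem, since many tools otherwise fall back to a platform default.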
 
This site has been archived and is no longer accepting new content.