JeanLucDmoz
Active Member
- Joined
- Sep 29, 2010
- Messages
- 31
Hi,
I downloaded http://rdf.dmoz.org/rdf/structure.rdf.u8.gz and http://rdf.dmoz.org/rdf/categories.txt (and other files that contain DMOZ categories), but all foreign characters are replaced by one or two question marks.
Here is an example of what I get :
where I expect
I inspected the binary content of the file and it really contains hexadecimal 3F where there is a question mark. So I guess this is not a matter of encoding method.
This problem does not exist with the sample at http://www.dmoz.org/docs/en/rdf/structure.example.txt .
As I am new with ODP data, I could have misunderstood something. Please help me sort this out.
Jean-Luc
I downloaded http://rdf.dmoz.org/rdf/structure.rdf.u8.gz and http://rdf.dmoz.org/rdf/categories.txt (and other files that contain DMOZ categories), but all foreign characters are replaced by one or two question marks.
Here is an example of what I get :
Code:
<altlang r:resource="French:Top/World/Fran??ais/Arts/Audiovisuel/Animation"></altlang>
Code:
<altlang r:resource="French:Top/World/Français/Arts/Audiovisuel/Animation"></altlang>
This problem does not exist with the sample at http://www.dmoz.org/docs/en/rdf/structure.example.txt .
As I am new with ODP data, I could have misunderstood something. Please help me sort this out.
Jean-Luc