dataferret Posted January 7, 2005 Posted January 7, 2005 Hi All I have spent the past four weeks grappling with the huge RDF dumps and have managed to parse the ODP data, dump it into a mysql database, convert this into a decent schema and generally clean up the data. I have been working with structure.rdf because it is the smaller file of the two. Now I have the data in a database, I find there are strange characters in the World sub-categories. These are characters from different languages which mysql does not seem to understand - replacing them with ?* instead. How do I overcome this problem? Does anyone have any thoughts? Thanks
Editall Callimachus Posted January 7, 2005 Editall Posted January 7, 2005 When using MySQL's UTF-8 character set, besides setting the database character set to UTF-8, you must remember to send SET NAMES 'utf8' to MySQL prior to querying or inserting. The following reference may be of some help: http://www.informit.com/articles/article.asp?p=328641&seqNum=6 ODP Editor callimachus Any opinions expressed are my own, and do not represent an official opinion or communication from the ODP. Private messages asking for submission status or preferential treatment will be ignored.
dataferret Posted January 9, 2005 Author Posted January 9, 2005 Thanks for the information and responding to my post. I have looked over the information at the link you suggested - I am not sure what it all means but it does look like it will solve the problem. Now I just need to figure out what the hell they are talking about
dataferret Posted January 13, 2005 Author Posted January 13, 2005 I have tried to set the database to utf8 and the columns to utf8 too but am still getting problems with invalid characters. Depending on which computer I use the following commands either work or return an error: # Create the database - use utf8 CREATE DATABASE dmoz CHARACTER SET utf8; # if database is already created alter to use utf8 ALTER DATABASE dmoz DEFAULT CHARACTER SET utf8; Can anyone point me in the right direction?
lonuncavisto Posted April 22, 2005 Posted April 22, 2005 I have the same problem. Tried everything but i cannot get the Russian categories correctly displayed. Does anyone know the solution ? Thank you in advance
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now