dzieciou Posted February 27, 2005

Hello!

I am currently taking part in a university project that aims to combine the use of software agents with Semantic Web data (specifically the Chefmoz RDF dump, which is part of ODP). I'm sure this is not a new problem, but it is very important to me and my team.

1. If this is the wrong place, please let me know where I can get help with the syntax of the Chefmoz RDF dump.

2. I have tried to parse the RDF data from chefmoz.org many times, and on every attempt I had to fix some problems manually first. The problems include inconsistent use of the UTF-8 charset, misspelled XML entities, an incorrect RDF schema (it fails checking by the W3C RDF Validator and the Jena OWL Syntax Checker), and missing namespaces. I know this is an everyday problem for Chefmoz editors and that the "manual approach" to fixing it is common. Do you have any tools to correct it?

3. The RDF data dump is inconsistent and does not always agree with the sketch of the Chefmoz data ontology (namespace: http://chefmoz.org/rdf/elements/1.0/); for example, review elements are often used within the description of a restaurant. I have talked to a few programmers who use your data on their sites, and they wonder why objective data (restaurant descriptions) has not been split from subjective data (reviews) and other data (external pages).

Thanks in advance for any help or suggestions. I appreciate your great work, and I'm sure that validating the syntax of the data would be an advantage to us all.

Best regards,
Maciej Gawinecki
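For the clean-up described in point 2 above, here is a minimal Python sketch of a pre-processing pass that could run before an XML/RDF parser sees the dump. It is not a Chefmoz tool. The function names (`decode_line`, `fix_entities`, `clean_line`) are hypothetical, and the Windows-1252 fallback is only a guess at how the non-UTF-8 bytes were encoded.

```python
import re
from html.entities import name2codepoint

# XML itself predefines only these five named entities.
XML_ENTITIES = {"amp", "lt", "gt", "quot", "apos"}

# Something that looks like a complete entity or character reference.
_REF = re.compile(r"&(#[0-9]+|#x[0-9A-Fa-f]+|[A-Za-z][A-Za-z0-9]*);")
# An ampersand that does NOT start such a reference.
_BARE_AMP = re.compile(r"&(?!(?:#[0-9]+|#x[0-9A-Fa-f]+|[A-Za-z][A-Za-z0-9]*);)")


def decode_line(raw: bytes) -> str:
    """Decode one line as UTF-8; fall back to Windows-1252 (an assumption)."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode("cp1252", errors="replace")


def _fix_reference(match: "re.Match[str]") -> str:
    name = match.group(1)
    if name.startswith("#") or name in XML_ENTITIES:
        return match.group(0)          # already valid in plain XML
    codepoint = name2codepoint.get(name)
    if codepoint is not None:
        return f"&#{codepoint};"       # HTML name -> numeric reference
    return f"&amp;{name};"             # unknown name: keep it as literal text


def fix_entities(text: str) -> str:
    """Escape stray ampersands and replace HTML-only entity names."""
    text = _BARE_AMP.sub("&amp;", text)
    return _REF.sub(_fix_reference, text)


def clean_line(raw: bytes) -> str:
    """Decode a raw line of the dump and repair its entity references."""
    return fix_entities(decode_line(raw))


if __name__ == "__main__":
    print(clean_line(b"Fish & Chips &pound;5"))
```

A pass like this only addresses charset and entity problems. Schema and namespace problems still need separate handling.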
dzieciou (Author) Posted February 27, 2005

Other people about this problem

This comes from: http://esw.w3.org/topic/RestaurantRecommendation

[...] The first example above (the restaurant description) already has a few basic fixes to the RDF/XML to make it parse as sensible, modern RDF.

* £ written as # (because didn't know the numeric character reference nor where to look to find it. Clues please! --DanBri) (&#163; according to the HTML DTD)
* some syntax fixes (rdf: prefix, namespace URI etc)
* the rdf:id is now written cm:id, since this isn't usefully part of a URI
* Dublin Core title is lowercase, i.e. 'd:title' here

Other changes can wait (e.g. 'Phone' should use URI), and are more stylistic and pragmatic in nature: perhaps we might consider using GeoInfo or RdfCalendar techniques to better represent the details of place and time, or FOAF to represent contact and other details, 'who knows who' etc amongst reviewers.

* anyone tried parsing it yet? - chefmoz as distributed doesn't quite parse as RDF or even (char encoding issues) XML
* The toplevel RDF element is not namespace prefixed, and so is in the default namespace (theirs)
* they use the older http://purl.org/dc/elements/1.0/ URI for Dublin Core, and write 'Title' instead of 'title'; recent DC applications almost all use http://purl.org/dc/elements/1.1/ and lowercase property names.
* If the intention is to write what is often represented as 'rdf:ID', then their r:id element should be r:ID, with r bound to http://www.w3.org/1999/02/22-rdf-syntax-ns# instead of http://www.w3.org/TR/1999/REC-rdf-syntax-19990222 as it is currently
* the namespace URI ends in a slash, like DC, RSS, FOAF vocabs. This bothers some people (LinkMe)
* the parsed opening hours property -- can we try converting this to iCal/RDF markup?
* If there was a cleaned up version of this, I'd love to encourage folk to use such markup alongside their FOAF self-descriptions. --DanBri
* Factual errors in the data: Sudaca is in the Redland area of Bristol, not Cotham! What's the process for fixing this?

[...]
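The namespace fixes listed in the quoted notes can be sketched as a simple textual rewrite. This is only an illustration under stated assumptions: the function name `modernise_namespaces` and the sample document are hypothetical, the prefixes `r:` and `d:` are taken from the notes above rather than checked against the real dump, and lowercasing only the first letter assumes single-word Dublin Core element names such as `Title`.

```python
import re

OLD_RDF_NS = "http://www.w3.org/TR/1999/REC-rdf-syntax-19990222"
NEW_RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
OLD_DC_NS = "http://purl.org/dc/elements/1.0/"
NEW_DC_NS = "http://purl.org/dc/elements/1.1/"


def modernise_namespaces(xml_text: str) -> str:
    """Rewrite the old RDF and Dublin Core namespace URIs in raw XML text."""
    # Remember which prefix was bound to the old Dublin Core namespace.
    binding = re.search(
        r"xmlns:([A-Za-z_][\w.-]*)\s*=\s*[\"']" + re.escape(OLD_DC_NS) + r"[\"']",
        xml_text,
    )

    # The old RDF URI may or may not carry a trailing '#'; accept both.
    xml_text = re.sub(re.escape(OLD_RDF_NS) + r"#?", NEW_RDF_NS, xml_text)
    xml_text = xml_text.replace(OLD_DC_NS, NEW_DC_NS)

    if binding:
        prefix = re.escape(binding.group(1))
        # <d:Title> and </d:Title> become <d:title> and </d:title>.
        xml_text = re.sub(
            r"(</?" + prefix + r":)([A-Z])",
            lambda m: m.group(1) + m.group(2).lower(),
            xml_text,
        )
    return xml_text


if __name__ == "__main__":
    # Hypothetical sample, not copied from the real dump.
    sample = (
        '<RDF xmlns:r="http://www.w3.org/TR/1999/REC-rdf-syntax-19990222#" '
        'xmlns:d="http://purl.org/dc/elements/1.0/">'
        "<d:Title>Sudaca</d:Title></RDF>"
    )
    print(modernise_namespaces(sample))
```

A text-level rewrite is used here because the dump reportedly does not parse as XML in the first place. Once it parses, the same changes are safer to make with a proper XML or RDF library.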
motsa Posted February 27, 2005

This site is really only for ODP topics. While chefmoz.org may be a sister site to dmoz.org, this site doesn't deal with anything related to it. I would use the email staff link on the Chefmoz site to contact the Chefmoz programmers about your problems.
dzieciou (Author) Posted February 28, 2005

I have some doubts about what you've said, because I've read "chefmoz is part of ODP" many times. I have also sent e-mails (via the "e-mail Chefmoz staff" link) twice in the past, without any response. I have also asked editors and root editors of chefmoz.org, again without any response.

Has Chefmoz died? I can't believe it. Although the last change on their official site dates from 2003, a lot of people, restaurants, etc. are participating in it. Take a look: 259684 restaurants - 53418 reviews - 56373 links to reviews - 1551 editors.

Is it perhaps possible to get all the data from Chefmoz in a raw (non-RDF) format? How do you, the people from dmoz, acquire data from dmoz for your personal use?

Thanks for your help,
Maciej
motsa Posted February 28, 2005

dzieciou wrote: I have some doubts about what you've said, because I've read "chefmoz is part of ODP" many times.

You might want to reread where I wrote: While chefmoz.org may be a sister site to dmoz.org, this site doesn't deal with anything related to it.
sdang Posted January 18, 2008

Error parsing RDF CHEFMOZ

So I guess I am raising this thread from the dead, but I am wondering whether anyone has successfully figured out a way to parse the Chefmoz RDF file. I am using the PHP dmoz_v3 to do this and getting XML Parser Error (error code 7) mismatched tag. The error was found on line 2394977. Is this because, as I am reading above, the structure isn't exactly the same as dmoz's? Any other suggestions on how to get the data from Chefmoz?

Thank you!
Simon
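One way to investigate a "mismatched tag" report like the one above is to run the file through a plain XML parser and print the lines around the first failure. The following Python sketch uses the standard library's Expat binding, in which error code 7 corresponds to "mismatched tag". The function name `first_xml_error` and the tiny test document are hypothetical, and nothing here is specific to the PHP dmoz_v3 script.

```python
import io
import itertools
import xml.parsers.expat as expat


def first_xml_error(stream, context=2):
    """Parse a seekable binary stream; return None if it is well-formed.

    Otherwise return (line_number, message, nearby_lines) for the first
    well-formedness error that Expat reports.
    """
    parser = expat.ParserCreate()
    try:
        parser.ParseFile(stream)       # reads the stream in chunks
        return None
    except expat.ExpatError as err:
        # Re-read only the neighbourhood of the failing line for display.
        stream.seek(0)
        start = max(err.lineno - 1 - context, 0)
        stop = err.lineno + context
        nearby = [
            line.decode("utf-8", "replace").rstrip("\r\n")
            for line in itertools.islice(stream, start, stop)
        ]
        return err.lineno, expat.ErrorString(err.code), nearby


if __name__ == "__main__":
    # Hypothetical miniature document with an unclosed <b> element.
    broken = io.BytesIO(b"<a>\n<b>\n</a>\n")
    print(first_xml_error(broken))
```

For a real dump, the stream would be the file opened in binary mode. The reported line is where the parser noticed the problem; the element that was left unclosed may start several lines earlier.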
weglobenet Posted October 20, 2008

sdang wrote: So I guess I am raising this thread from the dead, but I am wondering whether anyone has successfully figured out a way to parse the Chefmoz RDF file. I am using the PHP dmoz_v3 to do this and getting XML Parser Error (error code 7) mismatched tag. The error was found on line 2394977. Is this because, as I am reading above, the structure isn't exactly the same as dmoz's? Any other suggestions on how to get the data from Chefmoz? Thank you! Simon

see another thread: http://www. webmasterworld.com/forum21/107.htm
weglobenet Posted October 23, 2008

weglobenet wrote: see another thread: http://www. webmasterworld.com/forum21/107.htm

Error :confused: See another thread in this forum: http://www.resource-zone.com/forum/index.php?showtopic=48680
retty Posted November 24, 2008

Hi. The URL provided was really useful. Thanks, man.
EddieDent Posted December 1, 2008

hi... :(

dzieciou wrote: I have some doubts about what you've said, because I've read "chefmoz is part of ODP" many times. I have also sent e-mails (via the "e-mail Chefmoz staff" link) twice in the past, without any response. I have also asked editors and root editors of chefmoz.org, again without any response. Has Chefmoz died? I can't believe it. Although the last change on their official site dates from 2003, a lot of people, restaurants, etc. are participating in it. Take a look: 259684 restaurants - 53418 reviews - 56373 links to reviews - 1551 editors. Is it perhaps possible to get all the data from Chefmoz in a raw (non-RDF) format? How do you, the people from dmoz, acquire data from dmoz for your personal use? Thanks for your help, Maciej
jimnoble Posted December 1, 2008

To more or less repeat what's been said higher up this thread: While chefmoz.org may be a sister site to dmoz.org, this forum doesn't deal with anything related to it.