Chefmoz RDF syntax validity

dzieciou

Member
Joined
Feb 27, 2005
Messages
8
Hello!

I'm currently participant of the university project which is to compile agents' usage and semantic web data (exactly: chefmoz RDF dump, which is part of ODP).

I'm sure this not a new problem, however very important to me and my team.

1. Let me know where can I get help about syntax of RDF dump data of chefmoz if this is wrong place.

2. I've tried to parse RDF data from chefmoz.org many time, and I've had to fix it (some lack in using of UTF-8 charset, misspelling in XML entities, incorrect RDF schema - failed checkind by W3W RDF Validator and JENA OWL Syntax Checker, lack of namespaces) and every try I had to fix some problems manually. I know it is everyday problem to chefmoz editors and "mannual approach" to fix it is common. Do you have any tools to correct it?

3. RDF data dump is inconsistent and not always agree to sketch of chefmoz data ontology (namaspace: http://chefmoz.org/rdf/elements/1.0/), e.g. reviews elements are many times used in description of a restuarant. I've talked to a few programmers using your data at their sites and they wonder why objective data (restaurant description) haven't been splitted from subjective data (reviews) and others (external pages) ?

Thnx in advance for any help or suggestions. I appreciate your great work and I'm sure that validating syntax of data could be an advantage to us all,

Best regards,

Maciej Gawinecki
 

dzieciou

Member
Joined
Feb 27, 2005
Messages
8
Other people about this problem

This comes from:

http://esw.w3.org/topic/RestaurantRecommendation

[...] The first example above (the restaurant description) already has a few basic fixes to the RDF/XML to make it parse as sensible, modern RDF.

* £ written as # (because didn't know the numeric character reference nor where to look to find it. Clues please! --DanBri) (£ according to the html DTD)
* some syntax fixes (rdf: prefix, namespace URI etc)
* the rdf:id is now written cm:id, since this isn't usefully part of a URI
* Dublin Core title is lowercase, ie. 'd:title" here

Other changes can wait (eg. 'Phone' should use URI), and are more stylistic and pragmatic in nature: perhaps we might consider using GeoInfo or RdfCalendar techniques to better represent the details of place and time, or FOAF to represent contact and other details, 'who knows who' etc amongst reviewers.

* anyone tried parsing it yet? - chefmoz as distributed doesn't quite parse as RDF or even (char encoding issues) XML
* The toplevel RDF element is not namespace prefixed, and so is in the default namespace (theirs)
* they use the older http://purl.org/dc/elements/1.0/ URI for dublin core, and write 'Title' instead of 'title'; recent DC applications almost all use http://purl.org/dc/elements/1.1/ and lowercase property names.
* If the intention is to write what is often represented as 'rdf:ID', then their r:id element should be r:ID, with r bound to http://www.w3.org/1999/02/22/rdf-syntax-ns# instead of http://www.w3.org/TR/1999/REC-rdf-syntax-19990222 as it is currently
* the namespace uri ends in a slash, like DC, RSS, FOAF vocabs. This bothers some people (LinkMe)
* the parsed opening hours property -- can we try converting this to iCal/RDF markup?
* If there was a cleaned up version of this, I'd love to encourage folk to use such markup alongside their FOAF self-descriptions. --DanBri
* Factual errors in the data: Sudaca is in the Redland area of Bristol, not Cotham! what's the process for fixing this? [...]
 

motsa

Curlie Admin
Joined
Sep 18, 2002
Messages
13,294
This site is really only for ODP topics. While chefmoz.org may be a sister site to dmoz.org, this site doesn't deal with anything related to it. I would use the email staff link on the Chefmoz site to contact the Chefmoz programmers about your problems.
 

dzieciou

Member
Joined
Feb 27, 2005
Messages
8
I have some doubts about what you've said, because I've read "chefmoz is part of ODP" many times.
I also sended e-mails (e-mail chefmoz staff link) twice in the past, without any response. I've also asked editors and root editors of chefmoz.org -- without any response.
Has chefmoz died? I can't believe. Although the last date of change on their official site comes from 2003, there are a lot of people,restaurants etc. participating in it, take a look: 259684 restaurants - 53418 reviews - 56373 links to reviews - 1551 editors.
Maybe there is possibility to get all data from chefmoz in raw (not RDF) format? How you, people from dmoz aquire data from dmoz for your personal use?

Thnx for your help

Maciej.
 

motsa

Curlie Admin
Joined
Sep 18, 2002
Messages
13,294
I have some doubts about what you've said, because I've read "chefmoz is part of ODP" many times.
You might want to reread where I wrote: While chefmoz.org may be a sister site to dmoz.org, this site doesn't deal with anything related to it.
 

sdang

Member
Joined
Jan 16, 2008
Messages
6
Error parsing RDF CHEFMOZ

So I guess I am bringing back the dead on this thread but wondering if anyone has successfully figured a way out to parse the Chefmoz rdf file?

I am using the php dmoz_v3 to do this and getting XML Parser Error (error code 7) mismatched tag. The error was found on line 2394977. Is this because what I am reading above that the structure isn't exactly the same as dmoz?

Any other suggestions on how to get the data from chefmoz??

Thank you!
Simon
 

weglobenet

Member
Joined
Oct 7, 2007
Messages
16
sdang said:
So I guess I am bringing back the dead on this thread but wondering if anyone has successfully figured a way out to parse the Chefmoz rdf file?

I am using the php dmoz_v3 to do this and getting XML Parser Error (error code 7) mismatched tag. The error was found on line 2394977. Is this because what I am reading above that the structure isn't exactly the same as dmoz?

Any other suggestions on how to get the data from chefmoz??

Thank you!
Simon

see another thread:
http://www. webmasterworld.com/forum21/107.htm
 

EddieDent

Member
Joined
Dec 1, 2008
Messages
2
hi...
:(I have some doubts about what you've said, because I've read "chefmoz is part of ODP" many times.
I also sended e-mails (e-mail chefmoz staff link) twice in the past, without any response. I've also asked editors and root editors of chefmoz.org -- without any response.
Has chefmoz died? I can't believe. Although the last date of change on their official site comes from 2003, there are a lot of people,restaurants etc. participating in it, take a look: 259684 restaurants - 53418 reviews - 56373 links to reviews - 1551 editors.
Maybe there is possibility to get all data from chefmoz in raw (not RDF) format? How you, people from dmoz aquire data from dmoz for your personal use?:confused:

Thnx for your help

Maciej.
 

jimnoble

DMOZ Meta
Joined
Mar 26, 2002
Messages
18,915
Location
Southern England
To more or less repeat what's been said higher up this thread:

While chefmoz.org may be a sister site to dmoz.org, this forum doesn't deal with anything related to it.
 
This site has been archived and is no longer accepting new content.
Top