Jump to content

Recommended Posts

Posted

Dear Editors,

 

I am currently working on a WiderNet project (http://www.widernet.org), and we are having a eGranary digital library, which containes many whole websites (http://www.widernet.org/digitalLibrary/content/WhatsInside.asp). We need to gather the metadata for our websites, and we think DMOZ might have them in their database.

 

So, is it the RDF that we need to download? What kinds of information is in it? Is there a sample record for a single item?

 

What are the tools that are available for tranforming the data file into SQL files?

 

Thank you!

  • Meta
Posted

I have no idea what you mean with "metadata".

DMOZ only has the url of the websites we have listed and a title and description as written by us.

Information about the RDF cen be found on http://rdf.dmoz.org/ , inlcuding a small sample.

But remember that when you use the DMOZ data you must follow the license agreement as presented on http://www.dmoz.org/license.html

I will not answer PM or emails send to me. If you have anything to ask please use the forum.

  • Meta
Posted
So, is it the RDF that we need to download?

Assuming you are talking about a large number of samples you want to test: Yes, the content.* rdf file is what you need. be prepared that ist has a few GB uncompressed. If you are only talking about a few URLs you want to check manually, use the dmoz.org onsite search, omitting www. or other precfixes, searching for the domain only.

 

It is somewhat like XML, but unfortunately a very early stage of the RDF specification, which renders it unreadable for common parsers. But due to the syntax a semi skilled programmer should be able to parse the file easily.

 

What are the tools that are available for tranforming the data file into SQL files?

Not that many, sorry to say. The resources we know are listed in http://www.dmoz.org/Computers/Internet/Searching/Directories/Open_Directory_Project/Use_of_ODP_Data/Upload_Tools/ - maybe one of those links can help you.

Curlie Meta/kMeta Editor windharp

 

d9aaee9797988d021d7c863cef1d0327.gif

  • 1 year later...
Posted
Could you please explain me more in details regarding DMOZ matadata? I am new to that project and would like to get more info or online reference pages with information regardign that topic. Please let me know at your earliest convinience.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...