dmoz Posted August 1, 2006 Hi, I would like to download only the data for the "Recreation" category. How can I do this? The download link http://rdf.dmoz.org/rdf/content.rdf.u8.gz contains more than 300 MB of data covering all categories. I need only "Recreation". How can I do it? Thanks, JK
Meta informator Posted August 1, 2006 I am afraid that you can only download the complete RDF file. RDF files for individual categories are not offered. You would have to extract the information you want yourself. Curlie (Dmoz) Meta editor informator
timamie261 Posted August 1, 2006 You can search for a script that parses out what you want, or you can import the whole file into a database of your own and then export only what you need. Hope that helps.
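A minimal sketch of the "parse out what you want" route, in Python. It assumes that each listing in the dump is an `<ExternalPage ...>` block containing a line of the form `<topic>Top/...</topic>`, and that the file has been saved locally as `content.rdf.u8.gz`; check both against the actual dump before relying on it. The function names and the output file name are made up for illustration. The file is read line by line, so the full dump never has to fit in memory.

```python
import gzip

PREFIX = "Top/Recreation"  # category subtree to keep


def filter_pages(lines, prefix=PREFIX):
    """Yield each ExternalPage block (as one string) whose <topic> is under prefix.

    Assumes one tag per line, as described in the lead-in.
    """
    block = None   # lines of the block being collected, or None outside a block
    keep = False   # True once a matching <topic> has been seen in this block
    for line in lines:
        stripped = line.strip()
        if stripped.startswith("<ExternalPage"):
            block, keep = [line], False
        elif block is not None:
            block.append(line)
            if stripped.startswith("<topic>"):
                topic = stripped[len("<topic>"):].split("<", 1)[0]
                # Match the category itself or anything below it,
                # but not a sibling such as "Top/RecreationX".
                keep = topic == prefix or topic.startswith(prefix + "/")
            elif stripped.startswith("</ExternalPage>"):
                if keep:
                    yield "".join(block)
                block = None


def filter_dump(src_path, dst_path, prefix=PREFIX):
    """Stream the gzipped dump and write only the matching blocks to dst_path."""
    with gzip.open(src_path, "rt", encoding="utf-8", errors="replace") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for page in filter_pages(src, prefix):
            dst.write(page)
```

Calling `filter_dump("content.rdf.u8.gz", "recreation.rdf")` would write the matching blocks to `recreation.rdf`. The output is a list of fragments without the enclosing RDF root element or the `<Topic>` entries, so it is a starting point for further processing rather than a valid RDF file.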
chaos127 Posted August 2, 2006 Have you tried doing a Google search for "odp subcat dumps"?
sfromis Posted August 3, 2006 As hinted in the last posting, there is a set of unofficial split dump files created by an editor: http://rodan.ncc.com/rdf/cats/ Beware that the ODP directory structure has a lot of cross-category links, which cannot be resolved within a single dump file. This includes @links, which are often used to make the same category appear at different locations in the directory. As an example, http://dmoz.org/Recreation/Travel/Attractions/ would be rather useless without the @linked categories, of which only a few are in the Recreation top-level category.
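To get a feel for how much a single-category dump loses this way, one rough check is to list the links that start inside the subtree and point outside it. The sketch below does not depend on the dump format: it assumes you have already extracted (source category, target category) pairs by some means, and the function name and sample paths are invented for illustration.

```python
def dangling_links(links, root="Top/Recreation"):
    """Return the (source, target) pairs that leave the subtree under root."""
    def inside(path):
        # A path is inside the subtree if it is the root or lies below it.
        return path == root or path.startswith(root + "/")

    return [(src, dst) for src, dst in links if inside(src) and not inside(dst)]


# Hypothetical sample data, not taken from the real directory.
sample_links = [
    ("Top/Recreation/Travel/Attractions", "Top/Recreation/Theme_Parks"),
    ("Top/Recreation/Travel/Attractions", "Top/Arts/Architecture"),
    ("Top/Arts/Architecture", "Top/Recreation/Travel"),
]

for src, dst in dangling_links(sample_links):
    print(src, "->", dst)
```

Every pair it reports is a link that would have no target in a Recreation-only extract.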