Somebody know when next update of ODP will happens?

BEP

Member
Joined
Apr 21, 2009
Messages
4
Last date of moved in the archive of ODP database according
http://rdf.dmoz.org/rdf/archive/ is 2009-04-07.

Last modification date of ODP database according
http://rdf.dmoz.org/rdf is 2009-04-15.

All previous months these updates was in begin of each month.

Why so big delay for may 2009? Is somebody know when next update of ODP will happens?
 

pvgool

kEditall/kCatmv
Curlie Meta
Joined
Oct 8, 2002
Messages
10,093
The proces that creates the RDF has been broken for some time. AOL tech staff is working on the problem but we do not know when it will be solved.

This does not mean that the DMOZ database is not updated. It only means that we can not provide a copy to other websites that use our data.

Sorry for the inconvenience
 

BEP

Member
Joined
Apr 21, 2009
Messages
4
Is any contacts of AOL stuff who solve this problem exist? Maybe we need ask them when they solve this problem?

It's really big trouble, all sites, include google and alexa can't anymore use updates of DMOZ.

Nobody have interest to sovle this trouble fast?
 

photofox

Curlie Admin
RZ Admin
Joined
Jun 9, 2010
Messages
2,092
Location
[Right here]
No updates yet, but AOL are working on fixing the problems. We will post an update as soon as things are working properly again.

Thanks for your patience. :)
 

jpnutch

Member
Joined
Oct 17, 2006
Messages
32
Thanks!

Seems like the AOL SW team just can't handle this ODP stuff...
I remember in late 2006 they lost the entire database, and there were no updates for over 2 months, and it took even longer to get the RDF data working again. I would have thought they fixed all these issues back then, guess not...
 

pvgool

kEditall/kCatmv
Curlie Meta
Joined
Oct 8, 2002
Messages
10,093
Yes, there is a new version of the rdf. But unfortunately it is not error free.
 

pvgool

kEditall/kCatmv
Curlie Meta
Joined
Oct 8, 2002
Messages
10,093
yes, they are still working on the problems
 

pvgool

kEditall/kCatmv
Curlie Meta
Joined
Oct 8, 2002
Messages
10,093
As you might notice these "aol-" files are all very small and do not hold the complete data. Most probably they were from a testrun.

http://rdf.dmoz.org/ mentiones that the files you should use are structure.rdf.u8.gz and content.rdf.u8.gz. These two are last created begin June. There are unzipped versions from a later date. But we do not know if these are without problems. We (the DMOZ editors) have not been told that the problem is solved.

content.rdf.u8 21-Jul-2009 06:43 2.4G
content.rdf.u8.gz 02-Jun-2009 19:29 304M
structure.rdf.u8 21-Jul-2009 06:42 829M
structure.rdf.u8.gz 02-Jun-2009 19:29 73M
 

chaos127

Curlie Admin
Joined
Nov 13, 2003
Messages
1,344
I believe a new RDf dump is now available. It's being generated by a new system, so there may still be some slight issues. Could you have a look and see if whatever you thought had changed is still different? If so, could you be more specific (ideally with some example code snippets and line numbers) as to what is different...
 

cmeerw

Member
Joined
Feb 9, 2008
Messages
10
In last week's dump some tags were capitalised differently (newsGroup vs. newsgroup, description vs. Description) - but that appears to have been changed back in this week's dump. (edit: the newsgroup tag is still different - it previously was newsGroup, now it's newsgroup)

I believe even the non-dmoz_2.0 dumps had the new language structure last week, i.e. Top/en/... - also seems to be fixed now.

But the xmlns definitions are still not correct in the current dumps. Older dumps had

<RDF xmlns:r="http://www.w3.org/TR/RDF/"
xmlns:d="http://purl.org/dc/elements/1.0/"
xmlns="http://dmoz.org/rdf">

the current content.rdf.u8.gz now just has:

<RDF>

and the current structure.rdf.u8.gz uses:

<RDF xmlns:r="http://www.w3.org/TR/RDF/" xmlns:d="http://purl.org/dc/elements/1.0/">
 

cmeerw

Member
Joined
Feb 9, 2008
Messages
10
And the current dmoz_2.0 RDF dumps contain categories in almost random order, i.e. the first category in structure.rdf is /Top/en/Arts/Animation/Anime/Characters - it would make things much easier if it would start with /Top (and ensure that parent categories are always defined first).
 

cmeerw

Member
Joined
Feb 9, 2008
Messages
10
Also the content.rdf dump doesn't appear to include any mediadate tags any more and the structure.rdf dump doesn't appear to contain any editor tags.
 

jimnoble

DMOZ Meta
Joined
Mar 26, 2002
Messages
18,915
Location
Southern England
Presuming that you've read other threads in this forum, you'll be aware that the most recent dumps are being made by an entirely different process which is still under development.

We are aware of the two issues that you report.

If you discover any more, please report them here and we'll pass them along to the development team.
 

JustMeme

Member
Joined
Sep 9, 2009
Messages
4
Hello,

I see that the new files are ready, but from the discussion I understand there is something wrong with them? I just took over the responsiblity of running an old DMOZ clone, how this does this affect me in practise? Is it better to just wait till everything is sorted and running the old import job? (Will it fail if I use the current settings? (I am not a computer guy, and am afraid to touch a monster I do not understand...)

Also, how does this affect google directory? I have a site that has been added since the last import to directory.google.com, do anyone have any idea when they will update? Will they also wait till the update is complety sorted? Or is just yet another one of these wait and see things ;)
 

jimnoble

DMOZ Meta
Joined
Mar 26, 2002
Messages
18,915
Location
Southern England
The dumps can't be considered to be bug free at the moment and so we can't predict what technical difficulties you'll have.

As to Google, they set their own time scales, just as you do, and we have no knowledge of them :).
 
This site has been archived and is no longer accepting new content.
Top