When will geocities links be removed from the RDF download?

dzspider

Member
Joined
Nov 28, 2009
Messages
6
I just checked "content.rdf.u8.gz" (19-Nov-2009 09:01, 294M).

It still contains geocities links.

But dmoz.org has already removed them.
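
For anyone who wants to repeat this check on a local copy of the dump, here is a minimal sketch. It assumes the file has already been downloaded and that a geocities link shows up as the text "geocities.com" somewhere on a line; the function name `count_geocities_lines` is made up for illustration and is not part of any DMOZ tooling.

```python
import gzip
import re

# Assumption: any line mentioning geocities.com counts as a geocities link.
GEOCITIES = re.compile(rb"geocities\.com", re.IGNORECASE)

def count_geocities_lines(path):
    """Stream a gzipped dump and count the lines that mention geocities.com."""
    count = 0
    # Read in binary mode and line by line so the whole file never sits in memory.
    with gzip.open(path, "rb") as dump:
        for line in dump:
            if GEOCITIES.search(line):
                count += 1
    return count
```

Calling `count_geocities_lines("content.rdf.u8.gz")` on a dump that has been fully cleaned should return 0.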
 

chaos127

Curlie Admin
Joined
Nov 13, 2003
Messages
1,344
Once the links have been removed from the underlying database, the next RDF generated from it will no longer contain them. RDF dumps are generated once a week when things are running smoothly. IIRC, the date on the RDF dump is the date on which the dump file formatting was completed, rather than the date on which the snapshot of the data was taken from the live database. The snapshot may have been taken 2-4 days before the RDF date.

I would therefore imagine the geocities links will be absent from the next RDF dump generated.
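
To make the timing concrete, here is a rough sketch that estimates when the snapshot behind a given dump was probably taken, using the 2-4 day lag mentioned above. The lag is recalled from memory, not a guarantee, and the function name `snapshot_window` and the timestamp format (as shown in the download listing) are assumptions for illustration.

```python
from datetime import datetime, timedelta

def snapshot_window(dump_stamp, min_lag_days=2, max_lag_days=4):
    """Return (earliest, latest) likely snapshot dates for a dump timestamp.

    dump_stamp is a listing timestamp such as "19-Nov-2009 09:01".
    Note: %b parses English month abbreviations under the default C locale.
    """
    dump_date = datetime.strptime(dump_stamp, "%d-%b-%Y %H:%M").date()
    earliest = dump_date - timedelta(days=max_lag_days)
    latest = dump_date - timedelta(days=min_lag_days)
    return earliest, latest

earliest, latest = snapshot_window("19-Nov-2009 09:01")
print(earliest, latest)  # 2009-11-15 2009-11-17
```

So a dump dated 19-Nov-2009 would reflect the database as it stood somewhere around 15 to 17 November, and links removed after that would still appear in it.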
 

dzspider

Member
Joined
Nov 28, 2009
Messages
6
chaos127 said:
I would therefore imagine the geocities links will be absent from the next RDF dump generated.

The latest dump

content.rdf.u8.gz 12-Dec-2009 06:36 295M

still contains the geocities links.
 

pvgool

kEditall/kCatmv
Curlie Meta
Joined
Oct 8, 2002
Messages
10,093
This means that our quality control processes have not yet finished a complete scan of the database. Only after all the entries have been removed from the directory will they also be removed from the RDF.

I'll post in the internal forum. Maybe one of our more technical editors can speed up this process.
 