dzspider Posted November 28, 2009 Posted November 28, 2009 Just checked the "content.rdf.u8.gz", 19-Nov-2009 09:01 294M Still contains geocities links. But dmoz.org has the links removed already.
chaos127 Posted November 28, 2009 Posted November 28, 2009 Once the links have been removed from the underlying database, then the next RDF generated from it will no longer have those links. RDF dumps are generated once a week when things are running smoothly. IIRC, date on the RDF dump is the date on which the dump file formatting was completed, rather than the date the snapshot of the data was taken from the live database. The latter may be 2-4 days before the RDF date. I would therefore imagine the geocities links will be absent from the next RDF dump generated.
dzspider Posted December 12, 2009 Author Posted December 12, 2009 I would therefore imagine the geocities links will be absent from the next RDF dump generated. In the latest dump content.rdf.u8.gz 12-Dec-2009 06:36 295M still contains the geocities links.
Meta pvgool Posted December 12, 2009 Meta Posted December 12, 2009 This means that our quality control processes did not finish a complete scan of the database yet. Only after all the entries are removed from the directory they will also be removed from the RDF. I'll post in the internal forum. Maybe one of our more technical editors can speed up this process. I will not answer PM or emails send to me. If you have anything to ask please use the forum.
jimnoble Posted December 12, 2009 Posted December 12, 2009 ...or for a shorter answer to the original question, we don't know.
tszming Posted June 1, 2010 Posted June 1, 2010 Just checked, it has been fixed in the latest RDF dump.
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now