Bug: Duplicate category names in the dump

Fduch

New Member
Joined
Oct 24, 2012
Messages
4
Dmoz category names (identified by the supposedly unique id attribute of the Topic element) are assumed to be unique, but in fact they are not. This causes problems when the data is processed and used.

Can this be fixed?

Here is the list of non-unique category names with their catids:

Top/Health/Conditions_and_Diseases/Congenital_Anomalies/Craniofacial_Anomalies/Sturge-Weber_Syndrome 456346 456764
Top/Health/Conditions_and_Diseases/Skin_Disorders/Cowden_Syndrome 456324 456754
Top/Health/Conditions_and_Diseases/Skin_Disorders/Ectodermal_Dysplasia 456294 456398
Top/Health/Conditions_and_Diseases/Skin_Disorders/Pseudoxanthoma_Elasticum 456351 456389
Top/World/Deutsch/Wirtschaft/Bauwesen/Öffentlich-Private-Partnerschaften 351307 351329
Top/World/Deutsch/Wirtschaft/Gastgewerbe/Gastronomie/Catering/Marktbeschicker 351245 351247 351248
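
For anyone who wants to check a dump themselves, here is a minimal Python sketch of how a list like the one above could be produced. It assumes the dump is XML in which each Topic element carries an id attribute (possibly namespaced) and a catid child element. The function name `find_duplicate_topics`, the helper `local`, the namespace URIs and the sample document are all made up for illustration; the real dump layout may differ.

```python
import io
import xml.etree.ElementTree as ET
from collections import defaultdict


def local(tag):
    """Strip an XML namespace prefix of the form '{uri}name'."""
    return tag.rsplit("}", 1)[-1]


def find_duplicate_topics(source):
    """Return {topic id: [catids]} for every Topic id seen with more than one catid."""
    catids = defaultdict(list)
    for _, elem in ET.iterparse(source, events=("end",)):
        if local(elem.tag) != "Topic":
            continue
        # Assumption: the id attribute may be namespaced, so match on its local name.
        topic_id = next((v for k, v in elem.attrib.items() if local(k) == "id"), None)
        catid = next((c.text for c in elem if local(c.tag) == "catid"), None)
        if topic_id is not None and catid is not None:
            catids[topic_id].append(catid.strip())
        elem.clear()  # drop the element's children to keep memory use low
    return {name: ids for name, ids in catids.items() if len(ids) > 1}


# Made-up sample in the assumed layout; the namespace URIs are placeholders.
SAMPLE = b"""<RDF xmlns:r="http://example.org/r#" xmlns="http://example.org/dump#">
  <Topic r:id="Top/Health/Conditions_and_Diseases/Skin_Disorders/Cowden_Syndrome"><catid>456324</catid></Topic>
  <Topic r:id="Top/Health/Conditions_and_Diseases/Skin_Disorders/Cowden_Syndrome"><catid>456754</catid></Topic>
  <Topic r:id="Top/Example/Unique"><catid>1</catid></Topic>
</RDF>"""

for name, ids in find_duplicate_topics(io.BytesIO(SAMPLE)).items():
    print(name, *ids)
```

The function also accepts a file name instead of a file object, since it passes its argument straight to `ET.iterparse`.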