I know symbolic link in DMOZ category hierarchy is an artifact to accomadate the multi-labled requirements inherent in hierarchical classification scheme, but when I take it for granted that a symbolic linked category is a subcategory of the category which owns the symbolic link, I got some really puzzlement. For example, below is a category path( >: narrow; =>[...]: symbolic link):
Top> Computers> Artificial_Intelligence> People> Papert, Seymour@ =>[Top> Computers> History> Pioneers>Papert, Seymour] > Logo@ => [Top> Computers> Programming> Languages> Lisp> Logo]
All this seems good, but then comes the puzzle, the "Logo" categary has a symbolic link: "Papert, Seymour@", which is linked to: Top> Computers> History> Pioneers> Papert, Seymour
Then "Top> Computers> History> Pioneers> Papert, Seymour" and "Top> Computers> Programming> Languages> Lisp> Logo" contains each other!
My first reaction is that I spotted an exception, then I wrote a simple graph traversing algorithm to detect path cycles and applied it to my DMOZ category structure database. I have the final conclusion that DMOZ category structure is by no means a DAG graph, that is, it is not uncommon to come up with path cycles like the above.
I am a researcher in automatic text categorization area and DMOZ is a must for me, but when I follow the symbolic links as well as ordinary subcategories, I often reach unexpected unrelevant categories in the end. Can you imagine when I am collecting web pages from DMOZ about the topic "Artificial Intelligence", I reach a category labeled "Microsoft Office"? It seems that some symbolic links are not edited well enough or discrimented from "see also" link well engouth. For the time being, I refrained myself from following symbolic links. That is a pity because many symbolic links refer to truely relevant content.
Top> Computers> Artificial_Intelligence> People> Papert, Seymour@ =>[Top> Computers> History> Pioneers>Papert, Seymour] > Logo@ => [Top> Computers> Programming> Languages> Lisp> Logo]
All this seems good, but then comes the puzzle, the "Logo" categary has a symbolic link: "Papert, Seymour@", which is linked to: Top> Computers> History> Pioneers> Papert, Seymour
Then "Top> Computers> History> Pioneers> Papert, Seymour" and "Top> Computers> Programming> Languages> Lisp> Logo" contains each other!
My first reaction is that I spotted an exception, then I wrote a simple graph traversing algorithm to detect path cycles and applied it to my DMOZ category structure database. I have the final conclusion that DMOZ category structure is by no means a DAG graph, that is, it is not uncommon to come up with path cycles like the above.
I am a researcher in automatic text categorization area and DMOZ is a must for me, but when I follow the symbolic links as well as ordinary subcategories, I often reach unexpected unrelevant categories in the end. Can you imagine when I am collecting web pages from DMOZ about the topic "Artificial Intelligence", I reach a category labeled "Microsoft Office"? It seems that some symbolic links are not edited well enough or discrimented from "see also" link well engouth. For the time being, I refrained myself from following symbolic links. That is a pity because many symbolic links refer to truely relevant content.