D
dkoch
Hi:
I run a couple of sites, one of which has a targeted directory of subsets of DMOZ data. In the course of running my downsized version of DMOZ, I have noticed a LOT of stale links in the DMOZ data.
I am not too familiar with all the inner workings of DMOZ and all, so I posted my observations to a forum (www.webmasterworld.com) that I knew was frequented by DMOZ editors. In the course of our discussions, I learned the following.
1) DMOZ does have a spider that does regularly check for old links.
2) A LOT of links can "pass" the spider test, and be bad- custom 403 and 404 pages, a page that says, "We have moved..." etc.
3) DMOZ editors are a bit overworked, and do not always have time (or the inclination) to check existing links
4) DMOZ DOES put a priority on getting this sort of thing removed from their directory.
Well, in addition to pointing this out (which I am sure you are aware of anyway), I have a solution that should work well in the exisiting structure of DMOZ, AND NOT CAUSE A WHOLE LOT OF ADDITIONAL WORK BY THE EDITORS! Sure, it will cause SOME additional work, but it is not too time intensive. In practice, I have found I can review and deal with one bad URL in 10 seconds or so. But the MAJOR thing is that it at least brings these bad URL's to someones attention!
The solution is quite simple! Add a link to EVERY listing (dynamic, of course) that when clicked on by a user will take them to a page so they can report a bad link to the editor of that category. It would be run by a simple cgi script that is outside of existing DMOZ code, so should be able to be added quickly and easily, and yet work for the whole DMOZ system. (See a sample "Report a Bad Link" page from my site here: http://www.toon.com/links/cgi-bin/badlink.cgi?Title=The%20Cartoon%20Factory%20Animation%20Art%20Gallery&ID=7888&URL=http://www.cartoon-factory.com/&g=Report). Once reported by a user, my script would e-mail the editor of the category, tell him/her what site MIGHT be bad, the URL of that site (which in most e-mail systems is a clickable link, so click and decide right there!) and then also provide the editor RIGHT IN THE E-MAIL a link to either edit the URL (if the page has moved) or delete the entry entirely.
Yes, a very simple solution... and works very well for me. DMOZ might want to add some safeguards or something, again I do not know how you work. But this would really make cleaning up DMOZ a LOT easier, I think!
Yes, I am sure you will get some abuse, too. But as I said, it only takes 10 seconds to check each report, so the time impact is pretty minimal. But getting those stale links OUT of DMOZ has a tremendous benefit.
Anyway, I wanted to bring this to your attention. Hopefully, there is someone here who knows how to take this the next step (if you think it is a good idea!)
I have a basic script all written, and would be happy to GIVE it to DMOZ, gratis, no strings attached. (Kind of my way of paying DMOZ back)
If you think it is a good idea, and would like to see my script, and know who to go to to get this up and running, please feel free to contact me at admin-at-bcdb-dot-com
Thanks!
Dave
I run a couple of sites, one of which has a targeted directory of subsets of DMOZ data. In the course of running my downsized version of DMOZ, I have noticed a LOT of stale links in the DMOZ data.
I am not too familiar with all the inner workings of DMOZ and all, so I posted my observations to a forum (www.webmasterworld.com) that I knew was frequented by DMOZ editors. In the course of our discussions, I learned the following.
1) DMOZ does have a spider that does regularly check for old links.
2) A LOT of links can "pass" the spider test, and be bad- custom 403 and 404 pages, a page that says, "We have moved..." etc.
3) DMOZ editors are a bit overworked, and do not always have time (or the inclination) to check existing links
4) DMOZ DOES put a priority on getting this sort of thing removed from their directory.
Well, in addition to pointing this out (which I am sure you are aware of anyway), I have a solution that should work well in the exisiting structure of DMOZ, AND NOT CAUSE A WHOLE LOT OF ADDITIONAL WORK BY THE EDITORS! Sure, it will cause SOME additional work, but it is not too time intensive. In practice, I have found I can review and deal with one bad URL in 10 seconds or so. But the MAJOR thing is that it at least brings these bad URL's to someones attention!
The solution is quite simple! Add a link to EVERY listing (dynamic, of course) that when clicked on by a user will take them to a page so they can report a bad link to the editor of that category. It would be run by a simple cgi script that is outside of existing DMOZ code, so should be able to be added quickly and easily, and yet work for the whole DMOZ system. (See a sample "Report a Bad Link" page from my site here: http://www.toon.com/links/cgi-bin/badlink.cgi?Title=The%20Cartoon%20Factory%20Animation%20Art%20Gallery&ID=7888&URL=http://www.cartoon-factory.com/&g=Report). Once reported by a user, my script would e-mail the editor of the category, tell him/her what site MIGHT be bad, the URL of that site (which in most e-mail systems is a clickable link, so click and decide right there!) and then also provide the editor RIGHT IN THE E-MAIL a link to either edit the URL (if the page has moved) or delete the entry entirely.
Yes, a very simple solution... and works very well for me. DMOZ might want to add some safeguards or something, again I do not know how you work. But this would really make cleaning up DMOZ a LOT easier, I think!
Yes, I am sure you will get some abuse, too. But as I said, it only takes 10 seconds to check each report, so the time impact is pretty minimal. But getting those stale links OUT of DMOZ has a tremendous benefit.
Anyway, I wanted to bring this to your attention. Hopefully, there is someone here who knows how to take this the next step (if you think it is a good idea!)
I have a basic script all written, and would be happy to GIVE it to DMOZ, gratis, no strings attached. (Kind of my way of paying DMOZ back)
If you think it is a good idea, and would like to see my script, and know who to go to to get this up and running, please feel free to contact me at admin-at-bcdb-dot-com
Thanks!
Dave