C
chriskud5
Thank you both for clearing up the way that DMOZ goes about storing info.
Certainly an automated status check could be made to work with data files rather than a specific database. Having just a few possible statuses, such as "Not Submitted" "Pending" "Listed" and "Denied". The "listed" status could be easily achieved by scanning the current dev cat for the url string, the "submitted" status can be checked by scanning through the "pending" file (or whatever way DMOZ uses a file to store sites that are waiting for review).
"There are lots of possible statuses, and because the system of processing the submission is so based on Humans, interpreting the status is more than a simple check to see if a site is waiting in a given category. "
From my readings (and I know samiam has had tons more experience with all this than most people) it seems that not to many statuses that are given back to people here on this site exist. "Pending" is a popular one, "Denied" is another popular one, and "It is listed in the dev dir" is another one. Giving specifics about when the editor was last there do not need to be relayed, they are not now.
I don't seem to get the point of denying status checks to known abusers?? If the abuser is known, and the site being submitted is spammy or whatever, the editor won't submit it. Who cares if the submitter checks the status, it probably won't take up any more processing or bandwidth than having an editor from here go and check. If a status check is done correctly, the processing cycles would be low, the bandwidth very low, making 500 queries in the status tool equal to the processing power and bandwidth equal to one status check from an editor at this website.
Common logic would indicate that it would be more worthwhile to have a huge sign pointing to a bathroom so you DO NOT ask a person where it is. Asking a person where it is takes away time and effort that they can be doing their job. Asking editors here for the status of some site that probably won't get listed anyways only takes them away from getting the sites that should be listed into the directory, and thus creating a better experience for users of DMOZ. Just the other day at the O'Hare airport in America I was looking for the bathroom, I went to the front desk at the airport lounge to ask, and they didn't even say a word, just pointed to a big sign that said "Restrooms ->"
Taking away the productivity of employees, volunteers, whoever they may be is a situation that administrators want to avoid at all costs (when productivity and profits are in mind of course).
The statement "Humans it do it better" is a great mindset to have when editing categories and reviewing sites for submission, and certainly does produce more relevant results (another debate altogether). Busy work such as checking a status, and having a finite answer exist (listed, denied, pending, not submitted) does not require the "personal touch"; it only detracts from an editor adding more useful sites to the directory. I stumbled upon a thread the other day that said "The more you ask about the status of your site, the more time you are taking us away from getting pending sites reviewed"
I think that this sums up the current system perfectly. And for a directory that has had people waiting with good sites for over a year, it is certainly something that needs to be addressed. The DMOZ is a service, and in my mind a valuable one. If you requested a plumber come fix the leaky pipes, and he took over a year to come, you wouldn’t be too pleased. I do think that this is how people are starting to see the ODP. It certainly would be hard to provide a 1 week turn around time on submitted sites, but some aspects (such as automated site status checks) could free up more time for editors to edit, possibly increasing the amount of sites reviewed and added to the directory, decreasing the waiting time for sites to be listed, and thus providing a BIGGER directory with MORE useful sites for the entire world.
"Humans do it better" certainly is a great foundation to work off of, but tasks that do not need a human touch could be automated (in my mind!)
As far as this comment goes
"And perhaps even many times a day by automated programs. (Similar to those that monitor rankings in search engines). "
This problem has been addressed very easily by major automated services such as the network solutions database which resolves urls, nic handles, etc to a database entry with the owners name, DNS server, admin and tech contact.
A field where a user has to copy in a random generated series of numbers and letters to be able to check the status would make it impossible for automated systems to use the tool. Every time a user wants to query the status, he/she has to enter in the string. This has cut down the traffic and server cycles to the network solutions Database. Many SE require that in order to suggest your URL to the BOT, you enter in a random generated code, to prevent automated entries from "website traffic" programs and the like. A status check on DMOZ would certainly be a candidate for that type of technology.
"So far, I think this system has been working relatively well - and you even get human interaction."
I disagree. From a submitter side of things, I do not think the system is all that great. I would much rather be able to "bug a machine" at my leisure than request that someone else take time out of there work to do something for me. I think this for a few reasons,
1. They have other things more important than looking at my site.
2. I would rather have a response right away as to the status than wait for an editor to look it up
3. I would rather be able to check 1 time a day than 1 time a month.
I am sure many people who do request site status feel a similar way. It is easy for an editor to tell people not to ask for a month, but it is a little unnerving to want to know the status of something that could mean success or failure (DMOZ does provide high ranking PR links to websites, influencing the results of searches for keywords in Google and other engines that crawl using DMOZ).
As a user of the DMOZ, I feel that I (as well as other DMOZ users) would be at a benefit if more listing and less "status checking" went on. Editors have given tons of time to reviewing sites; they should not be our secretaries to check on the status of something for us.
Anyone have any types of statistics on the number of added pages to the directory daily for the past couple years? It would be interesting to see the number of sites listed compared with the number of editors.
Certainly an automated status check could be made to work with data files rather than a specific database. Having just a few possible statuses, such as "Not Submitted" "Pending" "Listed" and "Denied". The "listed" status could be easily achieved by scanning the current dev cat for the url string, the "submitted" status can be checked by scanning through the "pending" file (or whatever way DMOZ uses a file to store sites that are waiting for review).
"There are lots of possible statuses, and because the system of processing the submission is so based on Humans, interpreting the status is more than a simple check to see if a site is waiting in a given category. "
From my readings (and I know samiam has had tons more experience with all this than most people) it seems that not to many statuses that are given back to people here on this site exist. "Pending" is a popular one, "Denied" is another popular one, and "It is listed in the dev dir" is another one. Giving specifics about when the editor was last there do not need to be relayed, they are not now.
I don't seem to get the point of denying status checks to known abusers?? If the abuser is known, and the site being submitted is spammy or whatever, the editor won't submit it. Who cares if the submitter checks the status, it probably won't take up any more processing or bandwidth than having an editor from here go and check. If a status check is done correctly, the processing cycles would be low, the bandwidth very low, making 500 queries in the status tool equal to the processing power and bandwidth equal to one status check from an editor at this website.
Common logic would indicate that it would be more worthwhile to have a huge sign pointing to a bathroom so you DO NOT ask a person where it is. Asking a person where it is takes away time and effort that they can be doing their job. Asking editors here for the status of some site that probably won't get listed anyways only takes them away from getting the sites that should be listed into the directory, and thus creating a better experience for users of DMOZ. Just the other day at the O'Hare airport in America I was looking for the bathroom, I went to the front desk at the airport lounge to ask, and they didn't even say a word, just pointed to a big sign that said "Restrooms ->"
Taking away the productivity of employees, volunteers, whoever they may be is a situation that administrators want to avoid at all costs (when productivity and profits are in mind of course).
The statement "Humans it do it better" is a great mindset to have when editing categories and reviewing sites for submission, and certainly does produce more relevant results (another debate altogether). Busy work such as checking a status, and having a finite answer exist (listed, denied, pending, not submitted) does not require the "personal touch"; it only detracts from an editor adding more useful sites to the directory. I stumbled upon a thread the other day that said "The more you ask about the status of your site, the more time you are taking us away from getting pending sites reviewed"
I think that this sums up the current system perfectly. And for a directory that has had people waiting with good sites for over a year, it is certainly something that needs to be addressed. The DMOZ is a service, and in my mind a valuable one. If you requested a plumber come fix the leaky pipes, and he took over a year to come, you wouldn’t be too pleased. I do think that this is how people are starting to see the ODP. It certainly would be hard to provide a 1 week turn around time on submitted sites, but some aspects (such as automated site status checks) could free up more time for editors to edit, possibly increasing the amount of sites reviewed and added to the directory, decreasing the waiting time for sites to be listed, and thus providing a BIGGER directory with MORE useful sites for the entire world.
"Humans do it better" certainly is a great foundation to work off of, but tasks that do not need a human touch could be automated (in my mind!)
As far as this comment goes
"And perhaps even many times a day by automated programs. (Similar to those that monitor rankings in search engines). "
This problem has been addressed very easily by major automated services such as the network solutions database which resolves urls, nic handles, etc to a database entry with the owners name, DNS server, admin and tech contact.
A field where a user has to copy in a random generated series of numbers and letters to be able to check the status would make it impossible for automated systems to use the tool. Every time a user wants to query the status, he/she has to enter in the string. This has cut down the traffic and server cycles to the network solutions Database. Many SE require that in order to suggest your URL to the BOT, you enter in a random generated code, to prevent automated entries from "website traffic" programs and the like. A status check on DMOZ would certainly be a candidate for that type of technology.
"So far, I think this system has been working relatively well - and you even get human interaction."
I disagree. From a submitter side of things, I do not think the system is all that great. I would much rather be able to "bug a machine" at my leisure than request that someone else take time out of there work to do something for me. I think this for a few reasons,
1. They have other things more important than looking at my site.
2. I would rather have a response right away as to the status than wait for an editor to look it up
3. I would rather be able to check 1 time a day than 1 time a month.
I am sure many people who do request site status feel a similar way. It is easy for an editor to tell people not to ask for a month, but it is a little unnerving to want to know the status of something that could mean success or failure (DMOZ does provide high ranking PR links to websites, influencing the results of searches for keywords in Google and other engines that crawl using DMOZ).
As a user of the DMOZ, I feel that I (as well as other DMOZ users) would be at a benefit if more listing and less "status checking" went on. Editors have given tons of time to reviewing sites; they should not be our secretaries to check on the status of something for us.
Anyone have any types of statistics on the number of added pages to the directory daily for the past couple years? It would be interesting to see the number of sites listed compared with the number of editors.