Hi All,
I need to download all ODP pages that contain links to English sites.
After reading about ODP, I thought that I just needed to download all pages
except those under World/* and Kids_And_Teens/International/*, and then extract
the links to all English sites from those pages.
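
A rough sketch of the path filter I have in mind, in Python. The names `EXCLUDED_PREFIXES`, `category_path` and `is_candidate` are made up for illustration, and the case-insensitive comparison is my own assumption, since I am not sure of the exact capitalisation of the category names:

```python
from urllib.parse import urlparse

# Category prefixes assumed to hold non-English listings (taken from the
# description above; compared in lowercase, which is an assumption).
EXCLUDED_PREFIXES = ("world/", "kids_and_teens/international/")


def category_path(url):
    """Return the category path of an ODP URL without the leading slash."""
    return urlparse(url).path.lstrip("/")


def is_candidate(url):
    """Return True if the URL's category lies outside the excluded prefixes."""
    # Normalise to lowercase with exactly one trailing slash so that
    # ".../World" and ".../World/" are treated the same way.
    path = category_path(url).lower().rstrip("/") + "/"
    return not path.startswith(EXCLUDED_PREFIXES)
```

This only looks at the category path, so it cannot detect a non-English listing that sits in a category outside those two prefixes.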
However, I found many exceptions such as
http://www.dmoz.org/Business/Busine...lation/Single_Language/Slovenian_and_English/
This path is neither in the World directory nor in Kids_And_Teens/International, but
when I click on "Tomaž Metelko" on that page, I still reach a non-English site.
Please let me know if I am doing something wrong.
Also, please let me know if you are aware of a way to download only those ODP pages
that contain links to English sites.
Thanks,
kiran