DrJ Posted July 7, 2006

Given a domain name, I'm trying to extract a site description from the content data. When the domain is listed under multiple ExternalPage elements, how do I identify a "primary" one? For example, yahoo.com has three:

<ExternalPage about="http://www.yahoo.com/">
  <d:Title>Yahoo!</d:Title>
  <d:Description>Yahoo!'s webservers exclusively run FreeBSD. In addition, all the non-production servers and developer workstations run FreeBSD.</d:Description>
  <priority>1</priority>
  <topic>Top/Computers/Software/Operating_Systems/Unix/BSD/FreeBSD/Prominent_Users</topic>
</ExternalPage>
<ExternalPage about="http://www.yahoo.com/">
  <d:Title>Yahoo!</d:Title>
  <d:Description>Personalized content and search options. Chatrooms, free e-mail, clubs, and pager.</d:Description>
  <priority>1</priority>
  <topic>Top/Computers/Internet/On_the_Web/Web_Portals</topic>
</ExternalPage>
<ExternalPage about="http://www.yahoo.com/">
  <d:Title>Yahoo!</d:Title>
  <d:Description>The first large scale directory of the Internet, now a major portal offering search engine results, customizable content, chatrooms, free e-mail, clubs, and pager.</d:Description>
  <priority>1</priority>
  <topic>Top/Computers/Internet/Searching/Directories/Yahoo</topic>
</ExternalPage>

When I search for yahoo.com on dmoz.org, the third description comes out at the top, so I'm guessing something identifies this as better than the other two. How do I figure this out?

Editall/Catmv arubin Posted July 7, 2006

I don't think we identify the concept of a "primary listing". In other words, they're all listings.

motsa Posted July 7, 2006

There's really no such thing as a "primary" listing.

"When I search for yahoo.com on dmoz.org, the third description comes out at the top, so I'm guessing something identifies this as better than the other two."

The search results are, for all intents and purposes, in random order. I believe in this case it's ordering them roughly by when they were added to the directory, but date of listing does not denote importance.
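
Since every ExternalPage entry is an equal listing, one way to handle a domain like yahoo.com is to collect all of its entries and then choose among them by your own criteria. Below is a minimal Python sketch of that, not an official tool. The wrapping RDF root element and its namespace URIs are assumptions added so the sample from this thread parses on its own (check the header of the real dump), and the helper names local, host_matches and listings_for are hypothetical.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

# The three entries quoted in this thread, wrapped in a root element.
# ASSUMPTION: the root element and namespace URIs are illustrative only.
SAMPLE = """<RDF xmlns="http://dmoz.org/rdf/" xmlns:d="http://purl.org/dc/elements/1.0/">
<ExternalPage about="http://www.yahoo.com/">
  <d:Title>Yahoo!</d:Title>
  <d:Description>Yahoo!'s webservers exclusively run FreeBSD. In addition, all the non-production servers and developer workstations run FreeBSD.</d:Description>
  <priority>1</priority>
  <topic>Top/Computers/Software/Operating_Systems/Unix/BSD/FreeBSD/Prominent_Users</topic>
</ExternalPage>
<ExternalPage about="http://www.yahoo.com/">
  <d:Title>Yahoo!</d:Title>
  <d:Description>Personalized content and search options. Chatrooms, free e-mail, clubs, and pager.</d:Description>
  <priority>1</priority>
  <topic>Top/Computers/Internet/On_the_Web/Web_Portals</topic>
</ExternalPage>
<ExternalPage about="http://www.yahoo.com/">
  <d:Title>Yahoo!</d:Title>
  <d:Description>The first large scale directory of the Internet, now a major portal offering search engine results, customizable content, chatrooms, free e-mail, clubs, and pager.</d:Description>
  <priority>1</priority>
  <topic>Top/Computers/Internet/Searching/Directories/Yahoo</topic>
</ExternalPage>
</RDF>"""


def local(tag):
    """Strip any '{namespace}' prefix from an ElementTree tag name."""
    return tag.rsplit("}", 1)[-1]


def host_matches(url, domain):
    """True if the URL's host is the domain itself or a subdomain of it."""
    host = (urlparse(url).hostname or "").lower()
    domain = domain.lower()
    return host == domain or host.endswith("." + domain)


def listings_for(xml_text, domain):
    """Return every ExternalPage entry for the domain, in document order."""
    root = ET.fromstring(xml_text)
    results = []
    for page in root.iter():
        if local(page.tag) != "ExternalPage":
            continue
        url = page.get("about", "")
        if not host_matches(url, domain):
            continue
        # Map child tag names (without namespace) to their text content.
        fields = {local(child.tag): (child.text or "").strip() for child in page}
        results.append({
            "url": url,
            "title": fields.get("Title", ""),
            "description": fields.get("Description", ""),
            "topic": fields.get("topic", ""),
        })
    return results


if __name__ == "__main__":
    for entry in listings_for(SAMPLE, "yahoo.com"):
        print(entry["topic"])
        print("   ", entry["description"])
```

Nothing in the data ranks one entry above another (all three carry the same priority value), so any rule for picking a single description, such as preferring a particular topic branch, is your own choice rather than something the directory defines. For the full dump, which is far larger than this sample, a streaming parser such as ET.iterparse would likely be a better fit than loading everything with ET.fromstring.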