Jump to content

Recommended Posts

Posted

Could guru who has grown up with perl be so kind as to tell a regular expression that would filter out every site from an rdf dump that does not

start with the requested url pattern, e.g. Top/Arts/Music/Metal in order to be able to build a directory which is a subtree of the whole big dump.

 

It probably just needs to evaluate everythign betwenn the start "<" and before the beginning of the next "<" and look at everything that starts with "Top" and ends with " and replace that by and empty string if it is not mtached.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...