dermotz Posted March 18, 2004 Posted March 18, 2004 Could guru who has grown up with perl be so kind as to tell a regular expression that would filter out every site from an rdf dump that does not start with the requested url pattern, e.g. Top/Arts/Music/Metal in order to be able to build a directory which is a subtree of the whole big dump. It probably just needs to evaluate everythign betwenn the start "<" and before the beginning of the next "<" and look at everything that starts with "Top" and ends with " and replace that by and empty string if it is not mtached.
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now