Here is my code for regex matching, which worked for a webpage:
public class RegexTestHarness {
    public static void main(String[] args) {
I've also tried the WebSphinx application. I noticed that if I set wikipedia.org as the starting URL, it does not crawl any further.
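For reference, here is a minimal sketch of the kind of regex matching I mean, applied to a page's HTML to pull out the links a crawler would need to follow next. It is not my original code and it does not use WebSphinx: the class name LinkExtractor, the method extractLinks, and the sample HTML are made up for illustration, and the pattern only handles quoted href attributes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only (not part of WebSphinx).
public class LinkExtractor {
    // Matches href="..." or href='...'; group 1 is the quote character,
    // group 2 is the URL between the quotes.
    private static final Pattern HREF =
        Pattern.compile("href\\s*=\\s*([\"'])(.*?)\\1", Pattern.CASE_INSENSITIVE);

    // Returns every quoted href value found in the given HTML text, in order.
    public static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(2));
        }
        return links;
    }

    public static void main(String[] args) {
        // Assumed sample input; a real crawler would download the page first.
        String html = "<a href=\"https://en.wikipedia.org/wiki/Web_crawler\">crawler</a>"
                    + " <a href='/wiki/Regex'>regex</a>";
        for (String link : extractLinks(html)) {
            System.out.println(link);
        }
    }
}
```

Relative links such as /wiki/Regex would still have to be resolved against the page URL before they can be fetched, and a regex like this is only a rough approximation of real HTML parsing.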