good web page parsing language/tool?


lsman11
04-16-2007, 12:32 PM
Hello,

I wanted to get some feedback from this group before writing any code. I want a script/application that accesses a web page, MAY fill out and submit a form, and then parses the data from the resulting page.

It's basically an aggregation of data from a page. My first choice of language was Perl, since I have used its 'mechanize' library, which has the API support I need.

Are there any other suggestions for languages to do this?
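
For comparison, the fetch / optionally submit a form / parse flow described above can be sketched with nothing but the Python standard library. This is only a rough illustration, not something anyone in this thread proposed: the names `CellCollector`, `parse_rows`, `build_request` and `fetch_rows` are made up for the example, and it assumes the data of interest sits in plain `<td>` cells of an HTML table.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode
from urllib.request import Request, urlopen


class CellCollector(HTMLParser):
    """Collects the text of every <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None    # cells of the row currently being read
        self._cell = None   # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_rows(html):
    """Return the table cells found in an HTML string as a list of rows."""
    parser = CellCollector()
    parser.feed(html)
    parser.close()
    return parser.rows


def build_request(url, form_fields=None):
    """Plain GET when there are no fields, otherwise a form-encoded POST."""
    if not form_fields:
        return Request(url)
    body = urlencode(form_fields).encode("ascii")
    return Request(url, data=body)  # supplying data makes urllib send a POST


def fetch_rows(url, form_fields=None, timeout=30):
    """Fetch the page (submitting the form if fields are given) and parse it."""
    with urlopen(build_request(url, form_fields), timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return parse_rows(resp.read().decode(charset, errors="replace"))
```

A call would look like `fetch_rows("http://example.com/search", {"q": "widgets"})`, where the URL and the field name are placeholders. Unlike a mechanize-style library, this sketch does not read the form out of the page for you, so the field names have to be known in advance.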

tokenuser
04-16-2007, 05:42 PM
Perl is an obvious choice, because it is a language designed for text parsing (Practical Extraction and Report Language), but I am a Java whore. A system I wrote was still running unattended 4 years after the company I wrote it with collapsed. Why? Because I was able to code a lot of error checking into the app. It was robust, which is more than can be said for most Perl systems. Perl is great for one-offs and prototypes, but if it is going to be around for a while, consider Java :)
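
The kind of error checking that keeps an unattended scraper alive is not tied to one language. As a hedged sketch only (the post does not say what checks the Java system actually used), here is a small retry wrapper in Python. The name `run_with_retries`, the logger name and the default values are all assumptions made for this example.

```python
import logging
import time

log = logging.getLogger("aggregator")  # hypothetical logger name


def run_with_retries(task, attempts=3, delay_seconds=5.0, sleep=time.sleep):
    """Call task() until it succeeds or the attempts run out.

    OSError covers network failures (urllib's URLError and socket
    timeouts are subclasses); ValueError stands in for "the page did
    not look the way the parser expected".
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return task()
        except (OSError, ValueError) as exc:
            log.warning("attempt %d of %d failed: %s", attempt, attempts, exc)
            if attempt == attempts:
                raise  # give up and let the caller see the last error
            sleep(delay_seconds)
```

Passing `sleep` as a parameter is just a convenience so the wrapper can be exercised without real waiting. Every failure is logged and the final one is re-raised, so a scheduled job fails loudly instead of silently producing empty data.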