Web Crawling Sites with Javascripts or web forms

Posted by Jojo on Stack Overflow See other posts from Stack Overflow or by Jojo
Published on 2010-03-30T10:51:25Z Indexed on 2010/03/30 10:53 UTC
Read the original article Hit count: 337

Filed under:
|
|
|
|

Hello everyone, I have a webcrawler application. It successfully crawled most common and simple sites. Now i ran into some types of websites wherein HTML documents are dynamically generated through FORMS or javascripts. I believe they can be crawled and I just don't know how. Now, these websites do not show the actual HTML page. I mean if I browse that page in IE or firefox, the HTML code does not match what's actually in the IE or firefox. These sites contain textboxes, checkboxes, etc... so I believe they are what they call "Web Forms". Actually I am not much familiar with web development so correct me if I'm wrong.

My question is, does anyone in similar situation as I am now and have successfully solved these types of "challenges"? Does anyone know of a certain book or article regarding web crawling? Those that pertains to these advanced type of websites?

Thanks.

© Stack Overflow or respective owner

Related posts about JavaScript

Related posts about webforms