Scrape a dynamic website

This is a difficult problem because you either have to reverse-engineer the JavaScript on a per-site basis, or embed a JavaScript engine and run the scripts yourself (which has its own difficulties and pitfalls).

It’s a heavyweight solution, but I’ve seen people do this with Greasemonkey scripts: let Firefox render everything and run the JavaScript, then scrape the elements. You can even initiate user actions on the page if needed.
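
For what it's worth, here is a rough sketch of what such a userscript might look like. It is not taken from any particular site: the `@match` URL, the `table#results tr` selector, the `button.load-more` button and the 2-second wait are all placeholders I made up for illustration, and `scrapeText` is just a helper name I picked. The idea is simply to wait for the page to load, optionally trigger a click, and then read the text out of the rendered DOM.

```javascript
// ==UserScript==
// @name        Scrape rendered rows (illustrative sketch)
// @namespace   example-scraper
// @match       https://example.com/*
// @grant       none
// ==/UserScript==

// Hypothetical selector: change it to match the elements the target page renders.
const ROW_SELECTOR = "table#results tr";

// Collect the trimmed, non-empty text of every element matching `selector`.
// `root` only needs a querySelectorAll method, so `document` works here.
function scrapeText(root, selector) {
  return Array.from(root.querySelectorAll(selector))
    .map((el) => el.textContent.trim())
    .filter((text) => text.length > 0);
}

// Only run inside a browser, once the page has finished loading.
if (typeof document !== "undefined") {
  window.addEventListener("load", () => {
    // Simulate a user action: click a hypothetical "load more" button if present.
    const more = document.querySelector("button.load-more");
    if (more) more.click();

    // Give the page's scripts time to update the DOM (2 s is an arbitrary guess).
    setTimeout(() => {
      console.log(JSON.stringify(scrapeText(document, ROW_SELECTOR)));
    }, 2000);
  });
}
```

This only logs the scraped text to the browser console; getting the data out of the browser (to a file or a server) is a separate step that depends on your setup. A fixed delay is also fragile, so on a real page you would probably want to wait for a specific element to appear instead.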

-Adam
