1

Currently I'm using python and its urllib2, urllib to retrieve a simple static web page. Everything was smooth until web-page developers added java scripts. Now the most interesting information is hidden behind the scripts:

<a href="javascript://" class="event-more-view" id="view-moreid-12311" onclick="Markets.applyView(this);return false;" treeid="1291266" eventstate ="false" > add table </a>

Browser preloads data and shows it when the "a href" link is clicked. The results of my short research are JSOUP and HTMLunit. Am I digging in a right direction? Any cons and pros?

Will python help? Should I be using Java? What packages can help with dynamic content? What is simpler?

In my case I have to create some sort of a virtual browser as far as built-in scripts refresh data over time which has to be processed.

1 Answer 1

2

You are digging in a right direction.

Here are some options/tools to consider:

See also:

Hope that helps.

Sign up to request clarification or add additional context in comments.

1 Comment

Thanks for your answer... Looking in to these packages... What about Ruby and Watir?

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.