"Can I crawl websites in java script using rapidminer"

ArnoGArnoG MemberPosts:22Contributor II
edited June 2019 inHelp
I have a problem crawling a website. I believe the problem is that the website is build in javascript. Is it possible to crawl such a page using rapidminer?

For example:http://www.booking.com/hotel/nl/easyhotel-amsterdam.nl.html?sid=9fc05dc001129cc3698397a2efbfba2f;dcid=1#hash-blockdisplay4

When I use the Crawl web operator i only creates two files. The files leads to the startingpage of the hotel, not the review page. While I use the reviewpage as URL in the operator.

How can I crawl this website?

Thanks Arno
Tagged:

Answers

  • ArnoGArnoG MemberPosts:22Contributor II
    The process I created so far leads me to the starting page of a specific hotel and not to the review page.






    <宏/ >




    http://www.booking.com/hotel/nl/easyhotel-amsterdam.nl.html?sid=9fc05dc001129cc3698397a2efbfba2f;dcid=1#hash-blockdisplay4"/>

















  • Nils_WoehlerNils_Woehler MemberPosts:463Maven
    Hi Arno,

    unfortunately at the moment this is not possible.

    Best,
    Nils
  • ArnoGArnoG MemberPosts:22Contributor II
    Hi Nils,
    Thanks for your answer. Maybe a functionality in the next releases. More and more websites are using javascript.
    I crawled the webites using 'Mozenda', works perfectly!

    Regards, Arno
Sign InorRegisterto comment.