"Problem with FeatureExtraction (VISTA, Java 1.6.16)"

skarabskarab MemberPosts:10Contributor II
edited May 2019 inHelp
Could somebody explain me how to use FeatureExtraction, because it does not work for me...

Here is my code:
















<参数键= "结果" value = " / h: html / h:身体/ h: div/h:div[2]/h:div[3]/h:div/h:div[3]/h:div/h:ul/hi:li[2]/h:div[2]/text()"/>







I am using it in my plan, which extracts data for analysis from HTML page using XPATH:
Tagged:

Answers

  • skarabskarab MemberPosts:10Contributor II
    Here is my complete plan, which I use to traverse directories, read html files, clean them, replace DOCTYPE declaration, and finally, extract data fields:




















    <参数键= "目录" value = " % {file_path} " / >



























































    <参数键= "结果" value = " / h: html / h:身体/ h: div/h:div[2]/h:div[3]/h:div/h:div[3]/h:div/h:ul/hi:li[2]/h:div[2]/text()"/>




















  • landland RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:2,531Unicorn
    Hi,
    what's the error message? Perhaps your XPath expression is wrong? Did you take a look into the Operator description?

    Greetings,
    Sebastian
  • skarabskarab MemberPosts:10Contributor II
    The problem is solved... I had a mistake (stupid;)) in namespace mapping of FeatureExtractor...


    ...

    http://www.w3.org/1999/xhtml"/>

    ...


    and...

    works ;D
Sign InorRegisterto comment.