"Association rule creating. problem with FP-growth operator"

mzharovmzharov MemberPosts:5Contributor I
edited June 2019 inHelp

Hi! I have a problem with handling Excel file via association rule creating. Please let me know what do I do wrong? The error message I got is "The exampleset contains non-nominal attribute "prod.date" which is not allowed to fp-growth" .". I try to handle data from Excel file but I have transformed it to .txt in order to attach it.


<运营商激活= " true "类=“检索”兼容ibility="8.0.001" expanded="true" height="68" name="Retrieve рабочий файл для проверки (2)" width="90" x="45" y="34">








































Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    @mzharovcan you just use a Select Attributes and remove 'prod.date?' Then see if it works.

    sgenzer
  • mzharovmzharov MemberPosts:5Contributor I

    Hi Thomas

    I have deleted any dates from my file at all. Ran the same process Retrive-Nominal to Binominal-FP-Growth-Create assoc. rule and got a similar error "The exampleset contains non-nominal attribute "ID" which is not allowed to fp-growth" . ID is just unique number of lines in my file. Could you tell where is my mistake ? Thanks in advance

    screen shot of error and sample of loading file are in attachment

    error.docx 155.2K
    sample.docx 14.8K
  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    I dont open DOCX files as a rule, so you'd have to post a screenshot. Also, your XML is corrupt. You have to post the XML correctly like from this KB articlehttps://community.www.turtlecreekpls.com/t5/RapidMiner-Studio-Knowledge-Base/How-can-I-share-processes-without-RapidMiner-Server/ta-p/37047

    sgenzer
  • mzharovmzharov MemberPosts:5Contributor I

    Hi Thomas,

    ok, once again . I created new xml and attached correct files. pleace, check . thanks in advance.


    <运营商激活= " true "类=“检索”兼容ibility="8.1.001" expanded="true" height="68" name="Retrieve" width="90" x="45" y="85">










































    error.pptx 202.6K
    sample.txt 439B
  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    @mzharovthe XML is still corrupted. You have to open the XML view and copy it from there.

    Also as a rule, I don't open PPTX files either. Pretty much nothing from MSFT.

  • mzharovmzharov MemberPosts:5Contributor I

    Hi Thomas,

    actually I do not have an xml option in View-Show panel. I have turned xml at bottom of screen on. So xml from that screen attached. Please, check it. Thanks. Konstantin









    <运营商激活= " true "类=“检索”兼容ibility="8.1.001" expanded="true" height="68" name="Retrieve" width="90" x="45" y="85">





    <连接from_op = "Retrieve" from_port="output" to_op="Nominal to Binominal" to_port="example set input"/>
    <连接from_op = "Nominal to Binominal" from_port="example set output" to_op="FP-Growth" to_port="example set"/>
    <连接from_op = "FP-Growth" from_port="frequent sets" to_op="Create Association Rules" to_port="item sets"/>
    <连接from_op = "Create Association Rules" from_port="rules" to_port="result 1"/>






  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    @mzharovok, the XML works now but your data file appears to be rather strange. Is it supposed to look this way, meaning are different data-types mixed into the same attribute column? It could also be my encoding as well.

    2018-03-21_6-29-26.png

  • mzharovmzharov MemberPosts:5Contributor I

    Hi Thomas,

    I have fixed my file. But anyway I get the similar error . Is there something wrong with data format ?? More info about my file below in the bottom of my post









    <运营商激活= " true "类=“检索”兼容ibility="8.1.001" expanded="true" height="68" name="Retrieve" width="90" x="112" y="34">





    <连接from_op = "Retrieve" from_port="output" to_op="Nominal to Binominal" to_port="example set input"/>
    <连接from_op = "Nominal to Binominal" from_port="example set output" to_op="FP-Growth" to_port="example set"/>
    <连接from_op = "FP-Growth" from_port="frequent sets" to_op="Create Association Rules" to_port="item sets"/>
    <连接from_op = "Create Association Rules" from_port="rules" to_port="result 1"/>






    the sample of my loading data below.

    Row ID code material name stock am.per serie stock date

    1 1.0 1.01010204E8 Амлодипин 78.789 8.789 Wed May 01 00:00:00 MSK 2013
    2 2.0 1.01010204E8 Амлодипин 70.0 25.0 Wed May 01 00:00:00 MSK 2013
    3 3.0 1.01010204E8 Амлодипин 70.0 45.0 Wed May 01 00:00:00 MSK 2013
    4 4.0 1.01010286E8 Акрилжелтый 83.563 83.563 Wed May 01 00:00:00 MSK 2013
    5 5.0 1.00000001E8 АкрилРINK 13.542 13.542 Wed May 01 00:00:00 MSK 2013
    6 6.0 1.01010441E8 Акрил 9.888 9.888 Wed May 01 00:00:00 MSK 2013
    7 7.0 1.01010273E8 SАденозилм 577.251 77.251 Wed May 01 00:00:00 MSK 2013
    8 8.0 1.01010273E8 SАденозилм 500.0 250.0 Wed May 01 00:00:00 MSK 2013
    9 9.0 1.01010273E8 SАденозилм 500.0 250.0 Wed May 01 00:00:00 MSK 2013
  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    @mzharovok, I see what's going on here. You have two attribute columns with real numbers in them. In order to use the FP-Growth operator, you must convert all the data into binmomals (true/false). The conversion is not happening for the numerical values as the Nominal to Binomal operator can't figure out how to transform 70 to true or false.

    You would have to figure out how to manipulate the numericals into true/false OR remove them from the data set.

    sgenzer
Sign InorRegisterto comment.