Create Association Process Error

jes_craig_94jes_craig_94 MemberPosts:3Contributor I
edited June 2019 inHelp

I am trying to run a process to create associations but am getting "no results found". I am using excel files running through Process Documents from Files, then to Numerical to Binomial, to fp-growth, and then create association rules. I have tried changing the number of files I use and changing min support on fp-growth but all changes still return no results. Any suggestions on how to get the process to run? Thanks!

Answers

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager

    hello@jes_craig_94- welcome to the community. We would be happy to help you. Can you please post your XML process (see instructions titled "Read Before Posting" on the right) and, if possible, the data files you are trying to use?

    Scott

  • jes_craig_94jes_craig_94 MemberPosts:3Contributor I

    这是XML和附加文件。谢谢for the help@sgenzer!




    < output/>


    < operator activated="true" class="process" compatibility="7.6.001" expanded="true" name="Process">

    < operator activated="true" class="text:process_document_from_file" compatibility="7.5.000" expanded="true" height="82" name="Process Documents from Files" width="90" x="179" y="136">





    < operator activated="true" class="text:tokenize" compatibility="7.5.000" expanded="true" height="68" name="Tokenize" width="90" x="112" y="85"/>
    < operator activated="true" class="text:transform_cases" compatibility="7.5.000" expanded="true" height="68" name="Transform Cases" width="90" x="246" y="85"/>
    < operator activated="true" class="text:filter_stopwords_german" compatibility="7.5.000" expanded="true" height="68" name="Filter Stopwords (German)" width="90" x="447" y="85"/>
    <运营商激活= " true " class = "文本:filter_by_length" compatibility="7.5.000" expanded="true" height="68" name="Filter Tokens (by Length)" width="90" x="581" y="85"/>










    < operator activated="true" class="numerical_to_binominal" compatibility="7.6.001" expanded="true" height="82" name="Numerical to Binominal (2)" width="90" x="380" y="85">


    < operator activated="true" class="fp_growth" compatibility="7.6.001" expanded="true" height="82" name="FP-Growth" width="90" x="514" y="85">



    < operator activated="true" class="create_association_rules" compatibility="7.6.001" expanded="true" height="82" name="Create Association Rules" width="90" x="648" y="85"/>













    CFG1.csv 23.5K
    NCFG2.csv 17K
    NCFG3.csv 38.3K
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager

    hello@jes_craig_94- thanks for posting. OK it's pretty clear why you're getting no results. If you put a breakpoint after Process Documents from Files, you will see that you only have three examples in your example set, one from each file:

    Screen Shot 2017-11-22 at 7.01.32 PM.png

    So RapidMiner does not have much to go on here in order to create association rules. My hunch is that this is not what you intended - you probably wanted each row in your csv to be a separate example, not each file. That operator, at least in my experience, is when you have a large folder of individual text files that you want to analyze. If my hunch is correct, you're going to likely be more successful with something like this:




    < output/>


    < operator activated="true" class="process" compatibility="7.6.001" expanded="true" name="Process">

    < operator activated="true" class="concurrency:loop_files" compatibility="7.6.001" expanded="true" height="82" name="Loop Files" width="90" x="45" y="34">




    < operator activated="true" class="read_csv" compatibility="7.6.001" expanded="true" height="68" name="Read CSV" width="90" x="246" y="34">









    < portSpacing端口= " sink_output 2”间隔= " 0 " / >


    < operator activated="true" breakpoints="after" class="append" compatibility="7.6.001" expanded="true" height="82" name="Append" width="90" x="179" y="34"/>
    < operator activated="true" class="text:process_document_from_data" compatibility="7.5.000" expanded="true" height="82" name="Process Documents from Data" width="90" x="313" y="34">


    < operator activated="true" class="text:tokenize" compatibility="7.5.000" expanded="true" height="68" name="Tokenize" width="90" x="179" y="85"/>
    < operator activated="true" class="text:transform_cases" compatibility="7.5.000" expanded="true" height="68" name="Transform Cases" width="90" x="313" y="85"/>
    < operator activated="true" class="text:filter_stopwords_german" compatibility="7.5.000" expanded="true" height="68" name="Filter Stopwords (German)" width="90" x="514" y="85"/>
    <运营商激活= " true " class = "文本:filter_by_length" compatibility="7.5.000" expanded="true" height="68" name="Filter Tokens (by Length)" width="90" x="648" y="85"/>










    < operator activated="true" class="numerical_to_binominal" compatibility="7.6.001" expanded="true" height="82" name="Numerical to Binominal (2)" width="90" x="447" y="34">


    < operator activated="true" class="fp_growth" compatibility="7.6.001" expanded="true" height="82" name="FP-Growth" width="90" x="581" y="34">



    < operator activated="true" class="create_association_rules" compatibility="7.6.001" expanded="true" height="82" name="Create Association Rules" width="90" x="715" y="85"/>














    This process does not work (yet) as there has to be some ETL done to get the csv files to append properly. I will leave that to you.:)

    Good luck!


    Scott

  • jes_craig_94jes_craig_94 MemberPosts:3Contributor I

    @sgenzerthank you so much for the help! I was able to load the files and run through the breakpoint point after the "Append" but I am running into an issue on the "Process Documents from Data" and it is not returning any attributes. I am using the suggestion you gave and it still gave me no results. What should I expect to see? And do you have any idea what I could be doing wrong? I assume this data will produce some results but again, am unsure.

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager

    hmm...you have lots of examples now and they are all binomial now, right? Can you share your new process?

    Scott

Sign InorRegisterto comment.