Finding correlations between outputs and inputs for large number of data

maeltmaelt MemberPosts:1Newbie
Hi everyone!

I'm very new at Rapidminer and I have a question (I just installed it and started toying around with the "auto-model").

我想实现的是:我做了各种测试s by variating some inputs, and I have an excel file with for each tests the used inputs, the outputs (temperature, forces...). Since I have a large number of tests, I would like an analysis using a software like rapidminer. I would like to find correlation between inputs and outputs (like I have lower forces for this kind of tests... things like that).

I'm not quite sure if rapidminer is suitable for this? If this kind of analysis is achievable through rapidminer, I would really appreciate if you could indicate me some tutoriel to achieve this or give me some advices here (english is not my first language as you may have noticed and I have difficulties to find something that match my problem. So far on the forum I just found some posts suggesting using auto-model).


Have a good day.

Tagged:

Answers

  • varunm1varunm1 Moderator, MemberPosts:1,207Unicorn
    Hello@maelt

    If you are trying to find a correlation between attributes(including output labels), You can use Correlation matrix operator in RapidMiner which provides you with a correlation matrix. In the below scenario I selected Titanic training dataset from samples which have an output label "Survived". I included this so that I can find the correlation between inputs and output. I also provided XML code below for your understanding. You can also observe which of these are highly correlated based on their coloring.


                                              
    Please inform if this is not what you are looking for.

    Thanks

    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

Sign InorRegisterto comment.