Finding correlations between outputs and inputs for a large amount of data
Hi everyone!
I'm very new to RapidMiner and I have a question (I just installed it and started toying around with Auto Model).
What I want to achieve: I ran various tests by varying some inputs, and I have an Excel file that lists, for each test, the inputs used and the outputs (temperature, forces...). Since I have a large number of tests, I would like to analyze them with software like RapidMiner. I would like to find correlations between inputs and outputs (for example, that forces are lower for a certain kind of test).
I'm not quite sure whether RapidMiner is suitable for this. If this kind of analysis is achievable with RapidMiner, I would really appreciate it if you could point me to a tutorial or give me some advice here (English is not my first language, as you may have noticed, and I have difficulty finding anything that matches my problem. So far on the forum I have only found some posts suggesting Auto Model).
Have a good day.
Answers
If you are trying to find correlations between attributes (including output labels), you can use the Correlation Matrix operator in RapidMiner, which produces a correlation matrix. In the scenario below, I selected the Titanic Training dataset from the samples, which has an output label "Survived". I included the label so that I can find the correlation between the inputs and the output. I also provided the XML code below for your understanding. You can also see which attributes are highly correlated from their coloring.
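Outside RapidMiner, the idea behind a correlation matrix can also be sketched in a few lines of plain Python. This is only an illustration, not the RapidMiner process: the column names and values below are made up for the example (they would come from your Excel file), and it assumes the Pearson coefficient, which only captures linear relationships between numeric columns.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    denom = sqrt(var_x * var_y)
    # A constant column has zero variance, so the coefficient is undefined.
    return cov / denom if denom else float("nan")

def correlation_matrix(columns):
    """Pairwise correlations for a dict mapping column name -> list of values."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}

# Hypothetical test results: two inputs (speed, feed) and two outputs.
tests = {
    "speed": [10, 20, 30, 40, 50],
    "feed": [1.0, 1.0, 2.0, 2.0, 3.0],
    "force": [95, 90, 70, 66, 50],
    "temperature": [200, 260, 310, 390, 440],
}

matrix = correlation_matrix(tests)

# Print one row per column, values rounded to two decimals.
for row_name, row in matrix.items():
    cells = "  ".join(f"{row[col]:6.2f}" for col in tests)
    print(f"{row_name:>12}  {cells}")
```

Values close to 1 or -1 indicate a strong linear relationship between two columns, and values near 0 indicate little or none. In this made-up data, force falls as speed rises, so that pair gets a negative coefficient.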
Thanks
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing