Use sampling inverse output
Hi,
i would like to use the output of the "sample"-operator as training data and the rest values as test data.
What's the easiest way to extract the rest of the data which was not selected by the "sample"-Operator?
Thank you very much!
i would like to use the output of the "sample"-operator as training data and the rest values as test data.
What's the easiest way to extract the rest of the data which was not selected by the "sample"-Operator?
Thank you very much!
Tagged:
0
Best Answer
-
varunm1 Moderator, MemberPosts:1,207UnicornHello@StefanRei
Actually, you can do the same with Split data operator in one step. We can also solve your exact requirement. After playing with some operators, I think here is what you are looking for.
I used the titanic dataset for this and added an ID column using generate ID operator, I then applied sampling(your requirement) based on probability (0.7). The output of the sample is connected to Multiply as we need two sets one for comparison and other for training the model. Then a Set Minus operator is used to remove the Sampled data from original data so that it gives us dataset with samples that are not present in our sample dataset. You can use one output of Multiply operator for testing and the "exa" of set minus operator for testing your model. XML code is provided below, click on SHOW.
XML:
<宏/ >
<连接from_op = "设置-“from_port =“原始”to_port="result 1"/>
Regards,
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
7
Answers
MarlaBot