Using Coordinate Data in predictive models

davidnealbrowndavidnealbrown MemberPosts:4Contributor I
edited December 2018 inHelp

I have data that includes coordinates X,Y,Z + condition and a label of interest (outcome). How do I build a predictive model using these coordinates as a factor?

Answers

  • IngoRMIngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University ProfessorPosts:1,751RM Founder

    Hi,

    Is this data in tabular format, i.e. can you read it into RapidMiner and end up with 4 columns for x,y,z, condition and then a 5th one with the outcome information? If yes, all you need to do is to define the outcome column as label using the "Set Role" operator and using any of the included machine learning algorithms to build the model.

    And if this does not tell you anything, I highly recommend to go through the tutorials first which are available at the help screen which comes up if you press the icon in the top right corner of RapidMiner.

    Hope this helps,

    Ingo

    sgenzer
  • davidnealbrowndavidnealbrown MemberPosts:4Contributor I

    Thank you Ingo. This does help and yes the data sets are in tis format. So if I understand you correctly you are saying that the rapid miner models will automatically consider the x,y, and z coordinate in the example rather than considering an individual axis. Is that correct?

    进一步澄清一下,我使用fMRI哒ta - brain activation coordinates in Tailarach standardized sapce. here is a sample spread sheet. I know there is not enough data for a model it is for illustration purposes only.

    Thanks!

    DNB

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    What@IngoRMis saying is that the coordinates will just be input in your model and won't carry with it the implied information of it being in some 3D space tied to a reference point. A set of coordinates tied to no reference point (i.e. the Earth) would be meaningless.

    In your case I would look at transforming the coordinates to a new feature, maybe a new distance feature. Without knowing more about your particulary task but you could calculate the distance between the coordinates to each other, or your reference point. That might be more useful than just coordinates. Something to think about.:)

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager

    hello@davidnealbrownthanks for the csv file. That's a very interesting use case! Some thoughts/questions...

    - so as@Thomas_Ottsaid, RapidMiner has no idea what x, y, and z coordinates are. They are just numerical attributes. But I cannot see any reason why that would be a problem in your use case.

    - what are the cluster #s? Are these the results of a segmentation process or some medical terminology?

    - I assume you're trying to predict Label via decision tree, right? So your process looks something like this?


































    <连接from_op =“性能”摇来摇去m_port="performance" to_port="performance 1"/>





















    Very cool. Keep us posted.


    Scott

    [EDIT: and thank you - I just googled "Talairach standardized space" and learned a whole new thing.:)

  • davidnealbrowndavidnealbrown MemberPosts:4Contributor I

    Thanks!

Sign InorRegisterto comment.