Labelled and unlabelled data use in single model

ShrikantShrikant MemberPosts:2Newbie
Read CSV is for initial data. is labelled
Read CSV (2) is for _new.csv file for prediction is unlabelled.

Inside cross validation:

Is this approach correct?





Answers

  • ceaperezceaperez MemberPosts:447Unicorn
    Shrikant,

    Because the second CSV file is used to validate you model, both datasets need to have the same structure, data types and roles.

    如果你需要改变或asign new roles you can use the Set Role operator.

    Best,

    Cesar
  • ShrikantShrikant MemberPosts:2Newbie
    second file has same structure except it doesn't have one column which is for prediction. In practice first file is the past data and the second one is for prediction. Then that design still correct?
  • ceaperezceaperez MemberPosts:447Unicorn
    Hi again,

    The design is correct.

    Best
Sign InorRegisterto comment.