Row name changes in Turbo Prep and apply calculations to all cells

graceweigracewei MemberPosts:9Contributor I
Hi,

Brief summary of my data set:
I have 6152 columns/attributes and 156 rows/examples. My columns are DNA names, and my rows are patient IDs. Simply put, the values are the number of mRNAs in each DNA for each patient, and these are transformed by log.

I have two separate questions about RapidMiner:

1.我每次运行流程,第一列of the result table is always row names like I wanted. However, after I click on Turbo Prep, the row name column always becomes the last column. I have tried the "Set Role" operator, and tried making the target role as "label" and "id". But it still doesn't work. How can I fix this problem?

2. I want to apply the same formula to all the cells/values (except for the column names and row names) in my table. I want to reverse log base 10. In other words, let all the values become the exponent of 10. How can I achieve that? I was thinking to use the GENERATE function in Turbo Prep, but it seems like that only creates a new column and my dataset is too large for that.

Thank you in advance!
Tagged:
Tghadially

Answers

  • cdapontecdaponte MemberPosts:29Maven
    Hi, for the first question you can try using the Reorder Attributes operator. And for the second question, you can with operators like Replace or Map.
  • yyhuangyyhuang Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:363RM Data Scientist
    Hi@gracewei,

    I don't think RapidMiner will use the row names like R. If your mRNA data has meaningful row names, you can convert the rownames to a proper column of the data and set it to special role (role can be called rname or name).

    If you want to generate new columns with a formula like 10^(att) for 6000+ columns, a "loop attribute" is suggested here.
    Inside the loop attribute, you will use "Generate Attribute" for the transformations.

    Cheers,

    YY




    Tghadially sgenzer
  • yyhuangyyhuang Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:363RM Data Scientist
    Example "Loop" with "Generate Attribute"

    < ?xml version = " 1.0 " encoding = " utf - 8 " ?> <过程版本sion="9.3.001">                                                                                


    Tghadially sgenzer gracewei
Sign InorRegisterto comment.