"Struggling importing CSV from R Studio to Rapidminer"

matteomartignonmatteomartignon MemberPosts:3Contributor I
edited June 2019 inHelp

Hello guys,

I have a lot of problems importing my CSV from R Studio into Rapidminer.

The firt 50 rows are completly gone..

Here the comand in R Studio I used to save my dataframe:

write.table(musei, file = "musei_18_F.csv", quote = TRUE, sep = ",", row.names = FALSE)

//SCREENSHOTS

Schermata 2017-07-26 alle 11.26.59.pngMy CSV from R StudioSchermata 2017-07-26 alle 11.26.43.png相同的CSV Rapidmine进口r

Tagged:

Answers

  • Thomas_OttThomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,761Unicorn

    You have 31 warnings when importing the CSV file, look at the lower right of the 2nd image. Click on that and see what it says.

  • matteomartignonmatteomartignon MemberPosts:3Contributor I

    Got it

    Is there a way to fix this problem in R?

    Thanks on advanceSchermata 2017-07-26 alle 12.23.36.pngWarnings

  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,635Unicorn

    In the last screenshot it looks like your separator might be a ";" instead of a "," which isn't what you are telling RapidMiner, which would definitely cause problems. If you post a sample data file in addition to the screenshots it would be easier to troubleshoot.

    Brian T.
    Lindon Ventures
    Data Science Consulting from Certified RapidMiner Experts
  • matteomartignonmatteomartignon MemberPosts:3Contributor I

    Sure, this is my CSV.

    It is a data frame I extracted with RFacebook. The function was "search.pages".

    Thanks in advance,

    matteo

  • Telcontar120Telcontar120 Moderator, RapidMiner Certified Analyst, RapidMiner Certified Expert, MemberPosts:1,635Unicorn

    Looking at the raw csv it is evident that there are many line break characters in some of the records and that is what is causing the parsing problem with the import of certain records in RapidMiner. It's theoretically possible to clean this up in RapidMiner but it is probably easier to do it in R or Python using some scripting to remove the extra characters before importing.

    Brian T.
    Lindon Ventures
    Data Science Consulting from Certified RapidMiner Experts
    matteomartignon
  • SGolbertSGolbert RapidMiner Certified Analyst, MemberPosts:344Unicorn

    I've also seen your CSV. As far as I can tell, there is a problem writing the CSV. The problem is entirely in the R part, so there is no much help to give from our part.

    I can suggest to save the CSV and then read it again from R with read.table() or similar. When you manage to correctly retrieve the file, it will be ready to be read by RapidMiner.

    Or you can put the code that generates the dataset inside the R scripting operator in RapidMiner, then you won't need to save a CSV file.

    MartinLiebig FBT matteomartignon
Sign InorRegisterto comment.