Database doesn't appear after importing

Jack1701Jack1701 MemberPosts:5Newbie
edited September 2019 inHelp
Hello,

我想import a .csv database to rapidminer, but after I tell it where to store the data, it says that it is importing the data to the location I specified. Then after the importing data window closes the dataset doesn't appear.

我想import the 2014 .csv from the Stanford Database on Ideology, Money in politics and elections: public version 2.0 and I've already extracted it from the .gz. I use the settings that it recommends in the data format, and no errors seem to appear, I have it replace errors with missing values then I try to place the data in the local repository, and it says it is importing the data, but nothing happens when it looks like it is finished.

I don't know what is going on, and why it is doing this.

Thank you for the help,

Jack.

Best Answer

Answers

  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager
    @Jack1701that clearly should not happen. Can you please share a screenshot when "Then after the importing data window closes the dataset doesn't appear"? Also please send me your rapidminer-studio.log file. It is in your .RapidMiner folder.

    Scott

    Tghadially Jack1701
  • Jack1701Jack1701 MemberPosts:5Newbie


    Here is a screenshot right before the import appears to stop, and what it shows after.



  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager
    @Jack1701ok you did not warn me that this was a 10GB csv file:smile:I just tried to import it myself and got an error (exactly the same as what I saw in your log file:


    Keep in mind that it took RM about 15 min to get here. How much RAM do you have on your machine?

    Scott

    Tghadially
  • Jack1701Jack1701 MemberPosts:5Newbie
    Sorry, I didn't realize that was abnormally large.

    I have 16.0 GB of RAM on my machine.
  • Jack1701Jack1701 MemberPosts:5Newbie
    I opened it in Excel, and it was able to load the first 1,048,575 rows, and RapidMiner was able to import that when I saved it in another .csv file. It ended up being about 350 MB, is there a way to get excel to load different parts of the file, and load the file in, in parts, or is that the most excel can do in this case?
    Tghadially
  • sgenzersgenzer Administrator, Moderator, Employee, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM ModeratorPosts:2,959Community Manager
    so as you can see, Excel is a piece of[email protected]when it comes to handling large data sets. My local installation (Office 365 Excel for Mac) only uses ONE logical processor so it's not even parallelized. My advice would be to load the data set into a MySQL database and forget Excel.

    Scott
    Jack1701
  • Jack1701Jack1701 MemberPosts:5Newbie
    Ok, thank you so much for the help.:)
Sign InorRegisterto comment.