Sign In
Register
乐鱼官网手机版下载
Solutions
乐鱼体育安装
Pricing
Partners
Company
Howdy, Stranger!
It looks like you're new here. Sign in or register to get started.
Sign In with RapidMiner
Sign In with RapidMiner
Sign In
Register
Quick Links
Categories
Recent Discussions
Best Of...
Unanswered
Groups
Categories
所有类别
19.7K
Help
442
Knowledge Base
Altair RapidMiner Community
GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.
Discussion
Random sampling of a large corpus
Author
Date within
1 day
3 days
1 week
2 weeks
1 month
2 months
6 months
1 year
of
Examples: Monday, today, last week, Mar 26, 3/26/04
Search
0 Comments
0 Discussions
0 Members
0 Online
ASK A QUESTION
FIND HELPFUL VIDEOS
Home
›
Help
Random sampling of a large corpus
frankc
Member
Posts:
3
Contributor I
September 2014
edited November 2019
in
Help
How can you pick a random sample from a large corpus for files to perform pre-processing and text mining with the text mining extension? Is there an operator that does that?
Frank
Tagged:
Sampling
0
Answers
homburg
Moderator, Employee, Member
Posts:
114
RM Data Scientist
September 2014
Hi frankc,
just a quick question. Do you want to read a random set of files or read all files and shuffle a random set of documents?
Cheers,
Helge
0
bkriever
RapidMiner Certified Analyst, Member
Posts:
11
Contributor II
October 2014
You should be able to use the Sample operator and select "
我们e local random seed
" to select a random sample, similar to a non-text data set.
0
Sign In
or
Register
to comment.
Answers
just a quick question. Do you want to read a random set of files or read all files and shuffle a random set of documents?
Cheers,
Helge