How to read a PDF file in RapidMiner
KanikaAg15
Member Posts: 19 Contributor I
in Help
Hi,
I have a PDF file that contains both text and tabular content. I would like to build a pipeline that reads only specified tables from the PDF. Can anyone recommend a process for this?
The first requirement is reading the PDF into RapidMiner.
The second requirement is extracting the information from the PDF.
Best Answer
MarcoBarradas Administrator, Employee, RapidMiner Certified Analyst, Member Posts: 271 Unicorn
Hi @KanikaAg15,
You'll need to add the Text Processing extension, which will let you extract the data from the PDF.
There is another extension that might be useful: PDF Table Extraction.
This course will also be useful: https://academy.www.turtlecreekpls.com/learn/course/text-and-web-mining-with-rapidminer/text-and-web-mining/lets-get-started
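If the extensions don't cover your case, the two steps (read the PDF, then keep only the tables you want) could also be sketched in Python, for example from a script run alongside your process. This is only a rough sketch and not how the extensions above work: it assumes the third-party pdfplumber library is installed, that each table's first row is its header, and the function names `table_to_records` and `read_pdf_tables` are made up for illustration.

```python
from typing import Dict, List, Optional

# One table row as returned by the extraction step: cells may be None.
Row = List[Optional[str]]


def table_to_records(table: List[Row]) -> List[Dict[str, str]]:
    """Convert one extracted table into a list of dicts.

    Assumes the first row is the header; fully empty rows are skipped.
    """
    if not table:
        return []
    header = [(cell or "").strip() for cell in table[0]]
    records = []
    for row in table[1:]:
        cells = [(cell or "").strip() for cell in row]
        if not any(cells):
            continue  # skip rows where every cell is empty
        records.append(dict(zip(header, cells)))
    return records


def read_pdf_tables(path: str, wanted: List[int]) -> List[List[Dict[str, str]]]:
    """Read all tables from a PDF and keep only those at the given positions.

    `wanted` holds zero-based positions in document order (an assumption
    about how you identify "the specified tables").
    """
    import pdfplumber  # third-party; imported here so the helper above works without it

    tables = []
    with pdfplumber.open(path) as pdf:
        for page in pdf.pages:
            tables.extend(page.extract_tables())
    return [table_to_records(tables[i]) for i in wanted if i < len(tables)]
```

Calling `read_pdf_tables("report.pdf", [0, 2])` (a hypothetical file name) would return the first and third tables as lists of records, which you could then write out as CSV and load with a Read CSV operator. How well this works depends a lot on how the tables are drawn in the PDF, so check the output against the original.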