Access e-mails with the Text Processing Extension

David_ADavid_A Administrator, Moderator, Employee, RMResearcher, MemberPosts:294RM Research
edited December 2018 inKnowledge Base

现在电子邮件可能是最常见的来源of text you encounter every day. They are also a common source of data for text mining applications, e.g., spam detection or sentiment analysis.

Having a direct access to an e-mail account and processing mails automatically can be a great boon when putting text mining tasks into production. New incoming mails can be processed and their content scored, which can even trigger more reactions.


Read Documents (Mail)

The Text Processing Extension available at the RapidMinermarketplaceincludes an Operator that allows exactly that:

Read Documents (Mail).

It provides access using the common IMAP protocoll or the older POP3 standard. All you need to know is the host address, the log-in data and the used protocol. Also quite important is thefolderparameter, as you seldom want to download the complete inbox folder.

Connection properties

Unfortunately it is not always as simple as it could be. Some e-mail provider requires you to set additional parameters to access their services and it is sometimes hidden at their sites.

Below are the settings for the two most common providers: Microsoft Outlook (Office365) and Gmail.


Outlook 365

Parameter settings:

Outlook_parameterspng

Connection properties:

ReadDocuments_Outlook_propertiespng

Gmail

Parameter settings:

Gmail_parameterspng

Connection properties:

ReadDocuments_GMail_propertiespng

Similar is theProcess Documents from Mail StoreOperator, which requires the same properties settings. Whith this Operator it is possible to directly work on single e-mails. The Operator also allows to download attached files

DocMusher Telcontar120 yyhuang Edin_Klapic Pavithra_Rao BalazsBarany

Comments

  • osadasliitosadasliit MemberPosts:3Contributor I

    Hi

    I am new to rapidminer how do you access Gmail parameter settings forms. in send mail operator only provide

    to

    supbject

    html check box

    body text

    headers

    Gmail

    Parameter settings: form

  • osadasliitosadasliit MemberPosts:3Contributor I

    Hi

    if I want to send mail (not read mail) how do we setup smtp information in rapidminer studio for gmail let say we use send mail operator

    to

    supbject

    html check box

    body text

    headers

    thanks

Sign InorRegisterto comment.