Categories

Versions

You are viewing the RapidMiner Radoop documentation for version 9.0 -Check here for latest version

RapidMiner Radoop: Big Data Predictive Analytics

RapidMiner Radoop provides an easy-to-use graphical interface for analyzing data on a Hadoop cluster with a running Hive server. This introduction provides a quick description of the software and the capabilities of the solution for processing and analyzing big data.

Understanding the basic architecture

RapidMiner Radoop is client software with an intuitive graphical user interface. Radoop requires your Hadoop cluster to be accessible from the client running RapidMiner Studio (and RapidMiner Server, if applicable). The diagram below shows the basic architecture of the RapidMiner Radoop solution on RapidMiner Studio:

You can also use RapidMiner Radoop on RapidMiner Server for scheduling and managing client-created processes, as well as for collaboration and as a web reporting interface. The diagram below incorporates RapidMiner Server to show the basic architecture of the complete solution:

Documentation overview

This document,RapidMiner Radoop Overview, provides some background and resource material for using Radoop. It assumes that you are already familiar with usingRapidMiner Studio.

The document provides:

  • a quickoverview of Radoop, including a description of Radoopoperatorsand theHadoop Data viewwhich allows you to easily manage your data on the cluster.

  • a guide to导入数据if it is not already in a Hive structure on the Hadoop cluster.

  • explanation of更高级的功能, including designing data mining processes, scoring, evaluation, and advanced data flow design.

  • discussion of how to modify一组tingsthat influence Radoop.