You are viewing the RapidMiner Radoop documentation for version 9.4 -Check here for latest version

What's New in RapidMiner Radoop 8.0.0?

Released:December 04th, 2017

The following improvements are part of RapidMiner Radoop 8.0.0.

NewK-Meansoperator is available that is based on the Spark MLlib/ML clustering algorithm
Hive access inSpark Scriptis now supported (HiveContextin Spark 1.x andSparkSessionin Spark 2.x), if the user has the appropriate privileges on the cluster
AddedSpark 2.2support (new option in theSpark Versionlist)
Aggregatenow support concatenation option as well
Hive (JDBC) connection error now displays more information
Import from Cluster Manageris now the default option to create a Radoop connection
K-Means,Fuzzy K-MeansandCanopyMahout clustering operators are now deprecated
Mahout testis now deprecated
Replace missing quick fix is now offered if there are known missing values in the input of an operator that does not handle them

BUGFIX: Installation of an Apache Spark release is no longer needed for more recent Cloudera Hadoop releases in order for Spark jobs to work (and not give anUnknown/unsupported paramerror). Use the newSpark 1.6 (CDH)Spark version connection setting in this case.
BUGFIX:K-Meansmodel apply inside Radoop Nest no longer fails withClassNotFoundExceptionorNoClassDefFoundError
BUGFIX: FixedCannot initialize Clustererror when a certain failover proxy provider class is used on the cluster (org.apache.hadoop.yarn.client.RequestHedgingRMFailoverProxyProvider) and the connection is imported
BUGFIX:Spark ScriptwithRno longer relies on the private RDD API inSpark 2.x
BUGFIX: Quick fix that adds aType Conversionoperator now selects the proper target type