Categories

Versions

You are viewing the RapidMiner Radoop documentation for version 9.4 -Check here for latest version

What's New in RapidMiner Radoop 8.0.0?

Released:December 04th, 2017

The following improvements are part of RapidMiner Radoop 8.0.0.

Enhancements

  • NewK-Meansoperator is available that is based on the Spark MLlib/ML clustering algorithm
  • Hive access inSpark Scriptis now supported (HiveContextin Spark 1.x andSparkSessionin Spark 2.x), if the user has the appropriate privileges on the cluster
  • AddedSpark 2.2support (new option in theSpark Versionlist)
  • Aggregatenow support concatenation option as well
  • Hive (JDBC) connection error now displays more information
  • Import from Manager IconImport from Cluster Manageris now the default option to create a Radoop connection
  • K-Means,Fuzzy K-MeansandCanopyMahout clustering operators are now deprecated
  • Mahout testis now deprecated
  • Replace missing quick fix is now offered if there are known missing values in the input of an operator that does not handle them

Bug fixes

  • BUGFIX: Installation of an Apache Spark release is no longer needed for more recent Cloudera Hadoop releases in order for Spark jobs to work (and not give anUnknown/unsupported paramerror). Use the newSpark 1.6 (CDH)Spark version connection setting in this case.
  • BUGFIX:K-Meansmodel apply inside Radoop Nest no longer fails withClassNotFoundExceptionorNoClassDefFoundError
  • BUGFIX: FixedCannot initialize Clustererror when a certain failover proxy provider class is used on the cluster (org.apache.hadoop.yarn.client.RequestHedgingRMFailoverProxyProvider) and the connection is imported
  • BUGFIX:Spark ScriptwithRno longer relies on the private RDD API inSpark 2.x
  • BUGFIX: Quick fix that adds aType Conversionoperator now selects the proper target type