You are viewing the RapidMiner Radoop documentation for version 9.4 -Check here for latest version
What's New in RapidMiner Radoop 8.0.0?
Released:December 04th, 2017
The following improvements are part of RapidMiner Radoop 8.0.0.
Enhancements
- NewK-Meansoperator is available that is based on the Spark MLlib/ML clustering algorithm
- Hive access inSpark Scriptis now supported (HiveContextin Spark 1.x andSparkSessionin Spark 2.x), if the user has the appropriate privileges on the cluster
- AddedSpark 2.2support (new option in theSpark Versionlist)
- Aggregatenow support concatenation option as well
- Hive (JDBC) connection error now displays more information
- Import from Cluster Manageris now the default option to create a Radoop connection
- K-Means,Fuzzy K-MeansandCanopyMahout clustering operators are now deprecated
- Mahout testis now deprecated
- Replace missing quick fix is now offered if there are known missing values in the input of an operator that does not handle them
Bug fixes
- BUGFIX: Installation of an Apache Spark release is no longer needed for more recent Cloudera Hadoop releases in order for Spark jobs to work (and not give anUnknown/unsupported paramerror). Use the newSpark 1.6 (CDH)Spark version connection setting in this case.
- BUGFIX:K-Meansmodel apply inside Radoop Nest no longer fails withClassNotFoundExceptionorNoClassDefFoundError
- BUGFIX: FixedCannot initialize Clustererror when a certain failover proxy provider class is used on the cluster (org.apache.hadoop.yarn.client.RequestHedgingRMFailoverProxyProvider) and the connection is imported
- BUGFIX:Spark ScriptwithRno longer relies on the private RDD API inSpark 2.x
- BUGFIX: Quick fix that adds aType Conversionoperator now selects the proper target type