Calculate mean using UDF in H2O

Here is the full code for a UDF (user-defined function) that calculates the mean for a given data frame using the H2O machine learning platform:

    library(h2o)
    h2o.init()
    ausPath <- system.file("extdata", "australia.csv", package = "h2o")
    australia.hex <- h2o.uploadFile(path = ausPath)

    # Writing the UDF
    myMeanUDF = function(Fr) { mean(Fr[, 1]) }

    # Applying UDF using ddply
    MeanValue = h2o.ddply(australia.hex[, c("premax", […]



Choosing Between a Nonparametric Test and a Parametric Test

You may have heard that you should use nonparametric tests when your data don’t meet the assumptions of the parametric test, especially the assumption about normally distributed data. But there are additional considerations. This post will help you determine when you should use a parametric analysis to test group means or a nonparametric analysis to […]
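
As a rough illustration of the choice described above, the sketch below runs a parametric test and a nonparametric test on the same two groups in base R. The data are made up for illustration and do not come from the post; the variable names (`group_a`, `group_b`, `t_res`, `w_res`) are hypothetical. It is a minimal sketch, not a recommendation of one test over the other.

```r
# Hypothetical data: two small, made-up samples (not from the post).
group_a <- c(12.1, 13.4, 11.8, 14.2, 12.9, 13.1, 12.5, 13.8)
group_b <- c(14.0, 15.2, 13.9, 16.1, 14.8, 29.5, 15.0, 14.4)  # one extreme value

# Parametric: Welch two-sample t-test, which compares the group means.
t_res <- t.test(group_a, group_b)

# Nonparametric: Wilcoxon rank-sum (Mann-Whitney) test, which works on ranks,
# so the extreme value in group_b has less influence on the result.
w_res <- wilcox.test(group_a, group_b)

# Shapiro-Wilk test checks each sample for departure from normality.
sw_a <- shapiro.test(group_a)
sw_b <- shapiro.test(group_b)

# Print the p-values side by side for comparison.
print(c(t_test = t_res$p.value, wilcoxon = w_res$p.value))
print(c(shapiro_a = sw_a$p.value, shapiro_b = sw_b$p.value))
```

With small samples, a normality check such as Shapiro-Wilk has limited power, so it is better treated as one input to the decision than as a rule.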
