Generating ROC curve in SCALA from H2O binary classification models

You can use the following blog to built a binomial classification  GLM model: https://aichamp.wordpress.com/2017/04/23/binomial-classification-example-in-scala-and-gbm-with-h2o/ To collect model metrics  for training use the following: val trainMetrics = ModelMetricsSupport.modelMetrics[ModelMetricsBinomial](glmModel, train) Now you can access model AUC (_auc object) as below: Note: _auc object has array of thresholds, and then for each threshold it has fps and tps (use […]

Continue reading


How I did it: Will Cukierski on finishing second in the IJCNN Social Network Challenge

Graph theory has always been an academic side interest of mine, so I was immediately interested when Kaggle posted the IJCNN social network challenge.  Graph-theoretic problems are deceptively accessible and simple in presentation (what other dataset in a data-mining competition can be written as a two-column list?!), but often hide complex, latent relationships in the […]

Continue reading