Movie Recommender -Affinity Analysis of Apriori in Python

“Affinity analysis can be applied to many processes that do not use transactions in this sense: Fraud detection Customer segmentation Software optimization Product recommendations. The classic algorithm for affinity analysis is called the Apriori algorithm. ” More details can be found in Robert Layton’s book here: https://www.goodreads.com/book/show/26019855-learning-data-mining-with-python?from_search=true We explored similar method of “Market Basket” here:https://charleshsliao.wordpress.com/2017/03/06/an-quick-association-rules-example-within-r/ […]

Continue reading


NBA Winning Estimator with Decision Tree in Python

It would be interesting to conduct prediction to understand the trend of NBA winning teams. We will use data from http://www.basketball-reference.com/leagues/NBA_2017_games-june.html and follow workflow. More details can be found in Robert Layton’s book here: https://www.goodreads.com/book/show/26019855-learning-data-mining-with-python?from_search=true ###1. Load data from http://www.basketball-reference.com/leagues/NBA_2017_games-june.html import pandas as pd file=”NBA2017.csv” NBA2017=pd.read_csv(file,sep=”,”,parse_dates=[“Date”]) #change string of “Date” to date value NBA2017.columns=[“Date”, “Start […]

Continue reading


Quick Machine Learning Workflow in Python, with KNN as Example of Ionosphere Data

Multiple approaches to build models of machine learning in Python are possible, and the article would serve as a simply summary of the essential steps to conduct machine learning from data loading to final visualization. You can find the data here: http://archive.ics.uci.edu/ml/datasets/Ionosphere More details can be found in Robert Layton’s book here:https://www.goodreads.com/book/show/26019855-learning-data-mining-with-python?from_search=true ###1. Load data […]

Continue reading


Preprocess: PCA Application in Python

We use the data from sklearn library, and the IDE is sublime text3. Most of the code comes from the book: https://www.goodreads.com/book/show/32439431-introduction-to-machine-learning-with-python?from_search=true ###sometimes we might face the situation that the features or vars in the data are not separate from each other ###We can always observe that data before we can even preprocess it with […]

Continue reading


Recommenders in R, Comparing Multiple Algorithms

We know several essential recommenders’ methods. If we want to recommend ourselves a book, we can do it 1. Based on our own exp 2. Based on our friends friends exp 3. Based on the catalog of the library 4. Based on the search engine’s result We already talked a little about the first method […]

Continue reading


Tencent Cloud’s Huang Ming on its New DI-X Deep Learning Platform

On March 28, Tencent Cloud announced the launch of its machine learning platform DI-X (short for Data Intelligence X), with a goal of providing a one stop shop for its machine learning and deep learning customers by reducing their barrier to entry and streamlining their development of AI. Based on Tencent Cloud’s compute and storage […]

Continue reading


Credit Analysis with ROC evaluation in Neural Network and Random Forest

This is quite like the article using C5.0 to conduct classification: https://charleshsliao.wordpress.com/2017/03/04/a-quick-classification-example-with-c5-0-in-r/ We tried to use more mature and powerful algorithms with cross validation and parameters tuning. 1. At first we preprocess the data. </pre> ################################################## #1. load, clean and preprocess data url <- ‘https://onlinecourses.science.psu.edu/stat857/sites/onlinecourses.science.psu.edu.stat857/files/german_credit.csv’ german_credit <- read.csv(url, header = TRUE, sep = ‘,’) # as the […]

Continue reading


Digital Marketing Application Method of Machine Learning and Data Mining, with RFM Model

No matter it is a classifier or a regression model, we apply the data mining and machine learning methods to achieve a target. To be more straightforward, we need to solve a problem. Especially in Digital Marketing (or “traditional marketing with data analytics approach”) when we focus on models of AARRR, PRAPA or ARM, in […]

Continue reading


A Quick Association Rules Example within R

Association rules are used to decided what items would lead to other items’ purchase. The practice is commonly known as market basket analysis due to the fact that it has been so frequently applied to supermarket data. The dataset used here was adapted from the Groceries dataset in the arules R package. library(arules) ## Loading […]

Continue reading