Using H2O AutoML for Kaggle Porto Seguro Safe Driver Prediction Competition

If you into competitive machine learning you must be visiting Kaggle routinely. Currently you can compete for cash and recognition at the Porto Seguro’s Safe Driver Prediction as well. I did try to given training dataset (as it is) with H2O AutoML which ran for about 5 hours and I was able to get into top […]

Continue reading


Improving Zillow’s Zestimate with 36 Lines of Code

Zillow and Kaggle recently started a $1 million competition to improve the Zestimate. We are releasing a public Domino project that uses H2O’s AutoML to generate a solution. The new Kaggle Zillow Price competition received a significant amount of press, and for good reason. Zillow has put $1 million on the line if you can […]

Continue reading


How to Become a Unicorn Data Scientist and Make More than $240,000

What makes a good data scientist? And if you are a good data scientist, how much should you expect to get paid? Owen Zhang, ranked #1 on Kaggle, the online stadium for data science competitions, lists his skills on his Kaggle profile as “excessive effort,” “luck,” and “other people’s code.” An engineer by training, Zhang says […]

Continue reading


The World’s #1 Data Scientist Talks about Data Science Skills and Tools

Owen Zhang is ranked #1 on Kaggle, the online stadium for data science competitions. An engineer by training, Zhang says that data science is finding “practical solutions to not very well-defined problems,” similar to engineering. He believes that good data scientists, “otherwise known as unicorn data scientists,” have three types of expertise. Since data science deals with […]

Continue reading


Revisiting Big Data and Crowdsourcing: Kaggle Today

I launched this blog a year ago in June 2011. In one of my first posts, I discussed “Crowdsourcing and Big Data,” offering a typology of crowdsourcing and connecting it to big data by mentioning a little-known (at the time) Australia-based venture called Kaggle. Today, Kaggle is a well-funded, Silicon Valley-based leading platform for predictive modeling […]

Continue reading


Fei-fei Li in Google Cloud NEXT ’17: Annoucing Google Could Video Intelligence API, and more Cloud Machine Learning Updates

Between March 8-10, Synced was invited as media guest to attend the Google Cloud NEXT ’17 conference in San Francisco. The first day of the conference was opened by keynote speeches from senior vice president of Google Cloud – Diane Greene, Google CEO – Sundar Pichai, Alphabet executive chairman – Eric Schmidt, and Google Cloud […]

Continue reading


[Kaggle] Poker Rule Induction

I wrote a note http://nbviewer.ipython.org/github/AhmedHani/Kaggle-Machine-Learning-Competitions/blob/master/Easy/PokerRuleInduction/PokerRuleInduction.ipynb about Poker Rule Induction problem, the note explains the problem description and the steps I used to solve it. It is considered a good problem for those who want to start solving at Kaggle and know about some Machine Learning libraries in Python that are commonly used when solving at Kaggle. Advertisements

Continue reading