Deep Learning OCR using TensorFlow and Python

In this post, deep learning neural networks are applied to the problem of optical character recognition (OCR) using Python and TensorFlow. This post makes use of TensorFlow and the convolutional neural network class available in the TFANN module. The full source code from this post is available here. Introduction to OCR OCR is the transformation […]

Continue reading


Answering Questions About Model Delivery on AWS at Strata

This post is a recap of the common questions Domino answered in the booth at Strata New York. We answered questions about access to EC2 machines, managing environments, and model delivery. One of the best parts about being at an event like Strata is the chance to spend time with data scientists and data science […]

Continue reading


What Your CIO Needs to Know about Data Science

What would you rather be doing? Data science or DevOps? As a data scientist, your CIO may hear that model deployment is a challenge (e.g., models sitting around for weeks before being deployed in production). However, your CIO may not have the nuanced insight they need to address the challenge. Meanwhile, being frustrated with models […]

Continue reading


Transforming organizations through analytics centers of excellence

[A version of this post appears on the O’Reilly Radar blog.] The O’Reilly Data Show Podcast: Carme Artigas on helping enterprises transform themselves with big data tools and technologies. In this episode of the Data Show, I spoke with Carme Artigas, co-founder and CEO of Synergic Partners (a Telefonica company). As more companies adopt big […]

Continue reading


The state of machine learning in Apache Spark

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Ion Stoica and Matei Zaharia explore the rich ecosystem of analytic tools around Apache Spark. In this episode of the Data Show, we look back to a recent conversation I had at the Spark Summit in San Francisco with Ion […]

Continue reading


A Neural Network in 10 lines of CUDA C++ Code

Purpose: For education purposes only. The code demonstrates supervised learning task using a very simple neural network. Reference: inspired by Andrew Trask‘s post. The core component of the code, the learning algorithm, is only 10 lines: The loop above runs for 50 iterations (epochs) and fits the vector of attributes X to the vector of […]

Continue reading


Domino now supports JupyterLab — and so much more

You can now run JupyterLab in Domino, using a new Domino feature that lets data scientists specify any web-based tools they want to run on top of the Domino platform. Introduction Domino is a data science platform that supports the entire data science lifecycle, from exploratory analysis through experimentation and all the way to deployment. […]

Continue reading


PoE AI Part 5: Real-Time Obstacle and Enemy Detection using CNNs in TensorFlow

This post is the fifth part of a series on creating an AI for the game Path of Exile © (PoE). A Deep Learning Based AI for Path of Exile: A Series Calibrating a Projection Matrix for Path of Exile PoE AI Part 3: Movement and Navigation PoE AI Part 4: Real-Time Screen Capture and […]

Continue reading