Infor Advances Data Agenda With ‘Coleman’ AI, Birst BI Integration

Infor lays out plans for artificial intelligence and cloud-based business intelligence and analytics. Here’s what customers need to know. Infor entered the increasingly crowded artificial intelligence (AI) arena July 11 by introducing its Coleman AI platform. Unveild at the company’s Inforum 2017 event in NY, Coleman was described as a language- and image-savvy AI platform […]

Continue reading


Digital Economy Summer School 2015

Meet Tolkien My fellow Horizonauts and I spent the first half of last week at the University of Southampton for our annual summer school. This year the theme was web science and the focus was on big data and social media. Many new and interesting things were learnt, but perhaps more importantly, many new and […]

Continue reading


DRY HiveQL

DRY (don’t repeat yourself) is one of the fundamental principles of software engineering. The main idea is to avoid duplicating business/processing logic throughout the code. However, I rarely see it being applied when writing SQL queries; making it difficult to understand and maintain them. Below are few tips on making HiveQL DRY. Quick Summary Use […]

Continue reading


Qlik Plots Course to Big Data, Cloud and ‘AI’ Innovation

Qlik highlights upgrades and the roadmap to high-scale, hybrid cloud and ‘augmented intelligence.’ Here’s my take on the long-range plans. Big data scalability, hybrid cloud flexibility and smart “augmented” intelligence. These are the three plans that business intelligence and analytics vendor Qlik officially put on its roadmap at the May 15-18 Qonnections conference in Orlando, Florida. Qlik also […]

Continue reading


Teradata Transition to Cloud and Consulting Continues

Teradata simplifies pricing, executes on business consulting and hybrid cloud strategy. A look at next steps in the company’s ongoing transition. “Business outcome led, technology enabled.” This was the theme at the May 8-10 Teradata Third-Party Influencers Summit in San Diego, and it reflected a two-to-one ratio of consulting-oriented presentations to technology updates. Teradata has […]

Continue reading


Efficient Textual Similarity Across Millions of Web Queries

Computing textual similarity (such as Jaccard similarity coefficient) between millions of search queries can be an arduous task. The main challenge is the number of pairs that one needs to consider; a relatively small dataset containing ten thousands queries leads to more than 49 million possible query pairs (). Based on Vernica, et.al. paper, I show […]

Continue reading