InnoCentive

InnoCentive Home Page InnoCentive is a Waltham, Massachusetts-based crowdsourcing company that accepts by commission research and development problems in engineering, computer science, math, chemistry, life sciences, physical sciences and business. The company frames these as “challenge problems” for anyone to solve. It gives cash awards for the best solutions to solvers who meet the challenge criteria.[1] […]

Continue reading


Big Data 1B dollars Club – Top 20 Players

Here is a list of top players in Big Data world having influence over billion dollars (or more) Big Data projects directly or indirectly (not in order): Microsoft Google Amazon IBM HP Oracle VMWare Terradata EMC Facebook GE Intel Cloudera SAS 10Gen SAP Hortonworks MapR Palantir Splunk The list is based on each above companies […]

Continue reading


What to do when compiling Hadoop branch 1.2.x returns java.io.IOException: Cannot run program “autoreconf”

Compiling Hadoop branch 1.2.x code in OSX returned exception as below: create-native-configure: BUILD FAILED/Users/avkash/work/hadoop/branch-1.2/build.xml:634: Execute failed: java.io.IOException: Cannot run program “autoreconf” (in directory “/Users/avkash/work/hadoop/branch-1.2/src/native”): error=2, No such file or directory at java.lang.ProcessBuilder.processException(ProcessBuilder.java:478) at java.lang.ProcessBuilder.start(ProcessBuilder.java:457) at java.lang.Runtime.exec(Runtime.java:593) at org.apache.tools.ant.taskdefs.Execute$Java13CommandLauncher.exec(Execute.java:862) at org.apache.tools.ant.taskdefs.Execute.launch(Execute.java:481) at org.apache.tools.ant.taskdefs.Execute.execute(Execute.java:495) at org.apache.tools.ant.taskdefs.ExecTask.runExecute(ExecTask.java:631) at org.apache.tools.ant.taskdefs.ExecTask.runExec(ExecTask.java:672) at org.apache.tools.ant.taskdefs.ExecTask.execute(ExecTask.java:498) at org.apache.tools.ant.UnknownElement.execute(UnknownElement.java:291) at sun.reflect.GeneratedMethodAccessor4.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) […]

Continue reading


Customized bash command prompt with line separator and other goodies

I wanted to have a fancy looking and very useful terminal windows with customize command prompt so after digging I build something as below for me: So what it have: Line Separator including current time at the end of the terminal History counter along with current command counter Logged user @ Hostname Current working folder […]

Continue reading


Brew on Mac – Just 3 steps and you are ready

Step 1: MachineHead:docs avkash$ ruby -e “$(curl -fsSL https://raw.github.com/mxcl/homebrew/go)” ==> This script will install: /usr/local/bin/brew /usr/local/Library/… /usr/local/share/man/man1/brew.1 ==> The following directories will be made group writable: /usr/local/. /usr/local/bin /usr/local/etc /usr/local/lib /usr/local/share /usr/local/share/man /usr/local/share/man/man1 /usr/local/share/info ==> The following directories will have their group set to admin: /usr/local/. /usr/local/bin /usr/local/etc /usr/local/lib /usr/local/share /usr/local/share/man /usr/local/share/man/man1 /usr/local/share/info Press ENTER […]

Continue reading


Apache Weave: Big Data Application runtime and development framework by Continuuity

Continuuity decided to build Weave and be part of the journey to take Apache YARN to the next level of usability and functionality. Continuuity has been using Weave extensively to support their  products and  seen the benefit and power of Apache YARN and Weave combined.  Continuuity decided to share Weave under the Apache 2.0 license in an […]

Continue reading


Processing unstructured content from a URL in R

R has a built in function name readLines() which read a local file or an URL to read content line by line. For example my blog URL is http://cloudcelebrity.wordpress.com so lets read it: > myblog <- readLines(“http://cloudcelebrity.wordpress.com&https://aichamp.wordpress.com/2013/02/12/processing-unstructured-content-from-a-url-in-r/8221😉Warning message:In readLines(“http://cloudcelebrity.wordpress.com&https://aichamp.wordpress.com/2013/02/12/processing-unstructured-content-from-a-url-in-r/8221😉 : incomplete final line found on ‘http://cloudcelebrity.wordpress.com&https://aichamp.wordpress.com/2013/02/12/processing-unstructured-content-from-a-url-in-r/8217; > length(myblog) [1] 1380 As you can see above there […]

Continue reading


Merging two data set in R based on one common column

Let’s create a new dataset using mtcars dataset and only mpg and hp column: > cars.mpg <- subset(mtcars, select = c(mpg, hp))   > cars.mpg                      mpg  hp Mazda RX4           21.0 110 Mazda RX4 Wag       21.0 110 Datsun 710          22.8  93 Hornet 4 Drive      21.4 110 Hornet Sportabout   18.7 175 Valiant             18.1 105 Duster 360          […]

Continue reading