Monthly Archives: June 2013

Hadoop and the honeycomb

I love the kind of honey where they leave a piece of the honeycomb in the jar.  Its great to chew on when you’ve used up all the honey.  Reminds me of this big old oak tree we used to … Continue reading

Leave a comment

Hadoop Hindsight #1 Start Small

I thought we would start a weekly series on some lessons we’ve learned.  Many of the topics we’ve learned the hard way so we thought it might be helpful for those a few steps behind us.  YMMV, but we wish … Continue reading

Leave a comment

Interview Questions for Hadoop Developers

Interview Questions for Hadoop Developers (via Dice News in Tech) Hadoop is an open distributed software framework that enables programmers to run an enormous number of nodes handling terabytes of data. One of its most significant abilities is allowing a … Continue reading

Leave a comment

Cinderella has left the Hadoop Cluster

It’s Friday evening before our Hadoop Administrator leaves for a week of vacation in New Hampshire and about an hour before he leaves he says “it’s turning into a pumpkin in an hour”.  Of course we wanted to go live … Continue reading

Leave a comment

The 3 Pillars of Data Democracy

In order to promote the use of data within the enterprise, we need to provide a collaborative environment which gives people the freedom and incentive to try new things.  This gives everyone the chance to prove great ideas, or at … Continue reading

Leave a comment

Why Enterprise Hadoop jobs will not require Java skills in 3-5 years.

In the late 1979, RSI’s Oracle version 2 ran on Digital’s VAX minicomputers (32bit AND virtual memory!). If you were proficient with the first commercial RDBMS, you had to posses mad Macro-11 or PL-11 (the high level version) skills to … Continue reading

Leave a comment

Building Clusters without installing an Operating System

I’m about to tell you how we build Hadoop clusters without install an OS on the DataNodes.  We simply PXE boot them, assign them to a cluster and they join automatically.  Read on (and on) to see how we do … Continue reading

Leave a comment

Hortonworks HDP 1.3 released

HDP 1.3 was released today. This milestone release takes advantage of improved performance in Hive 0.11 along with delivery on a series of enterprise requirements including NFS access to HDFS, improved MTTR for HBase, business continuity through HDFS and HBase … Continue reading

Leave a comment