Monthly Archives: June 2013

Hadoop and the honeycomb

I love the kind of honey where they leave a piece of the honeycomb in the jar.  Its great to chew on when you’ve used up all the honey.  Reminds me of this big old oak tree we used to … Continue reading

Comments Off on Hadoop and the honeycomb

Hadoop Hindsight #1 Start Small

I thought we would start a weekly series on some lessons we’ve learned.  Many of the topics we’ve learned the hard way so we thought it might be helpful for those a few steps behind us.  YMMV, but we wish … Continue reading

Comments Off on Hadoop Hindsight #1 Start Small

Interview Questions for Hadoop Developers

Interview Questions for Hadoop Developers (via Dice News in Tech) Hadoop is an open distributed software framework that enables programmers to run an enormous number of nodes handling terabytes of data. One of its most significant abilities is allowing a … Continue reading

Comments Off on Interview Questions for Hadoop Developers

Cinderella has left the Hadoop Cluster

It’s Friday evening before our Hadoop Administrator leaves for a week of vacation in New Hampshire and about an hour before he leaves he says “it’s turning into a pumpkin in an hour”.  Of course we wanted to go live … Continue reading

Comments Off on Cinderella has left the Hadoop Cluster

The 3 Pillars of Data Democracy

In order to promote the use of data within the enterprise, we need to provide a collaborative environment which gives people the freedom and incentive to try new things.  This gives everyone the chance to prove great ideas, or at … Continue reading

Comments Off on The 3 Pillars of Data Democracy

Why Enterprise Hadoop jobs will not require Java skills in 3-5 years.

In the late 1979, RSI’s Oracle version 2 ran on Digital’s VAX minicomputers (32bit AND virtual memory!). If you were proficient with the first commercial RDBMS, you had to posses mad Macro-11 or PL-11 (the high level version) skills to … Continue reading

Comments Off on Why Enterprise Hadoop jobs will not require Java skills in 3-5 years.

Building Clusters without installing an Operating System

I’m about to tell you how we build Hadoop clusters without install an OS on the DataNodes.  We simply PXE boot them, assign them to a cluster and they join automatically.  Read on (and on) to see how we do … Continue reading

Comments Off on Building Clusters without installing an Operating System

Hortonworks HDP 1.3 released

HDP 1.3 was released today. This milestone release takes advantage of improved performance in Hive 0.11 along with delivery on a series of enterprise requirements including NFS access to HDFS, improved MTTR for HBase, business continuity through HDFS and HBase … Continue reading

Comments Off on Hortonworks HDP 1.3 released