Category Archives: Development

Discussions regarding programming, coding, etc. for Hadoop clusters.

Why Enterprise Hadoop jobs will not require Java skills in 3-5 years.

In late 1979, RSI’s Oracle version 2 ran on Digital’s VAX minicomputers (32-bit AND virtual memory!). If you were proficient with the first commercial RDBMS, you had to possess mad Macro-11 or PL-11 (the high level version) skills to … Continue reading

Posted in Development, Opinions

Hortonworks HDP 1.3 released

HDP 1.3 was released today. This milestone release takes advantage of improved performance in Hive 0.11 along with delivery on a series of enterprise requirements including NFS access to HDFS, improved MTTR for HBase, business continuity through HDFS and HBase … Continue reading

Posted in Development, Opinions

It’s scary… but sometimes things go really well!

If you’ve been in IT for more than a few minutes, you know that things seldom go as planned. Sometimes they go sideways and you’re left to clean up an abomination. Been there, done that, expect it at least 15% of … Continue reading

Posted in Administration, Deployment, Development, Tuning | 1 Comment

Life on the edge of data node writes

If you’re serious about using Hadoop, you should subscribe to the User Mailing Lists. They are a great source of insight into how things are performing, new features, and common problems. I’m currently working on a JIRA to clarify documentation … Continue reading

Posted in Administration, Development | 2 Comments

Confessions of a data architect

My name is DataG and I’m a data modeler. It’s been 6 weeks since my last star-schema. Let’s face it. Codd, Imhoff, Inmon, and Kimball paved the way for almost every data analyst and app-dev professional since the relational model … Continue reading

Posted in Data, Development, Uncategorized | 1 Comment