Tag Archives: deployment

hello woRld!

R is the latest Hadoop darling. It is an open source language that “is widely used among statisticians and data miners for developing statistical software and data analysis. Polls and surveys of data miners are showing R’s popularity has increased … Continue reading

Posted in Administration, Deployment, Development, Tuning | Tagged , , , | Leave a comment

Weaponizing Hadoop

We are usually left to bash for scripting Hadoop functions. It’s the default in Linux and it’s usually good enough. There are enough “bash-isms” that will cause your Java/pig/database people serious heart ache. If you’re new to Hadoop, go ahead … Continue reading

Posted in Administration, Deployment, Development, syndicated | Tagged , , , | Leave a comment

Building Clusters without installing an Operating System

I’m about to tell you how we build Hadoop clusters without install an OS on the DataNodes.  We simply PXE boot them, assign them to a cluster and they join automatically.  Read on (and on) to see how we do … Continue reading

Posted in Administration, Deployment | Tagged , , , , , | Leave a comment

Cloudera and VMware Work Together to Accelerate Enterprise Use of Hadoop in Virtual and Cloud Environments

Thought this might be interesting to our readers. It doesn’t apply to our implementation, but it might be useful to someone.

Posted in Uncategorized | Tagged , , , | Leave a comment

It’s scary… but sometimes things go really well!

If you’ve been in IT for more than a few minutes, you know that things seldom go as planned.  Sometimes they go sideways and you’re left to clean up an abomination.  Been there, done that, expect it at least 15% of … Continue reading

Posted in Administration, Deployment, Development, Tuning | Tagged , , | 1 Comment