Tag Archives: deployment
R is the latest Hadoop darling. It is an open source language that “is widely used among statisticians and data miners for developing statistical software and data analysis. Polls and surveys of data miners are showing R’s popularity has increased … Continue reading
We are usually left to bash for scripting Hadoop functions. It’s the default in Linux and it’s usually good enough. There are enough “bash-isms” that will cause your Java/pig/database people serious heart ache. If you’re new to Hadoop, go ahead … Continue reading
I’m about to tell you how we build Hadoop clusters without install an OS on the DataNodes. We simply PXE boot them, assign them to a cluster and they join automatically. Read on (and on) to see how we do … Continue reading
Cloudera and VMware Work Together to Accelerate Enterprise Use of Hadoop in Virtual and Cloud Environments
Thought this might be interesting to our readers. It doesn’t apply to our implementation, but it might be useful to someone.