Category Archives: Uncategorized

Drilling thru Multiple Clusters

…or Using Apache Drill to join data across discreet domains. We’ve been doing some work with Redshift lately. While it’s an effective tool for storing and crunching thru large amounts of structured data, it’s limited by a few “-ism’s” that … Continue reading

Posted in Uncategorized | Leave a comment

A wonderful, ugly script that just keeps working

Today were going to look at parts of a complex “nudge” script as I’ve described previously. It has a few more bells and whistles and constantly amazes me how well it adapts. I’ll show the good bits in sections so … Continue reading

Posted in Uncategorized | Leave a comment

The 3 Question Test

A burger and fries costs $1.10; the burger costs $1 more than the fries. How much do the fries cost? 5 servers can sort 5 TB of data in 5 minutes; how long would 100 servers take to sort 100 … Continue reading

Posted in Uncategorized | 1 Comment

Hadoop 2.0 GA

I’ve been watching the Hadoop user mailing lists and jira counts. It sure seems like 2.0 GA is more like 2.0 Beta 1. I’m looking forward to RC 1 before we move it into a serious cluster. Just my $0.02.

Posted in Uncategorized | Leave a comment

Intuition about Chromecast

I bought a Google Chromecast device. It was really cheap ($35+tax) and I’m a whore for media casting devices. When I “flick” tablet video’s to Chromecast, I notice something interesting. I see the display of Chromecast ponder what I’ve sent … Continue reading

Posted in Uncategorized | Leave a comment

Consuming JSON Strings in SQL Server

This article describes a TSQL JSON parser and its evil twin, a JSON outputter, and provides the source. It is also designed to illustrate a number of string manipulation techniques in TSQL. With it you can do things like this … Continue reading

Posted in Uncategorized | Leave a comment

You Paid for Support?! Bwah-ha-ha

We’re using Open Source Software extensively in our Big Enterprise. It really irritates me that we pay millions of dollars for “Support” from our vendors and we get endless circles of “try this,” “that should work” and “oh, that’s an … Continue reading

Posted in Uncategorized | Leave a comment

Insights for Articles from the Hadoop Summit 2013

I just left the Hadoop Summit 2013 so my next series of articles are going to be on some insights I learned.  For this post I’m going to just post a long list of future topics  – let me know … Continue reading

Posted in Uncategorized | 1 Comment

Hadoop and the honeycomb

I love the kind of honey where they leave a piece of the honeycomb in the jar.  Its great to chew on when you’ve used up all the honey.  Reminds me of this big old oak tree we used to … Continue reading

Posted in Uncategorized | Leave a comment

Interview Questions for Hadoop Developers

Interview Questions for Hadoop Developers (via Dice News in Tech) Hadoop is an open distributed software framework that enables programmers to run an enormous number of nodes handling terabytes of data. One of its most significant abilities is allowing a … Continue reading

Posted in Uncategorized | Leave a comment