
Hadoop Performance Tuning with ORC and Snappy? Here’s What You’re Missing

Posted on January 23, 2015 by Douglas Moore

ORC (Optimized Row Columnar) and Snappy compression can offer higher performance for Hadoop. But sometimes the Hive planner can’t keep up, causing your queries to run slowly and underutilize your cluster. While this problem would be common to any type of compressed input, here’s something to consider when tuning for performance. To set…


2015 in Big Data – What to Expect

Posted on December 24, 2014 by Ron Bodkin

In 2014, Apache™ Hadoop® gathered momentum as the leading platform for big data analytics. Hadoop is clearly here to stay: it has extended its dominance from enterprise software into social media (Twitter and Facebook both use it), making it hard to imagine a clear successor emerging any time soon. That said, while data scientists…


Here comes Big Data 2.0 — and right on time

Posted on December 23, 2014 by Jeffrey Breen

Daniel Eklund, Think Big’s Engineering Lead and my officemate in our new Boston office, started using the term “Big Data 2.0” nearly a year ago, but I have been seeing it more and more lately. The “2.0” moniker is not just marketing hype. A lot has changed in the last year to justify the version bump: A whole new Hadoop. No,…
