
10 Worst Practices for Big Data


So many times, when you see your favorite team lose a championship game or a company fail to meet its revenue goals, years later you will see an interview in which one of the leaders says they “learned more from losing than they ever did from winning.” Some might think this is a losing philosophy, but I wholeheartedly buy into it because, let’s face it, none of us is 100% perfect.


Those who lose but learn from their mistakes will prosper in the long run. With that in mind, I’d like to draw a connection to an article written by Andrew C. Oliver, President of Open Software Integrators, called “The 10 worst big data practices.”

Below is a list of Andrew C. Oliver’s 10 worst big data practices.

1. Choosing MongoDB as your big data platform.

MongoDB is a good operational database, but not an analytics system.
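To make that distinction concrete, below is a minimal sketch using the pymongo driver; the database, collection, and field names are hypothetical. The point lookup is the operational work MongoDB handles well, while the full-collection aggregation is the kind of analytical workload a dedicated analytics system is built for.

```python
# Sketch contrasting an operational query with an analytical one in MongoDB.
# Assumes a running mongod; the "shop" database and "orders" collection are hypothetical.
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
orders = client["shop"]["orders"]

# Operational: fetch one document by key -- fast, index-backed, MongoDB's sweet spot.
order = orders.find_one({"_id": "order-12345"})

# Analytical: group and sum across the entire collection -- it works, but this
# whole-dataset style of query is what dedicated analytics systems are built for.
revenue_by_region = orders.aggregate([
    {"$group": {"_id": "$region", "total": {"$sum": "$amount"}}}
])
for row in revenue_by_region:
    print(row["_id"], row["total"])
```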

2. Using RDBMS schemas as files.

Don’t export every normalized table into Hadoop as its own file; there are better ways to create a denormalized extract.
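For instance, instead of landing orders and customers in Hadoop as separate normalized files, flatten the join into one wide extract first, so downstream jobs never have to re-do it. A minimal sketch using Python’s built-in sqlite3, with a hypothetical orders/customers schema:

```python
# Sketch: produce one denormalized extract instead of dumping normalized tables as files.
# Standard library only; the schema and file names below are hypothetical.
import csv
import sqlite3

conn = sqlite3.connect("warehouse.db")

# One wide row per order: the join happens once here,
# not in every downstream Hadoop job.
rows = conn.execute("""
    SELECT o.order_id, o.order_date, o.amount,
           c.customer_id, c.name, c.region
    FROM orders o
    JOIN customers c ON c.customer_id = o.customer_id
""")

with open("orders_denormalized.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow([col[0] for col in rows.description])  # header from cursor metadata
    writer.writerows(rows)
```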

3. Creating data ponds.

If each business group creates its own data pond, you end up with inconsistent, siloed views of the data instead of a single shared picture.

4. Failing to develop plausible use cases.

Come up with use cases before the project starts. You may find there are certain things you really don’t need, even if a vendor recommends them. Ask yourself the right questions.

5. Thinking Hive is the be-all, end-all.

Don’t get locked into SQL just because you are familiar with it. Reach outside your comfort zone and learn something new.
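As one small illustration of why, here is a task that is natural in a general-purpose language but awkward in plain SQL: sessionizing a clickstream, where each event’s meaning depends on the events before it. The sample data is hypothetical.

```python
# Sketch: sessionize a clickstream -- stateful, order-dependent logic that a
# few lines of Python express more naturally than plain SQL.
# The events list is hypothetical sample data: (user, timestamp_in_seconds).
SESSION_GAP = 30 * 60  # start a new session after 30 minutes of inactivity

events = [("alice", 0), ("alice", 100), ("alice", 5000), ("bob", 50)]

sessions = {}   # user -> list of sessions, each a list of timestamps
last_seen = {}  # user -> timestamp of that user's previous event
for user, ts in sorted(events):
    if user not in last_seen or ts - last_seen[user] > SESSION_GAP:
        sessions.setdefault(user, []).append([])  # open a new session
    sessions[user][-1].append(ts)
    last_seen[user] = ts

for user, user_sessions in sessions.items():
    print(user, "had", len(user_sessions), "session(s)")
# alice had 2 session(s); bob had 1 session(s)
```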

6. Treating HBase like an RDBMS.

You can do things with HBase that would make your RDBMS’s head spin, but the reverse is also true. HBase is good for what HBase is good for, and it is terrible at nearly everything else.
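The asymmetry comes from how HBase stores data: every row lives in a single sort order by row key, so key lookups and range scans are cheap and almost everything else is a full scan. Below is a rough sketch of that trade-off, modeled with a plain sorted list; the composite “sensor|timestamp” key scheme is a hypothetical example.

```python
# Sketch of why row-key design dominates in HBase: reads are either key lookups
# or sorted range scans, modeled here with a sorted list and bisect.
import bisect

# HBase keeps rows sorted by row key; a composite key clusters related rows.
table = sorted([
    ("sensor-01|2015-06-01", 21.5),
    ("sensor-01|2015-06-02", 22.1),
    ("sensor-02|2015-06-01", 19.8),
])
keys = [k for k, _ in table]

# Range scan by key prefix: efficient, the HBase sweet spot.
lo = bisect.bisect_left(keys, "sensor-01|")
hi = bisect.bisect_left(keys, "sensor-01~")  # '~' sorts after '|', closing the prefix range
print(table[lo:hi])  # every reading for sensor-01, without touching other rows

# Filtering on a *value* instead of the key means scanning the whole table.
# In an RDBMS you would add an index; in HBase you redesign the row key.
hot = [row for row in table if row[1] > 20.0]
print(hot)
```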

7. Installing 100 nodes by hand.

It sounds feasible until a node dies and has to be rebuilt, and you find yourself repeating the whole cumbersome process by hand.
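The cure is to script the installation once and run it everywhere. In practice that means a configuration-management tool such as Ansible, Puppet, or Chef, but even a bare-bones loop over SSH beats typing the same commands on 100 machines. A minimal sketch, with hypothetical hostnames and a hypothetical install script:

```python
# Bare-bones illustration of scripted provisioning instead of hand installs.
# The hostnames and install script below are hypothetical placeholders.
import subprocess

HOSTS = [f"datanode{i:03d}.example.com" for i in range(1, 101)]
INSTALL_CMD = "sudo /opt/provision/install_hadoop_worker.sh"  # hypothetical script

failures = []
for host in HOSTS:
    # Run the identical install step on every node; no step gets forgotten
    # on machine 73 the way it might during a long manual session.
    result = subprocess.run(["ssh", host, INSTALL_CMD],
                            capture_output=True, text=True)
    if result.returncode != 0:
        failures.append((host, result.stderr.strip()))

print(f"{len(HOSTS) - len(failures)} nodes provisioned, {len(failures)} failed")
for host, err in failures:
    print("FAILED:", host, err)
```

If a node dies later, rebuilding it is a matter of re-running the script against one hostname rather than repeating the whole manual checklist.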

8. RAID/LVM/SAN/VMing your data nodes.

“Hadoop stripes blocks of data across multiple nodes, and RAID stripes it across multiple disks. Put them together, what do you have? A roaring, low-performing, latent mess.”

9. Treating HDFS as just a file system.

If you dump stuff onto HDFS, you haven’t necessarily accomplished anything.

10. Whoo, shiny!

“As with any technology — or anything in life — find that moderate path that prevents you from being the last gazelle in the pack or the first lemming off the cliff.”

Again, no one is 100% perfect, and these are just 10 of the ways you can go wrong with big data. Still, I thought highlighting Andrew C. Oliver’s analysis of bad practices could help you avoid them and save you a lot of time and stress down the road. I hope you found this helpful.

Do you agree that these are the 10 worst big data practices? Would you add anything to the list, or remove anything?

Click here to read Andrew C. Oliver’s entire article on InfoWorld and learn more about the bad practices on this list.
