Cisco Blogs

Drowning in Data: Data Preparation to the Rescue

- December 17, 2015 - 4 Comments

Everyone knows that data is important. Modern enterprises compete with data and win with the agility and insight it provides.

Hence the “Big Data Era,” captured metaphorically by terms such as Data Lakes, Data Reservoirs, Data Swamps, Data Streams and myriad others, all of which attempt to describe the pools of liquid gold that this valuable data represents.

It seems everyone today is swimming in data, with many figuratively drowning in it.

The Rise of Data Swim Teams

With so much data and so much opportunity, big data has become a team sport at most large organizations today. This makes sense. There simply aren’t enough data scientists to carry the entire load. And while IT has the skills and is ready to help, it already has a huge backlog. So business leaders are redirecting their resources to “jump in” with both feet.

The first strokes the business analysts learn are typically the self-service visualization and analysis tools like Tableau, Qlik and Spotfire. Over time these analysts swim their way toward the deep end of the data lake where the data preparation challenges are more difficult.

Self-service data preparation tools, including Cisco Data Preparation, help these analysts successfully navigate these deeper waters.

Data Preparation to the Rescue

Cisco Data Preparation helps business analyst “swimmers” glide across big data waters. It is:

  • Comprehensive – Integrating hundreds of data types, including end-user data files, big data/Hadoop, traditional enterprise data sources and cloud/web service sources.
  • Agile – Providing all essential data preparation functions without requiring custom coding or scripting, and using machine learning to automate time-consuming tasks.
  • Easy to Use – Using an intuitive, exploration-style data preparation approach that requires minimal training and accelerates user understanding.
  • Scalable – Running reliably at multi-terabyte scale using Cisco’s UCS big data infrastructure, and offering closed-loop integration with Cisco Data Virtualization.
  • Governed – Taking advantage of an on-premises deployment model that is provisioned and managed by IT, with all steps tracked and all data stored in a centrally administered Hadoop database.

Dive Into Cisco Data Preparation

For a quick introduction to features and benefits, check out the Cisco Data Preparation Data Sheet.

 

Join the Conversation

Follow @CiscoDataVirt and @CiscoAnalytics.

Learn More from My Colleagues

Check out the blogs of Mala Anand, Mike Flannagan and Kevin Ott to learn more.


4 Comments

  1. Thanks

    It's interesting that there is so much big data that organizations need data scientists to analyze it. Hopefully companies are handling the massive amounts of information securely and not exposing customers to unnecessary risk.

    I really enjoyed reading the article, especially these lines: "Thus the “Big Data Era” as metaphorically captured by terms such at Data Lakes, Data Reservoirs, Data Swamps, Data Streams and myriad others that attempt to describe the pools of liquid gold that this valuable data represents."

    This is great, Bob. Big Data is a popular conversation across all industries and there is a commonly held belief that working with Big Data is very complex and time-consuming. I was glad to hear that Cisco Data Preparation is "a tool designed for non-technical business users" that masks the complexity and automates the time-consuming tasks associated with analyzing Big Data.
