Data Governance in Hadoop

In the previous post, we examined what data governance is and why it’s so important, as well as why it can be complicated for Hadoop. In short, when you have petabytes of data from many different sources, with varying degrees of structure, it’s difficult to keep track of where your sensitive or confidential data is located. But that also makes doing so even more important!

In this post, we’ll take a look at Cloudera Navigator, the only native data governance solution for Hadoop, and integrated into Cloudera Enterprise. We developed Cloudera Navigator to address the challenge of Hadoop data governance, and it’s one of the many reasons that so many organizations in financial services, healthcare, and pharmaceuticals rely on Cloudera Enterprise to store, analyze, and govern their most important data assets.

Cloudera Navigator is a turnkey data governance solution: once you install it, it automatically tracks everything you need to know about data usage in your cluster – in particular, what kind of data you have in your cluster, who’s accessing it, and what they’re doing with it. This includes the following:

Unified Auditing. Cloudera Navigator captures every data access attempt and logs it in a central location. Whether it’s through Apache Hive, Apache HBase, Cloudera Search, Impala, or just plain MapReduce. Cloudera Navigator tracks the user ID, IP address, resource name, and even the exact query that was run.

Comprehensive, Column-Level Lineage. Cloudera Navigator automatically captures lineage for all batch and interactive workloads, including Hive, Impala, MapReduce, Apache Oozie, Apache Pig, Apache Spark and Apache Sqoop transformations – all the way down to the column-level. It also integrates with all the top enterprise lineage frameworks.

Unified Metadata. Cloudera Navigator simplifies metadata access in Hadoop by consolidating all of the Hadoop technical metadata into a single, searchable interface. Additionally, Cloudera Navigator lets you classify all your data with custom tags and key-value pairs — so you can classify your data by clinical trial, customer type, degree of sensitivity, security clearance level, or anything else. Cloudera Navigator’s metadata features make it easy for data scientists and Hadoop administrators alike to effortlessly find and trust the data that matters most to them.

Data Lifecycle Management and Policy Enforcement. Cloudera Navigator’s flexible policy management, built on top of its rich metadata foundation, lets you automate crucial data stewardship and curation activities, such as metadata classification, data archiving and retention, or even invoking partner products for additional data preparation and transformation.

Encryption and Key Management. Cloudera Navigator includes enterprise-grade encryption and key management through Navigator Encrypt and Navigator Key Trustee to secure all data, metadata, and log files.

Seamless Integration with Existing Governance Solutions. Cloudera Navigator provides seamless integration with the leading enterprise metadata, lineage, and SIEM applications that organizations already rely on, including IBM, Imperva, Informatica, RSA, and Splunk.

Cloudera is leading the way for data governance in Hadoop and we will continue to be adding to and improving Cloudera Navigator’s already robust capabilities. Be on the lookout for exciting enhancements stemming from the acquisition Xplain.io around managing more complex data and workloads faster.

For more details on Cloudera Navigator, download the datasheet. It is also included as part of the free Cloudera Enterprise Trial. Try it out and let us know your thoughts!

Data Governance in Hadoop – Part 2

Trending Articles

Bath man appears in court charged with attempted murder of a man...

MACLEAN, Allan

Black Angus Grilled Artichokes

Practice Sheet of Right form of verbs for HSC Students

Police blotter for Jan. 12

99 God Status for Whatsapp, Facebook

Rajasthan Board 12th Science Result 2018 name wise- RBSE 12th commerce result...

Notorious Naushad of Ippa gang nabbed

Child Kidnapping: Amy McNeil was kidnapped on her way to school by 5 adults;...

Sonible Smartlimit v1.1.5-R2R

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

Arrow Flash 2 – Sinhala Dubbed – Episode 23 – 20th March 2016

[GET] AI Traffic Goldmine

[E² Plugin] HDF-Radio

Universal Multi-Patch v1.3 By RADIXX11

IWAN – Thanks and Praise ( Throw Back Thursday )

RONALD P SONDERGAARD Arrested by Miami-Dade County Corrections on Mar 03, 2017

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

HSSC Excise & Taxation Inspector Result 2017 Scorecard/ Category Wise Merit List