Nairobi, Kenya

254728269396

Managing Metadata and Data Catalogs Training

ONSITE OR VIRTUAL

Programme Overview

Who Should Attend

This course is ideal for:

  1. Data Stewards and Metadata Managers
  2. Data Engineers and Architects
  3. Data Governance Professionals
  4. Business Intelligence Developers
  5. Data Scientists and Analysts
  6. IT Compliance Officers
  7. Enterprise Architects
  8. Information Management Officers
Session Objectives
  • Understand the principles and value of metadata management
  • Learn to implement enterprise metadata standards and models
  • Gain skills in selecting and deploying data catalog platforms
  • Establish metadata-driven data governance frameworks
  • Automate metadata ingestion and lineage capture
  • Create and manage business glossaries and taxonomies
  • Enhance data discoverability and classification
  • Enable collaborative metadata stewardship workflows
  • Integrate metadata into analytics and ML environments
  • Enforce data access and usage policies via metadata
  • Monitor metadata quality, versioning, and impact analysis
About the Course

In today’s data-driven landscape, organizations are increasingly dependent on accurate, accessible, and governed metadata to drive value from their data assets. The Managing Metadata and Data Catalogs Training Course is designed to equip professionals with the tools and techniques needed to organize, maintain, and leverage metadata effectively using enterprise-grade data catalog platforms. This 10-day intensive course explores strategies for metadata lifecycle management, active data cataloging, automated lineage, policy enforcement, and business glossary integration. Through hands-on practice and expert-led modules, participants will gain the critical competencies to support data discovery, governance, compliance, and analytics initiatives at scale.

Curriculum & Topics

15 Topics | 10 Days

  • Subtopic 1.1: What is metadata and why it matters

  • Subtopic 1.2: Types of metadata: technical, business, operational

  • Subtopic 1.3: Metadata lifecycle stages

  • Subtopic 1.4: Benefits to governance, analytics, and compliance

  • Subtopic 1.5: Common metadata management challenges

  • Subtopic 2.1: Metadata modeling concepts

  • Subtopic 2.2: Standards like ISO 11179 and Dublin Core

  • Subtopic 2.3: Entity relationship modeling for metadata

  • Subtopic 2.4: Aligning metadata models with business needs

  • Subtopic 2.5: Mapping technical assets to business terms

  • Subtopic 3.1: What is a data catalog and how it works

  • Subtopic 3.2: Catalog components: search, lineage, tagging

  • Subtopic 3.3: Comparing open-source and commercial catalogs

  • Subtopic 3.4: Use cases across industries

  • Subtopic 3.5: Cataloging structured and unstructured data

  • Subtopic 4.1: Automated metadata discovery techniques

  • Subtopic 4.2: Tagging, classification, and categorization

  • Subtopic 4.3: Sensitivity detection and PII tagging

  • Subtopic 4.4: Leveraging AI/ML for smart classification

  • Subtopic 4.5: Organizing metadata using taxonomy

  • Subtopic 5.1: Techniques for ingesting metadata from sources

  • Subtopic 5.2: APIs, connectors, and crawlers

  • Subtopic 5.3: Scheduling metadata synchronization

  • Subtopic 5.4: Metadata harvesting from lakes and warehouses

  • Subtopic 5.5: Event-driven ingestion workflows

  • Subtopic 6.1: Difference between glossary and catalog

  • Subtopic 6.2: Business term definition and approval

  • Subtopic 6.3: Relationships and synonyms between terms

  • Subtopic 6.4: Integrating glossary with catalog search

  • Subtopic 6.5: Governance of glossary changes

  • Subtopic 7.1: Capturing and visualizing data lineage

  • Subtopic 7.2: Lineage at schema, table, and column levels

  • Subtopic 7.3: Upstream and downstream dependency analysis

  • Subtopic 7.4: Using lineage for impact and root cause analysis

  • Subtopic 7.5: Change propagation and notification

  • Subtopic 8.1: Roles and responsibilities in metadata governance

  • Subtopic 8.2: Setting policies and stewardship workflows

  • Subtopic 8.3: Governance models: centralized, federated, hybrid

  • Subtopic 8.4: Stewardship tools and collaboration

  • Subtopic 8.5: Ensuring metadata quality and accuracy

  • Subtopic 9.1: Data catalog components and deployment models

  • Subtopic 9.2: On-premises, cloud, and hybrid deployments

  • Subtopic 9.3: Integration with security and identity systems

  • Subtopic 9.4: Metadata storage and scalability

  • Subtopic 9.5: Access control and role-based views

  • Subtopic 10.1: Open-source options: Amundsen, DataHub, Apache Atlas

  • Subtopic 10.2: Commercial platforms: Alation, Collibra, Informatica

  • Subtopic 10.3: Feature comparison and selection criteria

  • Subtopic 10.4: Licensing and pricing considerations

  • Subtopic 10.5: Vendor support and extensibility

  • Subtopic 11.1: Faceted search, autocomplete, and filtering

  • Subtopic 11.2: Metadata enrichment for better UX

  • Subtopic 11.3: Recommendation engines in data catalogs

  • Subtopic 11.4: Bookmarking, annotations, and feedback

  • Subtopic 11.5: Personalization and usage tracking

  • Subtopic 12.1: Metadata pipeline automation using CI/CD

  • Subtopic 12.2: Policy triggers and automated validation

  • Subtopic 12.3: Scheduling lineage refresh and quality scans

  • Subtopic 12.4: Workflow orchestration with Airflow or Prefect

  • Subtopic 12.5: Alerting on stale or incomplete metadata

  • Subtopic 13.1: Linking metadata to access control systems

  • Subtopic 13.2: Tag-based access and masking rules

  • Subtopic 13.3: Conditional access based on sensitivity

  • Subtopic 13.4: Auditing metadata usage

  • Subtopic 13.5: Policy enforcement and compliance reporting

  • Subtopic 14.1: Using metadata in feature engineering

  • Subtopic 14.2: Discovering reusable data assets

  • Subtopic 14.3: Tracking model input/output lineage

  • Subtopic 14.4: Ensuring explainability and transparency

  • Subtopic 14.5: Metadata in MLOps pipelines

  • Subtopic 15.1: Trends: active metadata, knowledge graphs

  • Subtopic 15.2: Metadata interoperability and APIs

  • Subtopic 15.3: AI/ML enhancements for catalogs

  • Subtopic 15.4: Self-service and crowdsourced metadata

  • Subtopic 15.5: Building an enterprise metadata strategy

$ 3,000

This Programme Includes

Certificate of completion

Training manual

Reference materials

10 o'clock tea

Lunch

4 o'clock tea

Course Highlights
  • 10 Days Intensive Training

  • 15 Core Learning Topics

  • 10 Days Professional Sessions

  • Expert-led Training Delivery

PB Training Institute of Research and Consultancy