🏛

Data Governance Platform

Training Manual & User Guide

Complete guide to managing business glossaries, technical metadata, data lineage, approval workflows, and AI-powered data governance across industry verticals.

Version 1.0 • 2025
Supports: Banking • Insurance • Manufacturing • Ecommerce

Table of Contents

1Getting Started
2Platform Overview & Navigation
3Industry Verticals
4Training Mode vs. Authenticated Mode
5User Roles & Permissions
6Business Glossary
7Business Domains & Subject Areas
8Technical Metadata
9Data Lineage
10Critical Data Elements (CDEs)
11Approval Workflows
12Bulk Upload & Import
13AI-Powered Features
14Analytics & Audit
15Administration
16Learning Center & Regulations
17Step-by-Step Workflows
18Cloud Deployment
19Glossary of Key Terms
20Quick Reference Card

1. Getting Started

What is Data Governance?

Data Governance is the practice of managing data as a strategic enterprise asset. It involves establishing policies, procedures, and standards to ensure data quality, security, compliance, and usability across the organization.

What This Platform Does

The Data Governance Platform provides a centralized system to:

Accessing the Platform

Training Mode (No Login)

Open the app URL directly. You enter Training Mode automatically. Explore all features freely — changes are saved in your browser session only.

Session Only

Authenticated Mode (Login)

Click Admin Login in the top bar. Enter your credentials. All changes persist to the database with full audit trail.

Persistent

Tip
Training Mode is ideal for learning the platform, running demos, and exploring features without affecting production data.

System Requirements

2. Platform Overview & Navigation

Main Navigation Sidebar

The left sidebar is your primary navigation tool. It is organized into functional sections:

SectionPagesPurpose
Learn Concepts, Platform Guide, A-Z Glossary, Regulations Educational resources and reference materials
Glossary Browse Terms, Create Term, Taxonomy Manage business terms and definitions
Organization Domains, Metadata, CDE Registry Structure and link business & technical data
Data Flow Lineage List, Lineage Graph Track data movement between systems
Operations Approval Inbox, Upload Manage approvals and bulk data imports
Insights Analytics Dashboard, Audit Log Monitor governance health and track changes
AI AI Assistant, AI Settings AI chat and provider configuration
Admin User Management, Settings System administration (admin only)

Top Bar

Dashboard

The home page provides an executive overview with:

3. Industry Verticals

The platform ships with four industry verticals, each pre-loaded with industry-specific data including domains, business terms, technical systems, lineage mappings, and sample users.

VerticalDomains ExampleSystems ExampleUse Case
Banking Retail Banking, Treasury, Risk, Compliance Core Banking, EDW, Regulatory Reporting Global bank governance
Insurance Underwriting, Claims, Policy Admin, Actuarial Policy System, Claims DB, Reinsurance Multi-line insurer
Manufacturing Supply Chain, Production, Quality, Logistics ERP, MES, SCADA, Quality Lab Industrial operations
Ecommerce Catalog, Orders, Customers, Marketing Product DB, Order System, CRM Online retail platform

Switching Verticals

1
Click the vertical name at the top of the sidebar
2
Select the desired industry vertical from the dropdown
3
Confirm the switch — the database will be completely reseeded
4
The page reloads with the new industry-specific data
Important
Switching verticals resets the entire database with the selected industry's data. Any custom terms, domains, or lineage mappings will be lost. AI provider configurations are preserved across switches.

4. Training Mode vs. Authenticated Mode

FeatureTraining ModeAuthenticated Mode
Login RequiredNo (default experience)Yes
Data PersistenceBrowser session onlyDatabase (permanent)
Create / Edit TermsYes (session only)Yes (database)
AI FeaturesFully functionalFully functional
Approval WorkflowSimulatedFull multi-step workflow
Upload FilesValidation onlyFull import to database
Role-Based AccessAll features visibleEnforced by role
Audit TrailNo loggingComplete audit log
AnalyticsView-onlyFull metrics & tracking
IndicatorTraining Mode badgeUser name displayed

How Training Mode Works

Tip
Training Mode is perfect for: learning the platform, running client demos, testing governance workflows, and exploring AI features — all without affecting real data.

5. User Roles & Permissions

RoleAccess LevelTypical User
ADMIN Full platform access — user management, settings, all CRUD operations IT governance lead, system administrator
DATA_STEWARD Create/edit terms, submit for approval, manage metadata, upload files, edit lineage Data governance analyst, domain coordinator
DATA_OWNER View terms, approve/reject terms for their domain, manage domain governance Business unit head, department leader
DATA_CUSTODIAN View terms, final approval authority, link & verify metadata DBA, data engineer, DDPO
VIEWER Read-only access to all glossary, lineage, and analytics pages Stakeholders, data consumers, auditors

Dual-Approval Workflow

Steward Creates Term Data Owner Reviews Data Custodian Approves APPROVED

At any step, a reviewer can reject the term with comments, sending it back to DRAFT status.

6. Business Glossary

Browse Terms

Navigate to Glossary > Browse Terms to see all business terms in a searchable, filterable table.

Filters Available

Table Columns

Create a New Term

Navigate to Glossary > Create Term to open the creation form.

Form Fields

FieldRequiredDescription
Term NameYesThe business name for this data concept
DefinitionYesClear, unambiguous description of the term
Business ContextNoHow this term is used in business operations
Example ValuesNoSample data values (comma-separated)
DomainNoBusiness domain this term belongs to
ClassificationNoSensitivity level (PUBLIC to PCI)
CDENoMark as Critical Data Element
Data OwnerNoPerson responsible for business decisions
Data CustodianNoPerson responsible for technical management
SynonymsNoAlternative names (SYNONYM, ALIAS, ACRONYM)
AI Assist
Click the "AI Suggest" button after entering a term name. The AI will generate a suggested definition, business context, classification, example values, synonyms, and CDE recommendation. Click "Use All Suggestions" to apply them all at once.

Term Detail Page

Click any term name to view its full detail page with five tabs:

Term Taxonomy

Navigate to Glossary > Taxonomy to see a hierarchical tree view of terms and their relationships (parent/child/IS_A/IS_PART_OF).

7. Business Domains & Subject Areas

What Are Domains?

Domains are logical groupings of related business terms. For example, a Banking vertical might have domains like "Retail Banking," "Treasury," "Risk Management," and "Compliance."

Browse Domains

Navigate to Organization > Domains to see all domains displayed as cards showing:

Domain Detail

Click on a domain card to see:

Subject Areas

Subject areas help organize terms within a domain. For example, the "Retail Banking" domain might have subject areas like "Deposits," "Loans," and "Cards."

Ownership Inheritance

If a domain has a Data Owner assigned, all terms in that domain inherit the owner unless the term has its own explicit owner. An inheritance indicator icon appears next to inherited owners.

8. Technical Metadata

Metadata Explorer

Navigate to Organization > Metadata to explore technical metadata (database systems, tables, and columns).

Filters

Metadata Fields

FieldDescription
System NameThe source system (e.g., EDW, CORE_BANKING)
Database / SchemaDatabase and schema location
Table NamePhysical table name
Column NamePhysical column name
Data TypeSQL data type (VARCHAR, INT, DATE, etc.)
Primary KeyWhether this column is part of the primary key
NullableWhether the column allows NULL values
Linked TermBusiness term mapped to this column (if any)

Linking Terms to Metadata

From a Term Detail page > Technical Metadata tab, you can link the term to specific database columns:

1
Click "Link Metadata" button
2
Use the cascading picker: System → Table → Column, or use the search box
3
Choose link type: DEFINES, POPULATED_BY, DERIVED_FROM, or REFERENCED_IN
4
Click "Link" to create the mapping
5
Click the verify icon to mark the link as verified

Link Types Explained

TypeMeaningExample
DEFINESAuthoritative source column"Customer ID" DEFINES CUSTOMER.CUST_ID
POPULATED_BYDownstream copy of data"Customer ID" POPULATED_BY DIM_CUSTOMER.CUST_KEY
DERIVED_FROMInput to a calculation"Net Interest Margin" DERIVED_FROM INCOME.INTEREST_REVENUE
REFERENCED_INIndirect reference"Account Type" REFERENCED_IN REPORTS.ACCT_RPT.TYPE_CD

9. Data Lineage

What is Data Lineage?

Data lineage tracks how data moves and transforms between systems — from source databases through ETL/ELT pipelines to final reports and analytics.

Lineage Mappings List

Navigate to Data Flow > Lineage to see all source-to-target mappings in a table showing:

Interactive Lineage Graph

Navigate to Data Flow > Visualization for an interactive D3.js graph showing:

Color Legend

ColorSystem Type
BlueSource Systems
GreenEnterprise Data Warehouse (EDW)
AmberODS / Staging
PurpleReports & Analytics

Uploading Lineage Data

Lineage mappings can be uploaded in bulk using the ELT Mapping template from the Upload page. See Chapter 12: Bulk Upload for details.

10. Critical Data Elements (CDEs)

What is a CDE?

A Critical Data Element is a high-impact data field that requires special governance due to its importance to business operations, regulatory compliance, or risk management. Examples include Customer ID, Account Balance, Transaction Amount, and SSN.

CDE Registry

Navigate to Organization > CDE Registry for a dedicated view of all critical data elements with their technical mappings.

Registry Dashboard

Per-CDE Detail

Each CDE card expands to show:

Marking a Term as CDE

Check the "Critical Data Element" checkbox when creating or editing a term. AI suggestions may also recommend CDE status based on the term's definition and classification.

11. Approval Workflows

Workflow Process

The platform uses a dual-approval workflow to ensure governance quality:

DRAFT PENDING_REVIEW PENDING_APPROVAL APPROVED
StatusWho ActsAction
DRAFTData StewardCreates term, clicks "Submit for Approval"
PENDING_REVIEWData OwnerReviews business accuracy, approves or rejects
PENDING_APPROVALData CustodianReviews technical completeness, gives final approval
APPROVEDTerm is officially governed
REJECTEDReturned to DRAFT with reviewer comments

Approval Inbox

Navigate to Operations > Approval Inbox to see all pending workflows. Filter by:

Taking Action

12. Bulk Upload & Import

Navigate to Operations > Upload to import data from Excel (.xlsx) or CSV files.

Supported File Types

File TypeWhat It ImportsKey Columns
Glossary Import Business terms into the glossary term_name, definition, domain, classification, data_owner, is_cde
Data Dictionary Technical metadata (systems, tables, columns) system_name, database_name, schema_name, table_name, column_name, data_type
ELT Mapping Lineage mappings (source to target) source_system, source_table, source_column, target_system, target_table, target_column, transformation_type

Upload Steps

1
Download Template: Click the template download button for your file type
2
Fill Data: Populate the Excel template with your data (one record per row)
3
Select File Type: Choose the matching file type from the dropdown
4
Upload: Drag and drop the file or click to browse
5
Review Results: Check success count, duplicates skipped, and any errors
Tip
Duplicate records (matched by term name for glossary, or system+table+column for metadata) are automatically skipped. The results summary shows how many were skipped.

Upload History

View all past uploads at Operations > Upload > History with filename, status, row counts, and error details.

13. AI-Powered Features

Overview

The platform integrates with AI providers (Claude, OpenAI, Azure OpenAI) to assist with governance tasks. AI features work in both Training Mode and Authenticated Mode.

AI Capabilities

FeatureWhereWhat It Does
Suggest Definition Create/Edit Term page Generates definition, context, examples, synonyms, classification, and CDE recommendation
Suggest Subject Areas Domain Detail page Recommends sub-categories for organizing a domain
Suggest Domain Terms Domain Detail page Recommends business terms relevant to a domain and subject area
AI Chat Assistant AI > Assistant page Multi-turn conversation about data governance topics

Using AI Suggest on Terms

1
Go to Create Term and enter a term name (e.g., "Customer Lifetime Value")
2
Click the "AI Suggest" button in the suggestion panel on the right
3
Wait for the AI to generate suggestions (takes a few seconds)
4
Review each suggestion — click "Use" next to individual items or "Use All Suggestions" to accept everything

AI Chat Assistant

Navigate to AI > Assistant for a full chat interface. You can ask about:

AI Provider Configuration

Admin users can configure AI providers at AI > Settings:

Note
AI features require at least one AI provider to be configured with a valid API key. The platform works without AI — you simply won't see the suggestion buttons.

14. Analytics & Audit

Governance Analytics Dashboard

Navigate to Insights > Analytics for governance health metrics:

Coverage Metrics

Additional Insights

Audit Log

Navigate to Insights > Audit Log for a complete record of all changes:

Compliance Tip
The audit log provides the evidence trail needed for regulatory compliance audits (GDPR, SOX, HIPAA). Every data governance action is tracked with who, what, and when.

15. Administration

User Management

Navigate to Admin > Users (admin only) to manage platform users:

AI Settings

Navigate to AI > Settings (admin only) to configure AI providers:

Vertical Switching

The vertical selector at the top of the sidebar allows admins to switch the entire platform between industry datasets (Banking, Insurance, Manufacturing, Ecommerce).

Data Export

Admins can export data as Excel files:

16. Learning Center & Regulations

The Learn section in the sidebar provides educational content built into the platform:

PageContent
Concepts & PrinciplesCore data governance concepts: data quality, stewardship, metadata management, lineage, classification
Platform GuideHow to navigate and use the platform effectively
A-Z GlossaryInteractive reference tool with governance terminology
GDPREU General Data Protection Regulation compliance guide
HIPAAUS Health Insurance Portability and Accountability Act guide
UAE Data ProtectionUAE Federal Data Protection Law compliance guide
India DPDP ActIndia Digital Personal Data Protection Act guide

17. Step-by-Step Workflows

Workflow A: End-to-End Term Governance

1
Create Term: Go to Glossary > Create Term. Enter name, click "AI Suggest" for definition. Fill in domain, classification, owner. Click "Create Term." Status: DRAFT
2
Submit for Approval: On the term detail page, click "Submit for Approval." Status changes to: PENDING_REVIEW
3
Data Owner Review: Owner goes to Approval Inbox, finds the term, clicks "Approve." Status: PENDING_APPROVAL
4
Data Custodian Approval: Custodian goes to Approval Inbox, reviews technical completeness, clicks "Approve." Status: APPROVED
5
Link Metadata: On term detail > Technical Metadata tab, click "Link Metadata." Pick System → Table → Column. Set link type to DEFINES. Click "Link."
6
Verify Link: Click the verify icon next to the link. Now visible in CDE Registry (if CDE).

Workflow B: Bulk Data Import

1
Download Template: Go to Upload page. Click template download for your file type (Glossary, Data Dictionary, or ELT Mapping).
2
Prepare Data: Fill the Excel template with your data. One record per row. Save the file.
3
Upload: Select file type, drag and drop file, click "Upload & Process."
4
Review: Check results — successes, duplicates skipped, errors. Fix any errors and re-upload if needed.

Workflow C: Building Data Lineage

1
Upload ELT Mappings: Use the Upload page with an ELT Mapping template filled with your source-to-target transformations.
2
Browse Lineage: Go to Data Flow > Lineage to see all mappings in a table.
3
Visualize: Go to Data Flow > Visualization for the interactive graph. Filter by term or table. Adjust depth.
4
View from Term: On any term detail page, click the Lineage tab to see that term's specific data flow.

Workflow D: AI-Assisted Domain Setup

1
Create Domain: Go to Domains, click "New Domain." Enter name (e.g., "Risk Management") and description.
2
AI Suggest Subject Areas: On the domain detail page, click "AI Suggest Subject Areas." Review and accept suggestions (e.g., Credit Risk, Market Risk, Operational Risk).
3
AI Suggest Terms: Click "AI Suggest Terms" for each subject area. Review and accept suggested business terms.
4
Refine: Edit each term, use "AI Suggest" for full definitions, then submit for approval.

18. Cloud Deployment

Environment Variables

The platform uses environment variables for all configuration. Set these in your cloud platform's dashboard:

VariableRequiredDescription
SECRET_KEYYesFlask session encryption key. Generate with: python -c "import secrets; print(secrets.token_hex(32))"
FLASK_ENVYesSet to production for cloud deployment
DATABASE_URLYes (cloud)Database connection string. PostgreSQL recommended: postgresql://user:pass@host:5432/dbname
ANTHROPIC_API_KEYFor AIClaude (Anthropic) API key
OPENAI_API_KEYFor AIOpenAI API key
DEFAULT_AI_PROVIDERNoclaude, openai, or azure_openai
PORTNoAuto-set by most cloud platforms

Deployment Options

Docker

Build and run with the included Dockerfile:

docker build -t governance .

docker run -p 5000:5000 --env-file .env governance

Heroku / Render / Railway

Push your code and set environment variables in the dashboard. The included Procfile handles the rest.

Security
Never commit your .env file to version control. Use the platform's secret management features to store API keys securely. The included .gitignore protects sensitive files.

19. Glossary of Key Terms

TermDefinition
Business TermA word or phrase representing a business concept (e.g., "Customer ID," "Net Revenue")
DomainA logical grouping of related business terms (e.g., "Customer Management," "Risk")
Subject AreaA sub-category within a domain (e.g., "Deposits" within "Retail Banking")
ClassificationSensitivity level of data: PUBLIC, INTERNAL, CONFIDENTIAL, HIGHLY CONFIDENTIAL, PII, PCI
CDECritical Data Element — high-impact data requiring special governance
Technical MetadataPhysical database structures: systems, tables, columns, data types
LineageThe path data takes from source through transformations to target
Data OwnerPerson accountable for business decisions about data quality and usage
Data CustodianPerson responsible for technical management, security, and access control
Data StewardPerson who creates, maintains, and manages governance artifacts day-to-day
WorkflowMulti-step approval process (Steward → Owner → Custodian)
VerticalIndustry context with pre-loaded data (Banking, Insurance, Manufacturing, Ecommerce)
Training ModeNon-authenticated mode for exploration with browser-only data persistence
Derivation FormulaThe calculation rule for a derived data element (e.g., Revenue - Cost = Profit)
SynonymAn alternative name for a business term (SYNONYM, ALIAS, ACRONYM, ABBREVIATION)

20. Quick Reference Card

Common Tasks

I Want To...Go ToAction
Find a termGlossary > BrowseType in the search box
Create a new termGlossary > CreateFill form, click "AI Suggest" for help
Edit an existing termTerm Detail pageClick "Edit" button
Approve a termOperations > InboxClick "Approve" or "Reject"
Link term to columnTerm Detail > Metadata tabClick "Link Metadata", pick column
View data lineageData Flow > VisualizationInteractive graph with zoom/filter
Upload bulk dataOperations > UploadDownload template, fill, upload
Get AI suggestionsCreate Term pageClick "AI Suggest" button
Chat with AIAI > AssistantType question, select provider
Check governance healthInsights > AnalyticsView coverage metrics and charts
View change historyInsights > Audit LogFilter by entity type or action
Manage usersAdmin > UsersCreate/edit users and assign roles
Configure AIAI > SettingsAdd API key, test connection
Switch industrySidebar topClick vertical selector, choose industry
Export to ExcelGlossary / Lineage pagesClick "Export" button
View CDE mappingsOrganization > CDE RegistryExpand CDE cards to see links

Need Help?

Use the AI Assistant (AI > Assistant) to ask any data governance question, or explore the Learning Center (Learn section) for concepts and regulatory guides.


Data Governance Platform Training Manual v1.0
© 2025 • All Rights Reserved