Practical Guidelines for Implementing Data Products

Data Catalog, Data Fabric, Data Products, Data Marketplace

Description

Most companies today are storing data and running applications in a hybrid multi-Cloud environment.  Analytical systems tend to be centralized and siloed like Data Warehouses and Data Marts for BI, Hadoop or Cloud storage Data Lakes for Data Science and stand-alone streaming analytical systems for real-time analysis. 

These centralized systems rely on Data Engineers and Data Scientists working within each silo to ingest data from many different sources, clean and integrate it for use in a specific analytical system or Machine Learning models.  There are many issues with this centralized, siloed approach.

To address these issues, a new  approach has emerged attempting to accelerate creation of data for use in multiple analytical workloads.  That approach is to create Data Products.

This is a decentralized business domain-oriented approach to data ownership and data engineering to create a mesh of reusable Data Products that can be created once and shared across multiple analytical systems and workloads. Multiple Data Architecture options are available to create a Data Products can be implemented in a number of ways.

This 2-day class looks at Data Products in detail and examines its strengths, and weaknesses.  It also looks at the strengths and weaknesses of Data Product implementation options.

It also looks at the organizational implications of democratized Data Product development.

Technologies discussed includes Data Catalogs, Data Fabric for collaborative development of data integration pipelines to create Data Products, DataOps to speed up the process, Data Orchestration automation, Data Marketplaces and Data Governance platforms.

 

What you will learn

  • The problems caused in existing analytical systems by a hybrid, multi-Cloud data landscape
  • Strengths and weaknesses of centralised Data Architectures used in analytics
  • What are Data Products and how do they differ from other approaches?
  • What benefits do Data Products offer and what are the implementation options?
  • What are the principles, requirements, and challenges of implementing Data Products?
  • How to organise to create Data Products in a decentralised environment so you avoid chaos
  • The critical importance of a Data Catalog in understanding what data is available
  • How Business Glossaries can help ensure Data Products are understood and semantically linked
  • A best practice organisational model for coordinating development of Data Products across different domains to succeed in implementation
  • What software is required to build, operate and govern Data Products for use in Data Science, a Data Warehouse, Graph Analysis and other analytical workloads?
  • What is Data Fabric software, how does it integrate with Data Catalogs and connect to data in your data estate
  • An Implementation methodology to produce ready-made, trusted, reusable Data Products
  • Collaborative domain-oriented development of modular and distributed DataOps pipelines to create Data Products
  • How a Data Catalog, Generative AI and Data Automation software can be used to generate DataOps pipelines to create Data Products
  • Managing data quality, privacy, access security, versioning, and the lifecycle of Data Products
  • Pros and cons of different Data Architecture options for implementing Data Products
  • Federated Data Architecture and Data Products
  • Persisting master Data Products in an MDM system
  • Consuming and assembling Data Products in multiple analytical workloads like Data Warehouses, Data Science and Graph Analytics to shorten time to value
  • How to implement federated Data Governance

Main Topics

  • What are Data Products and why are they needed?
  • Organising and standardising your environment to support democratised Data Product development
  • Methodologies for creating Data Products
  • Defining and designing Data Products using a Catalog Business Glossary and data modelling
  • Sourcing, mapping and data quality profiling data for your Data Products
  • Building DataOps pipelines to create reusable Data Products
  • Implementing federated Data Governance to produce and use compliant Data Products
Mike Ferguson

Cost

€1.200,00

Date

08 - 09 May 2025

Timing: from 9.30 am to 5 pm Italian time

Location

Online event

Book Event

Participation fee - 1200€
Available Places: 100
The "Participation fee - 1200€" ticket is sold out. You can try another ticket or another date.
Registration 30 days before the seminar date: 5% discount - € 1140
Available Places: 100
The "Registration 30 days before the seminar date: 5% discount - € 1140" ticket is sold out. You can try another ticket or another date.
Share on:
Facebook
Twitter
LinkedIn
Email
WhatsApp
Pocket
Reddit