Skip to main content

Data Products in SAP Business Data Cloud


Data Products is the fundamental building block of SAP Business Data Cloud and it serves as a standardized and efficient way to share and consume data across applications and domains. They enable analytic scenarios, AI applications, and facilitate data integration while being optimized for intensive reads. Managed with a product mindset, they are supported by high-quality metadata and governed by decentralized ownership principles.

Purpose of Data Products

Customers currently face challenges when accessing and using data for various tasks such as building new applications, integrating applications, creating analytical models, designing dashboards, and training AI applications. The process of finding relevant data is difficult due to the lack of consistency and clarity in the APIs that expose data and metadata. The metadata often only describes technical aspects, making it hard for business users to utilize, and some necessary data isn't available via APIs. To improve this, the consistent metadata exposed by Data Products will allow for a landscape-specific Data Product to be directly exposed in every customer landscape. This will be facilitated by the catalog, and will promote ODM compliant Data Products, making it easier to combine data from different applications, and automate scenarios like validation, discovery, and integration.

In SAP, a Data Product is a dataset exposed for consumption outside the boundaries of the producing application via APIs and described by high quality metadata that can be accessed through the Data Product Catalog.

  • It is a business dataset and consists of one or more business object entities, related objects, analytical data etc
  • It is consumable via APIs or events.Supported API types are SQL (incl. SQL interface views), Delta Sharing, Event, REST, (oData)
  • It is described with that is of high quality and provided via Open Resource Discovery (ORD), following ORD schema for Data Product
  • Is is discoverable via the Data Product Directory that is a service of UCL that aggregates metadata of all Data Products to make them discoverable in a landscape.

Key Characteristic of Data Product

  • Efficient Data Sharing Across Domains: Data Products facilitate replication-free integration and consumption by SAP, customer, and partner applications via APIs and Events.

  • Efficient Data Storage: Data Products are processed and curated in a modern object-store based data lake architecture with highly scalable Spark processing.

  • High-Quality Metadata: Metadata includes business semantics and is accessible through directories like the Data Product Directory and ORD Aggregators, enhancing discoverability.

  • Optimized for Analytics:Data sets are curated for intensive reads and consumed in a read-only fashion, ensuring efficiency and reliability.

  • Supporting Analytical and AI Applications: Data Products provide foundational data for dashboards, analytical models, and AI applications.

  • Decentralized Ownership: Inspired by Data Mesh principles, Data Products are owned and managed by domain experts or teams responsible for operational data.

  • Lifecycle Management: Data Products undergo needs analysis, design, and delivery phases, ensuring their accuracy and relevance.

Key Attributes of Data Products

AttributeDescription
TypeDefines the type (primary, derived).
CategoryCategorizes the data set (e.g., business-object, analytical).
VisibilitySpecifies exposure (public, internal, private).
Input PortsIntegration dependencies describing data inputs for lineage purposes.
Output PortsAPIs or Events through which the data can be accessed.
Integration DependenciesRelationships with external resources.
ResponsibleOrganization/team responsible for lifecycle management.

Types of Data Products

There are two main approaches to implementing data products in SAP:

  • Primary Data Products: Expose data directly from source applications (e.g., S/4HANA, SuccessFactors)
  • Derived Data Products: Transform and combine data from one or more existing data products
image of solution diagram
Copy to clipboard
Solution Diagram Resources
You can download the Solution Diagram as a .drawio file for offline use. Alternatively, you may view and edit the Solution Diagram directly on draw.io.
Please note that any changes made online will need to be saved locally if you wish to keep them.

A primary data product is directly provided from applications and is not based on other data products representing the original data from a source application.

A derived data product is curated by SAP and derived from other data set(s). They deliver value-add and are based on other data products or APIs.

Released Data Products

SAP Roadmaps - Business Capability -Data Product Management

Technical Aspects of Data Products

  • Foundation Services facilitate the discovery, extraction, and consumption of SAP-managed Data Products. These services will onboard the different Lines of Business (LoB) applications in SAP.

    • Data pipeline management
    • Discovery via ORD Protocol - Enables decentralized discoverability of Data Products.
    • Replication to Object Store
    • SAP HANA Cloud Data Lake - Stores data as Data Lake tables, following Delta Sharing principles.Follows Medallion architecture with Bronze, Silver and Gold (optionally) and processing of data via Spark processing framework.
    • File-based Storage
  • Core Protocols

    • Open Resource Discovery(ORD)
    • Delta Sharing is an open protocol for sharing data stored in cloud-based Data Lake tables. It enables:
      • Zero-Copy Data Access: No need to move or copy data, ensuring efficient data usage.
      • Central Governance: Controlled access and scalable sharing.
      • Multi-Tool Support: Compatible with tools like Apache Spark and Python.
  • Implementation Workflow

    • Provisioning via SAP for Me
    • Connecting source applications
    • Activating Data Products
    • Installing Intelligent Content
  • Consumption Tools

    • SAP HANA Cloud: Consumes data products via Delta Sharing protocol, enabling HANA Cloud workloads such as CDS views, calculation views, and CAP applications to leverage governed BDC data without replication.
    • SAP Databricks: Accesses data products directly through Unity Catalog integration for AI/ML and advanced analytics workloads.
    • SAP Analytics Cloud: Consumes data products for visualization, reporting, and planning scenarios.
    • Databricks: Uses Delta Sharing protocol for zero-copy data access in external Databricks environments.
    • Any Delta Sharing Client: Any application supporting the Delta Sharing protocol can consume BDC data products.

Workflow - Data Product Discovery, Generation, Delta-Sharing and Consumption

Discovery and Generation: Consumers query ORD Aggregators to find Data Products and associated metadata.This diagram shows how data product definitions are discovered and generated. Source Lines of Business (LoB) provide data product definitions, which are read and listed by the SAP Business Data Cloud catalog. These definitions are then used to generate data lake tables, making data products available for consumption.

Data Sharing View: This diagram illustrates how data is shared between providers and recipients using the Delta Sharing protocol. The Data Provider manages data lake tables and access permissions, which are exposed through a Delta Sharing Server.

Consumption Workflow: This diagram illustrates how consumers discover, integrate, and access data products.

Data Product Extensibility

SAP Data Products also support customer extensions within any SAP Sources & locally defined extension fields to build Data Product Customer Extensions. This requires activation of data package directly to enable custom modeling directly. This is one of the simple extensibility option for adapting data products to enable custom modeling.

image of solution diagram
Copy to clipboard
Solution Diagram Resources
You can download the Solution Diagram as a .drawio file for offline use. Alternatively, you may view and edit the Solution Diagram directly on draw.io.
Please note that any changes made online will need to be saved locally if you wish to keep them.
  1. SAP Data Products also support customer extensions within any SAP Sources & locally defined extension fields.
  2. Customer field extensions are supported on a “pass through basis” automatically added to the SAP Data Product and data processing pipeline
  3. Govern SAP data product + customer fields access in the data catalog and execute data product installation in SAP Datasphere.
  4. Create SAP Datasphere models & SAP Analytics Cloud stories for consumption.

Conclusion

Data Products in SAP Business Data Cloud streamline data sharing, integration, and analytics across domains. Supported by high-quality metadata, Delta Sharing protocols, and decentralized ownership principles, they are optimized for intensive reads and managed with a product mindset. Foundation Services ensure seamless discoverability and consumption, empowering businesses to make data-driven decisions efficiently.