You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+16-12
Original file line number
Diff line number
Diff line change
@@ -40,7 +40,11 @@ The fixed structure must be technology-agnostic. The first fields of teh fixed s
40
40
*`Email: [Option[String]]` point of contact between consumers and maintainers of the Data Product. It could be the owner or a distribution list, but must be reliable and responsive.
41
41
*`OwnerGroup [String]`: LDAP user/group that is owning the data product.
42
42
*`DevGroup [String]`: LDAP user/group that is in charge to develop and maintain the data product.
43
-
*`InformationSLA: [Option[String]]` describes what SLA the Data Product team is providing to answer additional information requests about the Data Product itself.
43
+
*`SupportSLA: [Option[String]]` describes what SLA the Data Product team is providing when some support is needed.
44
+
*`SupportHours: [Option[String]]` define when the suport is available. Ex During working days from 9 to 18
45
+
*`ResponseTime: [Option[String]]` define the amount of time needed to take care of an incoming feature
46
+
*`ResolutionTime: [Option[String]]` define the amount of time needed to fix the date
47
+
*`InformationTime: [Option[String]]` define the amount of time needed to answer clarification questions.
44
48
*`Status: [Option[String]]` this is an enum representing the status of this version of the Data Product. Allowed values are: `[Draft|Published|Retired]`. This is a metadata that communicates the overall status of the Data Product but is not reflected to the actual deployment status.
45
49
*`Maturity: [Option[String]]` this is an enum to let the consumer understand if it is a tactical solution or not. It is really useful during migration from Data Warehouse or Data Lake. Allowed values are: `[Tactical|Strategic]`.
46
50
*`Billing: [Option[Yaml]]` this is a free form key-value area where is possible to put information useful for resource tagging and billing.
@@ -94,18 +98,18 @@ Constraints:
94
98
*`IntervalOfChange: [Option[String]]` how often changes in the data are reflected.
95
99
*`Timeliness: [Option[String]]` the skew between the time that a business fact occurs and when it becomes visibile in the data.
96
100
*`UpTime: [Option[String]]` the percentage of port availability.
97
-
*`TermsAndConditions: [Option[String]]` If the data is usable only in specific environments.
98
101
*`Endpoint: [Option[URL]]` this is the API endpoint that self-describe the output port and provide insightful information at runtime about the physical location of the data, the protocol must be used, etc.
99
-
*`biTempBusinessTs: [Option[String]]` name of the field representing the business timestamp, as per the "bi-temporality" definition; it should match with a field in the related `Schema`
100
-
*`biTempWriteTs: [Option[String]]` name of the field representing the technical (write) timestamp, as per the "bi-temporality" definition; it should match with a field in the related `Schema`
101
-
*`DataSharingAgreement: [Yaml]` This part is covering usage, privacy, purpose, limitations and is independent by the data contract.
102
-
*`Purpose: [Option[String]]` what is the goal of this data set.
103
-
*`Billing: [Option[String]]` how a consumer will be charged back when it consumes this output port.
104
-
*`Security: [Option[String]]` additional information related to security aspects, like restrictions, masking, sensibile information and privacy.
105
-
*`IntendedUsage: [Option[String]]` any other information needed by the consumer in order to effectively consume the data, it could be related to technical stuff (e.g. extract no more than one year of data for good performances ) or to business domains (e.g. this data is only useful in the marketing domains).
106
-
*`Limitations: [Option[String]]` If any limitation is present it must be made super clear to the consumers.
107
-
*`LifeCycle: [Option[String]]` Describe how the data will be historicized and how and when it will be deleted.
108
-
*`Confidentiality: [Option[String]]` Describe what a consumer should do to keep the information confidential, how to process and store it. Permission to share or report it.
102
+
*`DataSharingAgreement: [Yaml]` This part is covering usage, privacy, purpose, limitations and is independent by the data contract.
103
+
*`TermsAndConditions: [Option[String]]` If the data is usable only in specific environments.
104
+
*`Purpose: [Option[String]]` what is the goal of this data set.
105
+
*`Billing: [Option[String]]` how a consumer will be charged back when it consumes this output port.
106
+
*`Security: [Option[String]]` additional information related to security aspects, like restrictions, masking, sensibile information and privacy.
107
+
*`IntendedUsage: [Option[String]]` any other information needed by the consumer in order to effectively consume the data, it could be related to technical stuff (e.g. extract no more than one year of data for good performances ) or to business domains (e.g. this data is only useful in the marketing domains).
108
+
*`Limitations: [Option[String]]` If any limitation is present it must be made super clear to the consumers.
109
+
*`LifeCycle: [Option[String]]` Describe how the data will be historicized and how and when it will be deleted.
110
+
*`Confidentiality: [Option[String]]` Describe what a consumer should do to keep the information confidential, how to process and store it. Permission to share or report it.
111
+
*`biTempBusinessTs: [Option[String]]` name of the field representing the business timestamp, as per the "bi-temporality" definition; it should match with a field in the related `Schema`
112
+
*`biTempWriteTs: [Option[String]]` name of the field representing the technical (write) timestamp, as per the "bi-temporality" definition; it should match with a field in the related `Schema`
109
113
*`Tags: [Array[Yaml]]` Tag labels at OutputPort level, here we can have security classification for example (please refer to [OpenMetadata documentation](https://docs.open-metadata.org/v1.0.0/main-concepts/metadata-standard/schemas/type/taglabel)).
110
114
*`SampleData: [Option[Yaml]]` provides a sample data of your Output Port (please refer to [OpenMetadata specification](https://docs.open-metadata.org/v1.0.0/main-concepts/metadata-standard/schemas/entity/data/table#properties)).
111
115
*`SemanticLinking: [Option[Yaml]]` here we can express semantic relationships between this output port and other outputports (also coming from other domains and data products). For example, we could say that column "customerId" of our SQL Output Port references the column "id" of the SQL Output Port of the "Customer" Data Product.
purpose: this output port want to provide a rich set of profitability KPIs related to the customer
71
-
billing: 5$ for each full scan
72
-
security: In order to consume this output port an additional security check with compliance must be done
73
-
intendedUsage: the dataset is huge so it is recommended to extract maximum 1 year of data and to use these KPIs in the marketing or sales domain, but not for customer care
74
-
limitations: is not possible to use this data without a compliance check
75
-
lifeCycle: the maximum retention is 10 years, and eviction is happening on the first of january
76
-
confidentiality: if you want to store this data somewhere else, PII columns must be masked
73
+
dataSharingAgreements:
74
+
termsAndConditions: only usable in development environment
75
+
purpose: this output port want to provide a rich set of profitability KPIs related to the customer
76
+
billing: 5$ for each full scan
77
+
security: In order to consume this output port an additional security check with compliance must be done
78
+
intendedUsage: the dataset is huge so it is recommended to extract maximum 1 year of data and to use these KPIs in the marketing or sales domain, but not for customer care
79
+
limitations: is not possible to use this data without a compliance check
80
+
lifeCycle: the maximum retention is 10 years, and eviction is happening on the first of january
81
+
confidentiality: if you want to store this data somewhere else, PII columns must be masked
0 commit comments