Add data-classification.md extension #1317

rob-sessink · 2024-11-13T08:27:00Z

Provides an extension where an event source can annotate an event with
information around data classification of an event and its payload. CloudEvents
may contain payload which is subjected to data protection regulations like GDPR
or HIPAA. For intermediaries and consumers knowing how event payload is
classified enables compliant processing of an event.

Adds an extension with attributes:

dataclassification (Required). Data classification level of an event and
payload within the context of a data protection regulation.
dataregulation (Optional). Applicable data protection regulation.
datacategory (Optional). Data category of the event payload within the
context of data classification and data protection regulation.

Signed-off-by: Rob Sessink <rob.sessink@gmail.com>

cloudevents/extensions/data-classification.md

duglin · 2024-11-13T12:14:39Z

cloudevents/extensions/data-classification.md

+  `confidential`, `restricted`.
+- Constraints:
+  - REQUIRED
+  - SHOULD be applicable to data protection regulation.


Can you elaborate on what this "SHOULD" means? What does someone need to do (from a coding perspective) to adhere to this "SHOULD"?

The SHOULD statement is merely meant as an indication towards event producers that the data classification label should have its origin within the applicable data-regulation. But maybe this is stating the obvious and from a coding perspective not relevant. Being already stated in the description, it does not add value. I will remove it

duglin · 2024-11-13T12:19:41Z

cloudevents/extensions/data-classification.md

+`datacategory` attributes MAY be set to provide additional details on the
+classification context.
+
+Intermediaries and consumers SHOULD take these attributes into account and act


I wonder if this "SHOULD" should be a "MUST" instead? Should a consumer reject a request if it can't meet the data regulation requirements? Are clients expecting some kind of guarantee? Meaning, a non-error means "yup, got it and it'll be protected appropriately". Although, extensions can be ignored... maybe it would need to be worded like: "If an implementation supports this extension, then it MUST reject the event if it can not adhere to the requirements of the specified data classification attributes" ??

This raises an interesting possibility, which is too late for v1 but could be interesting in a future version: if an event could say "consumers/intermediaries must understand extensions x, y and z, and must otherwise reject/ignore the event" then we could be stricter. (So that would be an attribute that's part of the main spec, but the values of which would be names of extension attributes.)

https://www.w3.org/TR/soap12/#soapmu :-)

Yes changing this section to be more prescriptive towards consumers is warranted. When an implementation supports this extension, an event MUST be handled in a compliant manner or otherwise MUST be rejected/ignored.

I will adjust the phrasing.

duglin · 2024-11-13T12:20:56Z

Can you update the README in the "extensions" dir too?

duglin · 2024-11-13T12:23:38Z

cloudevents/extensions/data-classification.md

+- Type: `String`
+- Description: Data classification level for the event payload within the
+  context of a `dataregulation`. Typical labels are: `public`, `internal`,
+  `confidential`, `restricted`.


I suspect these values are probably defined by the data regulations being adhered to, but since dataregulation is optional, should this spec define some recommended values for cases where it's missing to provide some consistency?

Yes I feel that is a good approach. I did not want to make the dataregulation attribute required as I feel this is supportive information and not directly mandatory for processing. My intent is that usage of this extension should be as light as possible, meaning less required attributes as possible.

What do you think of:

Description: Data classification level for the event payload within the context of a dataregulation. In a situation where dataregulation is undefined, recommended labels are: public, internal, confidential, or restricted.

cloudevents/extensions/data-classification.md

cloudevents/languages/he/extensions/data-classification.md

…README.md and usage of MUST keyword in example use case - Signed-off-by: Rob Sessink <rob.sessink@gmail.com>

cloudevents/extensions/data-classification.md

jskeet · 2024-11-14T14:05:58Z

cloudevents/extensions/data-classification.md

+accordingly to data regulations and/or internal policies when processing the
+event and payload.
+
+Intermediaries SHOULD NOT modify the `dataclassification`, `dataregulation`, and


Is a redacting intermediary a valid use case? (I'm guessing not - that they'd end up effectively being a new event producer, as it's no longer the same event really if a bunch of information has been removed. But I thought I'd mention it as a possibility.)

I agree that when an intermediary changes an event or payload it becomes a new event. Than also the role of the intermediary shifts to that of event producer and it has responsibility/freedom to define the classification attributes.

Reading the CloudEvents specification, intermediaries forward/route messages and don´t redact them, so I would want to leave it this way.

jskeet · 2024-11-14T14:08:10Z

cloudevents/extensions/data-classification.md

+`datacategory` attributes MAY be set to provide additional details on the
+classification context.
+
+Intermediaries and consumers SHOULD take these attributes into account and act


This raises an interesting possibility, which is too late for v1 but could be interesting in a future version: if an event could say "consumers/intermediaries must understand extensions x, y and z, and must otherwise reject/ignore the event" then we could be stricter. (So that would be an attribute that's part of the main spec, but the values of which would be names of extension attributes.)

Signed-off-by: Rob Sessink <rob.sessink@gmail.com>

Add data-classification.md extension

eba302b

Signed-off-by: Rob Sessink <rob.sessink@gmail.com>

duglin reviewed Nov 13, 2024

View reviewed changes

cloudevents/extensions/data-classification.md Outdated Show resolved Hide resolved

duglin reviewed Nov 13, 2024

View reviewed changes

cloudevents/extensions/data-classification.md Outdated Show resolved Hide resolved

duglin reviewed Nov 13, 2024

View reviewed changes

cloudevents/languages/he/extensions/data-classification.md Outdated Show resolved Hide resolved

FIX based upon PR comments: correct spelling, add link in extensions/…

eb45fb5

…README.md and usage of MUST keyword in example use case - Signed-off-by: Rob Sessink <rob.sessink@gmail.com>

jskeet reviewed Nov 14, 2024

View reviewed changes

FIX based upon PR comments: improve spelling

58ff760

Signed-off-by: Rob Sessink <rob.sessink@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add data-classification.md extension #1317

Add data-classification.md extension #1317

rob-sessink commented Nov 13, 2024

duglin Nov 13, 2024

rob-sessink Nov 16, 2024

duglin Nov 13, 2024

jskeet Nov 14, 2024

duglin Nov 14, 2024

rob-sessink Nov 16, 2024

duglin commented Nov 13, 2024

duglin Nov 13, 2024

rob-sessink Nov 16, 2024

jskeet Nov 14, 2024

rob-sessink Nov 16, 2024 •

edited

Loading

jskeet Nov 14, 2024

Add data-classification.md extension #1317

Are you sure you want to change the base?

Add data-classification.md extension #1317

Conversation

rob-sessink commented Nov 13, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

duglin commented Nov 13, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rob-sessink Nov 16, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rob-sessink Nov 16, 2024 •

edited

Loading