clarify Content-Type / Content-Encoding / Content-Language handling #160

nigoroll · 2020-09-09T09:34:15Z

see #158 for discussion

Acconut · 2020-09-16T18:22:40Z

I am not sure if adding recommendations about Content-Encoding and Content-Language headers is a good idea. If I understand correctly, these headers are mostly relevant for GET requests, which tus does neither mention nor use. Of course, if your server supports GET requests nevertheless, the Content-Encoding and Content-Language headers may also be included in HEAD responses but that in itself is not relevant for tus. Is it understandable what I mean?

The changes regarding metadata are good, but let's clarify the other point first.

closes tus#158

nigoroll · 2020-09-17T08:36:33Z

(force-pushed only to resolve the merge conflict, no content changed)

re @Acconut regarding your interpretation that the HTTP metadata headers were specific to GET please see #158 (comment) - they are specific to the resource and in the case of tus that is the uploaded object.

Regarding Content-Encoding, if a client chose to use it and the tus server simply ignored it, things would get terribly wrong. For example, if a client uploaded text/html with Content-Encoding: gzip and the server neither decoded the upload not stored the information about the encoding, the upload would be corrupt in the sense that the encoded form simply is not text/html.

Please note that RFC7231 explicitly considers uploads in this paragraph:

An origin server MAY respond with a status code of 415 (Unsupported Media Type) if a representation in the request message has a content coding that is not acceptable.

smatsson · 2020-09-17T08:42:21Z

The specific metadata keys documented herein are reserved for the
respective use and MUST NOT be used for other purposes.

Note really a fan of this part at all. There are multiple implementations out there already and if they use any of the keys defined for anything else than this update says then they would be in violation with the spec.

In other parts I agree that this would be a good addition to the spec.

nigoroll · 2020-09-17T08:50:27Z

@smatsson would you have a better suggestion to reserving the metadata keys? My original suggestion was to make a breaking change and use Content-Type with the next tus major version.

smatsson · 2020-09-17T09:06:40Z

@nigoroll To be the devil's advocate: Why would we need to reserve metadata keys to begin with? The 1.0 version of protocol has been around since 2015 and haven't included any reserved keys. I think that the openness of not reserving metadata keys is a good thing as words have different meanings in different contexts. For example I would not translate "filetype" to Content-Type but rather to file extension ("It's a pdf"). I don't necessarily disagree with having reserved keys, it's just that it's 5 years to late. Breaking changes to a protocol that is out in the wild is not a good thing as issues will arise.

nigoroll · 2020-09-17T09:15:03Z

@smatsson we cannot have agreement on semantics without agreement on how to represent that semantics. My impression from the discussion in #158 was that the filename and filetype metadata keys were a de-facto standard because uppy/tusd use them.
But you are right, no matter if we reserved metadata keys or brought back the original Content-Type semantics, both would be breaking changes and thus formally require a protocol major version bump.
This PR was intended as a compromise trying to avoid that, but in fact I think that a new version with changed semantics would be the better option.

smatsson · 2020-09-17T09:25:27Z

@nigoroll I'm only concerned with the breaking changes and that the protocol will be ambiguous between implementations and when they were implemented (as the spec changed). As I said earlier I don't really object to the change per se, just the breaking change part. If you and @Acconut feel strongly that this should go in the spec it's fine by me.

Acconut · 2020-09-18T14:49:59Z

protocol.md

+
+##### [Content-Language](https://httpwg.org/specs/rfc7231.html#header.content-language)
+
+Clients and Servers SHOULD support the ``Content-Language`` header.


Can you elaborate more what "supporting the Content-Language header" means? To be concrete: Can you say what behavior a typical tus server (e.g. tusd) must have to support this header?

Acconut · 2020-09-18T14:50:55Z

protocol.md

+accept the ``Content-Encoding`` chosen by the client.
+
+Servers MUST either store the ``Content-Encoding`` and deliver it with
+subsequent requests, or properly decode the content before storing it.


deliver it with subsequent requests

Does this mean that the Content-Encoding header must be present in the response for every PATCH, DELETE and HEAD request?

Acconut · 2020-09-18T14:55:25Z

protocol.md

+* ``filename`` for a common file name
+
+The specific metadata keys documented herein are reserved for the
+respective use and MUST NOT be used for other purposes.


I agree with @smatsson that this sentence is too restrictive. It would be a breaking change and there are no plans to make a major release for tus. I don't think this would serve the ecosystem well.

That being said, I am happy to recommend (not force) the usage of the filename and filetype metadata keys for the purposes as laid out here.

Acconut · 2020-09-18T15:06:19Z

@nigoroll Would it be OK if I add some commits to this PR to show what I mean with some of my comments?

nigoroll · 2020-09-18T21:46:54Z

@Acconut sure

smatsson · 2020-09-28T10:32:09Z

Do we need to consider the Content-Encoding changing between requests? E.g. the file is created with Content-Encoding: gzip but the client then decides to not include it (and not encoding the content either) for subsequent requests?

Acconut · 2020-09-29T11:01:45Z

I adjusted the phrasing of the filename and filetype metadata keys to match the rest of the document and to also ensure that this is not a breaking change. Let me know what you think

@nigoroll There are also some questions from me above. Could you have a look at them when you have the time?

clarify Content-Type / Content-Encoding / Content-Language handling

9f96286

closes tus#158

nigoroll force-pushed the metadata branch from 0a6b061 to 9f96286 Compare September 17, 2020 08:20

Acconut reviewed Sep 18, 2020

View reviewed changes

Adjust phrasing of recommended metadata keys

5b307ae

KalleOlaviNiemitalo mentioned this pull request May 23, 2022

upload using gzip git-lfs/git-lfs#5015

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

clarify Content-Type / Content-Encoding / Content-Language handling #160

clarify Content-Type / Content-Encoding / Content-Language handling #160

nigoroll commented Sep 9, 2020

Acconut commented Sep 16, 2020

nigoroll commented Sep 17, 2020

smatsson commented Sep 17, 2020

nigoroll commented Sep 17, 2020 •

edited

Loading

smatsson commented Sep 17, 2020

nigoroll commented Sep 17, 2020 •

edited

Loading

smatsson commented Sep 17, 2020

Acconut Sep 18, 2020

Acconut Sep 18, 2020

Acconut Sep 18, 2020

Acconut commented Sep 18, 2020

nigoroll commented Sep 18, 2020

smatsson commented Sep 28, 2020

Acconut commented Sep 29, 2020


		##### [Content-Language](https://httpwg.org/specs/rfc7231.html#header.content-language)

		Clients and Servers SHOULD support the ``Content-Language`` header.

clarify Content-Type / Content-Encoding / Content-Language handling #160

Are you sure you want to change the base?

clarify Content-Type / Content-Encoding / Content-Language handling #160

Conversation

nigoroll commented Sep 9, 2020

Acconut commented Sep 16, 2020

nigoroll commented Sep 17, 2020

smatsson commented Sep 17, 2020

nigoroll commented Sep 17, 2020 • edited Loading

smatsson commented Sep 17, 2020

nigoroll commented Sep 17, 2020 • edited Loading

smatsson commented Sep 17, 2020

Acconut Sep 18, 2020

Choose a reason for hiding this comment

Acconut Sep 18, 2020

Choose a reason for hiding this comment

Acconut Sep 18, 2020

Choose a reason for hiding this comment

Acconut commented Sep 18, 2020

nigoroll commented Sep 18, 2020

smatsson commented Sep 28, 2020

Acconut commented Sep 29, 2020

nigoroll commented Sep 17, 2020 •

edited

Loading

nigoroll commented Sep 17, 2020 •

edited

Loading