MultipartKit V5 #100

ptoffy · 2024-10-07T18:52:36Z

No description provided.

adam-fowler

Some initial comments.

Sources/MultipartKit/MultipartParser+AsyncStream.swift

Sources/MultipartKit/MultipartParser.swift

Sources/MultipartKit/MultipartParser+AsyncStream.swift

Sources/MultipartKit/MultipartParser.swift

ptoffy · 2024-10-30T12:05:04Z

@adam-fowler Do you want to take a look at this again and see if stuff makes more sense now? I added in some binary data (which even contains hex-CRLF 😄) to the tests so it should be able to parse anything now

ptoffy · 2024-11-18T20:10:28Z

@Joannis @simonjbeaumont @czechboy0 pulling you into this so you can take a look if you want

Package.swift

Sources/MultipartKit/FormDataDecoder/FormDataDecoder.SingleValueContainer.swift

Joannis · 2024-11-19T08:32:52Z

Sources/MultipartKit/FormDataDecoder/FormDataDecoder.swift

-    public func decode<D: Decodable>(_ decodable: D.Type, from data: String, boundary: String) throws -> D {
-        try decode(D.self, from: ByteBuffer(string: data), boundary: boundary)
+    public func decode<D: Decodable>(_ decodable: D.Type, from string: String, boundary: String) throws -> D {
+        try decode(D.self, from: Array(string.utf8), boundary: boundary)


This makes a copy from string, can't we use withContiguousMemoryIfAvailable?

Mhh I don't think that would work any better because we need the Collection there. We can't pass in the raw bytes and if we're copying them to an array we're still making a copy at that point right?

Since you're doing an .append on the parser you're right that you're already making a copy. Except right now you're making two copies of the same data, and each time you're also allocating space for that data.

No, I mean we can't pass the raw pointer to the decode method because it expects the collection of bytes, so this can't be done

try string.utf8.withContiguousStorageIfAvailable { bytes in decode(D.self, from: bytes, boundary: boundary) } ?? decode(D.self, from: Array(string.utf8), boundary: boundary)

And if we were to do something like

if let bytes = string.utf8.withContiguousStorageIfAvailable(Array.init) { try decode(D.self, from: bytes, boundary: boundary) } else { try decode(D.self, from: Array(string.utf8), boundary: boundary) }

we're still initialising an array with the raw bytes so I think this doesn't really save us a copy.
Unless you mean a different way of using withContiguousStorageIfAvailable

Joannis · 2024-11-19T08:35:11Z

Sources/MultipartKit/MultipartParser+parse.swift

+        var currentBody = Body()
+
+        // Append data to the parser and process the sections
+        parser.append(buffer: data)


Is it necessary to copy data out before parsing?

This way we avoid a whole bunch of needMoreData returns from the parser

Could it be restructured to be parseOrAppend then?

if parserBuffer.isEmpty { let result = parser.parse(data) if result == .needMoreData { parser.append(data) } } else { parser.append(data) parser.parse() }

But why? We're just doing that once at the beginning, this is the sync parse

adam-fowler

Generic parameter changes look good

Sources/MultipartKit/FormDataEncoder/FormDataEncoder.swift

Sources/MultipartKit/FormDataEncoder/Storage.swift

Sources/MultipartKit/FormDataDecoder/FormDataDecoder+Decoder.swift

Sources/MultipartKit/FormDataEncoder/FormDataEncoder+Encoder.swift

Sources/MultipartKit/FormDataEncoder/FormDataEncoder+KeyedContainer.swift

adam-fowler · 2024-11-19T10:43:13Z

Sources/MultipartKit/MultipartParserAsyncSequence.swift

+                }
+            }
+        }
+    }


As I understand it the user can receive a body as multiple MultipartSections if the underlying AsyncSequence has broken that body up. This is great as we don't want to pay the memory for large bodies if we can. But there are situations where we want to ensure we have a complete body section eg a block of data we want to run a JSON decode on. Is it possible to add a helper function to the Iterator to do this? eg Iterator.collectBody(upTo: memoryLimit) which returns a header, complete body section

There's currently a parse method on MultipartParser which loads the input all at once. I'm guessing you mean something of a middle ground between this and stream parsing? E.g just for one part of the message

Yes just one part of the message eg I have a Multipart message with a zip file in there plus some metadata. I want to save the zip files to disk (using the least amount of memory possible ie streaming it) but parse the metadata with Codable so need the whole of it in memory.

Start making the parser async

fc69abc

ptoffy added the semver-major Breaking changes label Oct 7, 2024

ptoffy self-assigned this Oct 7, 2024

ptoffy added 5 commits October 21, 2024 10:28

Make header parsing work

00d854e

Add body parsing support

00bc9d4

Housekeeping

232c2a8

Add more complex example test

86414b7

Move error throwing up one level

9822834

adam-fowler reviewed Oct 23, 2024

View reviewed changes

Apply suggestions and add binary data test

122018f

ptoffy requested a review from adam-fowler October 30, 2024 12:02

ptoffy added 3 commits November 5, 2024 15:23

Add sync parsing and serialising

64a014c

Wip

d6c26c3

Make encoding work again

d9a056e

ptoffy force-pushed the v5 branch from ad06379 to d9a056e Compare November 7, 2024 09:54

ptoffy added 2 commits November 7, 2024 10:55

Start generifying stuff

fee44de

Make encoders work with generics

e7c852a

ptoffy mentioned this pull request Nov 18, 2024

Compiler crash in Swift Testing swiftlang/swift#77674

Open

Finish up en/decoding

93e27c9

ptoffy marked this pull request as ready for review November 18, 2024 15:54

ptoffy requested review from 0xTim and gwynne as code owners November 18, 2024 15:54

Remove NIO and add some docs

2e683dd

ptoffy requested review from czechboy0 and Joannis and removed request for adam-fowler November 18, 2024 20:06

Joannis reviewed Nov 19, 2024

View reviewed changes

Fix imports and rename some files

eabe133

Fix imports again

c230a4a

adam-fowler reviewed Nov 19, 2024

View reviewed changes

Remove unnecessary Sendable conformances

b6b647e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MultipartKit V5 #100

MultipartKit V5 #100

ptoffy commented Oct 7, 2024

adam-fowler left a comment

ptoffy commented Oct 30, 2024

ptoffy commented Nov 18, 2024

Joannis Nov 19, 2024

ptoffy Nov 19, 2024

Joannis Nov 19, 2024

ptoffy Nov 19, 2024 •

edited

Loading

Joannis Nov 19, 2024

ptoffy Nov 19, 2024

Joannis Nov 19, 2024

ptoffy Nov 19, 2024

adam-fowler left a comment

adam-fowler Nov 19, 2024

ptoffy Nov 19, 2024

adam-fowler Nov 19, 2024

MultipartKit V5 #100

Are you sure you want to change the base?

MultipartKit V5 #100

Conversation

ptoffy commented Oct 7, 2024

adam-fowler left a comment

Choose a reason for hiding this comment

ptoffy commented Oct 30, 2024

ptoffy commented Nov 18, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ptoffy Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

adam-fowler left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ptoffy Nov 19, 2024 •

edited

Loading