Thanks for the thorough and detailed review, Brent! Responses inline.
On 15 Mar 2017, at 21:19, Brent Royal-Gordon wrote:
On Mar 15, 2017, at 3:40 PM, Itai Ferber via swift-evolution
<swift-evolution@swift.org> wrote:
Hi everyone,
The following introduces a new Swift-focused archival and
serialization API as part of the Foundation framework. We’re
interested in improving the experience and safety of performing
archival and serialization, and are happy to receive community
feedback on this work.
Thanks to all of the people who've worked on this. It's a great
proposal.
Specifically:
• It aims to provide a solution for the archival of Swift struct
and enum types
I see a lot of discussion here of structs and classes, and an example
of an enum without associated values, but I don't see any discussion
of enums with associated values. Can you sketch how you see people
encoding such types?
For example, I assume that `Optional` is going to get some special
treatment, but if it doesn't, how would you write its `encode(to:)`
method?
`Optional` values are accepted and vended directly through the API. The
`encode(_:forKey:)` methods take optional values directly, and the
`decodeIfPresent(_:forKey:)` methods vend optional values.
`Optional` is special in this way — it’s a primitive part of the
system. It’s actually not possible to write an `encode(to:)` method
for `Optional`, since the representation of null values is up to the
encoder and the format it’s working in; `JSONEncoder`, for instance,
decides on the representation of `nil` (JSON `null`). It wouldn’t be
possible to ask `nil` to encode itself in a reasonable way.
What about a more complex enum, like the standard library's
`UnicodeDecodingResult`:
enum UnicodeDecodingResult {
case emptyInput
case error
case scalarValue(UnicodeScalar)
}
Or, say, an `Error`-conforming type from one of my projects:
public enum SQLError: Error {
case connectionFailed(underlying: Error)
case executionFailed(underlying: Error, statement: SQLStatement)
case noRecordsFound(statement: SQLStatement)
case extraRecordsFound(statement: SQLStatement)
case columnInvalid(underlying: Error, key: ColumnSpecifier,
statement: SQLStatement)
case valueInvalid(underlying: Error, key: AnySQLColumnKey,
statement: SQLStatement)
}
(You can assume that all the types in the associated values are
`Codable`.)
Sure — these cases specifically do not derive `Codable` conformance
because the specific representation to choose is up to you. Two possible
ways to write this, though there are many others (I’m simplifying
these cases here a bit, but you can extrapolate this):
```swift
// Approach 1
// This produces either {"type": 0} for `.noValue`, or
// {"type": 1, "value": …} for `.associated`.
public enum EnumWithAssociatedValue : Codable {
    case noValue
    case associated(Int)

    private enum CodingKeys : CodingKey {
        case type
        case value
    }

    public init(from decoder: Decoder) throws {
        let container = try decoder.container(keyedBy: CodingKeys.self)
        let type = try container.decode(Int.self, forKey: .type)
        switch type {
        case 0:
            self = .noValue
        case 1:
            let value = try container.decode(Int.self, forKey: .value)
            self = .associated(value)
        default:
            throw …
        }
    }

    public func encode(to encoder: Encoder) throws {
        let container = encoder.container(keyedBy: CodingKeys.self)
        switch self {
        case .noValue:
            try container.encode(0, forKey: .type)
        case .associated(let value):
            try container.encode(1, forKey: .type)
            try container.encode(value, forKey: .value)
        }
    }
}

// Approach 2
// Produces `0`, `1`, or `2` for `.noValue1`, `.noValue2`, and
// `.noValue3` respectively.
// Produces {"type": 3, "value": …} and {"type": 4, "value": …} for
// `.associated1` and `.associated2`.
public enum EnumWithAssociatedValue : Codable {
    case noValue1
    case noValue2
    case noValue3
    case associated1(Int)
    case associated2(String)

    private enum CodingKeys : CodingKey {
        case type
        case value
    }

    public init(from decoder: Decoder) throws {
        if let container = try? decoder.singleValueContainer() {
            let type = try container.decode(Int.self)
            switch type {
            case 0: self = .noValue1
            case 1: self = .noValue2
            case 2: self = .noValue3
            default: throw …
            }
        } else {
            let container = try decoder.container(keyedBy: CodingKeys.self)
            let type = try container.decode(Int.self, forKey: .type)
            switch type {
            case 3:
                let value = try container.decode(Int.self, forKey: .value)
                self = .associated1(value)
            case 4:
                let value = try container.decode(String.self, forKey: .value)
                self = .associated2(value)
            default: throw …
            }
        }
    }
}
```
There are, of course, many more approaches that you could take, but
these are just two examples. The first is likely simpler to read and
comprehend, but may not be appropriate if you’re trying to optimize
for space.
I don't necessarily assume that the compiler should write conformances
to these sorts of complicated enums for me (though that would be
nice!); I'm just wondering what the designers of this feature envision
people doing in cases like these.
• protocol Codable: Adopted by types to opt into archival.
Conformance may be automatically derived in cases where all
properties are also Codable.
Have you given any consideration to supporting types which only need
to decode? That seems likely to be common when interacting with web
services.
We have. Ultimately, we decided that the introduction of several
protocols to cover encodability, decodability, and both was too much
cognitive overhead, considering the number of other types we’re also
introducing. You can always implement `encode(to:)` as `fatalError()`.
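A minimal sketch of that pattern, with hypothetical type and property names: the type gets its `init(from:)` as usual (derived here), and stubs out `encode(to:)`.

```swift
import Foundation

// A response type we only ever decode from a web service.
// Type and property names here are hypothetical.
struct SearchResult : Codable {
    let title: String
    let score: Int

    // Decoding is derived as usual; encoding is deliberately unsupported.
    func encode(to encoder: Encoder) throws {
        fatalError("SearchResult is decode-only")
    }
}

let json = "{\"title\": \"Swift\", \"score\": 42}".data(using: .utf8)!
let result = try! JSONDecoder().decode(SearchResult.self, from: json)
```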
• protocol CodingKey: Adopted by types used as keys for keyed
containers, replacing String keys with semantic types. Conformance
may be automatically derived in most cases.
• protocol Encoder: Adopted by types which can take Codable values
and encode them into a native format.
• class KeyedEncodingContainer<Key : CodingKey>: Subclasses of
this type provide a concrete way to store encoded values by
CodingKey. Types adopting Encoder should provide subclasses of
KeyedEncodingContainer to vend.
• protocol SingleValueEncodingContainer: Adopted by types which
provide a concrete way to store a single encoded value. Types
adopting Encoder should provide types conforming to
SingleValueEncodingContainer to vend (but in many cases will be able
to conform to it themselves).
• protocol Decoder: Adopted by types which can take payloads in a
native format and decode Codable values out of them.
• class KeyedDecodingContainer<Key : CodingKey>: Subclasses of
this type provide a concrete way to retrieve encoded values from
storage by CodingKey. Types adopting Decoder should provide
subclasses of KeyedDecodingContainer to vend.
• protocol SingleValueDecodingContainer: Adopted by types which
provide a concrete way to retrieve a single encoded value from
storage. Types adopting Decoder should provide types conforming to
SingleValueDecodingContainer to vend (but in many cases will be able
to conform to it themselves).
I do want to note that, at this point in the proposal, I was sort of
thinking you'd gone off the deep end modeling this. Having read the
whole thing, I now understand what all of these things do, but this
really is a very large subsystem. I think it's worth asking if some of
these types can be eliminated or combined.
In the past, the concepts of `SingleValueContainer` and `Encoder` were
not distinct — all of the methods on `SingleValueContainer` were just
part of `Encoder`. Sure, this is a simpler system, but unfortunately
promotes the wrong thing altogether. I’ll address this below.
Structured types (i.e. types which encode as a collection of
properties) encode and decode their properties in a keyed manner.
Keys may be String-convertible or Int-convertible (or both),
What does "may" mean here? That, at runtime, the encoder will test for
the preferred key type and fall back to the other one? That seems a
little bit problematic.
Yes, this is the case. A lot is left up to the `Encoder` because it can
choose to do something for its format that your implementation of
`encode(to:)` may not have considered.
If you try to encode something with an `Int` key in a string-keyed
dictionary, the encoder may choose to stringify the integer if
appropriate for the format. If not, it can reject your key, ignore the
call altogether, `preconditionFailure()`, etc. It is also perfectly
legitimate to write an `Encoder` which supports a flat encoding format
— in that case, keys are likely ignored altogether, in which case
there is no error to be had. We’d like to not arbitrarily constrain an
implementation unless necessary.
FWIW, 99.9% of the time, the appropriate thing to do is to either simply
throw an error, or `preconditionFailure()`. Nasal demons should not be
the common case. But for some encoding formats, this is appropriate.
I'm also quite worried about how `Int`-convertible keys will interact
with code synthesis. The obvious way to assign integers—declaration
order—would mean that reordering declarations would invisibly break
archiving, potentially (if the types were compatible) without breaking
anything in an error-causing way even at runtime. You could sort the
names, but then adding a new property would shift the integers of the
properties "below" it. You could hash the names, but then there's no
obvious relationship between the integers and key cases.
At the same time, I also think that using arbitrary integers is a poor
match for ordering. If you're making an ordered container, you don't
want arbitrary integers wrapped up in an abstract type. You want
adjacent integers forming indices of an eventual array. (Actually, you
may not want indices at all—you may just want to feed elements in
one at a time!)
For these exact reasons, integer keys are not produced by code
synthesis, only string keys. If you want integer keys, you’ll have to
write them yourself. :)
Integer keys are fragile, as you point out yourself, and while we’d
like to encourage their use as appropriate, they require explicit
thought and care as to their use.
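To illustrate, a hand-written key type with deliberate integer values might look like this. The string values come from the derived `String` raw-value conformance; the integer assignments are spelled out explicitly, so reordering the cases cannot silently change the encoded form.

```swift
// A CodingKey with both string and integer values. The string values
// are derived from the String raw type; the integer values are
// assigned explicitly and deliberately.
enum CodingKeys : String, CodingKey {
    case name
    case email

    var intValue: Int? {
        switch self {
        case .name:  return 1
        case .email: return 2
        }
    }

    init?(intValue: Int) {
        switch intValue {
        case 1: self = .name
        case 2: self = .email
        default: return nil
        }
    }
}
```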
So I would suggest the following changes:
* The coding key always converts to a string. That means we can
eliminate the `CodingKey` protocol and instead use `RawRepresentable
where RawValue == String`, leveraging existing infrastructure. That
also means we can call the `CodingKeys` associated type `CodingKey`
instead, which is the correct name for it—we're not talking about an
`OptionSet` here.
* If, to save space on disk, you want to also allow people to use integers
as the serialized representation of a key, we might introduce a
parallel `IntegerCodingKey` protocol for that, but every `CodingKey`
type should map to `String` first and foremost. Using a protocol here
ensures that it can be statically determined at compile time whether a
type can be encoded with integer keys, so the compiler can select an
overload of `container(keyedBy:)`.
* Intrinsically ordered data is encoded as a single-value container
of type `Array<Codable>`. (I considered having an `orderedContainer()`
method and type, but as I thought about it, I couldn't think of an
advantage it would have over `Array`.)
This is possible, but I don’t see this as necessarily advantageous
over what we currently have. In 99.9% of cases, `CodingKey` types will
have string values anyway — in many cases you won’t have to write
the `enum` yourself to begin with, but even when you do, derived
`CodingKey` conformance will generate string values on your behalf.
The only time a key will not have a string value is if the `CodingKey`
protocol is implemented manually and a value is either deliberately left
out, or there was a mistake in the implementation; in either case, there
wouldn’t have been a valid string value anyway.
/// Returns an encoding container appropriate for holding a single primitive value.
///
/// - returns: A new empty single value container.
/// - precondition: May not be called after a prior `self.container(keyedBy:)` call.
/// - precondition: May not be called after a value has been encoded through a previous `self.singleValueContainer()` call.
func singleValueContainer() -> SingleValueEncodingContainer
Speaking of which, I'm not sure about single value containers. My
first instinct is to say that methods should be moved from them to the
`Encoder` directly, but that would probably cause code duplication.
But...isn't there already duplication between the
`SingleValue*Container` and the `Keyed*Container`? Why, yes, yes there
is. So let's talk about that.
In the Alternatives Considered section of the proposal, we detail having
done just this. Originally, the requirements now on
`SingleValueContainer` sat on `Encoder` and `Decoder`.
Unfortunately, this made it too easy to do the wrong thing, and required
extra work (in comparison) to do the right thing.
When `Encoder` has `encode(_ value: Bool?)`, `encode(_ value: Int?)`,
etc. on it, it’s very intuitive to try to encode values that way:
```swift
func encode(to encoder: Encoder) throws {
// The very first thing I try to type is encoder.enc… and guess
what pops up in autocomplete:
try encoder.encode(myName)
try encoder.encode(myEmail)
try encoder.encode(myAddress)
}
```
This might look right to someone expecting to be able to encode in an
ordered fashion, which is _not_ what these methods do.
In addition, for someone expecting keyed encoding methods, this is very
confusing. Where are those methods? Why don’t these "default"
methods have keys?
The very first time that code block ran, it would
`preconditionFailure()` or throw an error, since those methods intend to
encode only one single value.
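For contrast, here is the intended use of a single-value container: a wrapper type that encodes as exactly one primitive value. The `UserID` name is hypothetical, and the container-acquisition spellings (`try`/`var`) may differ slightly from the draft API quoted above.

```swift
import Foundation

// A wrapper type that encodes as a bare Int, not as an object:
// exactly one value passes through the single-value container.
struct UserID : Codable {
    let value: Int

    init(value: Int) { self.value = value }

    init(from decoder: Decoder) throws {
        let container = try decoder.singleValueContainer()
        value = try container.decode(Int.self)
    }

    func encode(to encoder: Encoder) throws {
        var container = encoder.singleValueContainer()
        try container.encode(value)
    }
}

// [UserID(value: 7)] encodes as the JSON array [7], not [{"value": 7}].
let data = try! JSONEncoder().encode([UserID(value: 7)])
```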
open func encode<Value : Codable>(_ value: Value?, forKey key: Key) throws
open func encode(_ value: Bool?, forKey key: Key) throws
open func encode(_ value: Int?, forKey key: Key) throws
open func encode(_ value: Int8?, forKey key: Key) throws
open func encode(_ value: Int16?, forKey key: Key) throws
open func encode(_ value: Int32?, forKey key: Key) throws
open func encode(_ value: Int64?, forKey key: Key) throws
open func encode(_ value: UInt?, forKey key: Key) throws
open func encode(_ value: UInt8?, forKey key: Key) throws
open func encode(_ value: UInt16?, forKey key: Key) throws
open func encode(_ value: UInt32?, forKey key: Key) throws
open func encode(_ value: UInt64?, forKey key: Key) throws
open func encode(_ value: Float?, forKey key: Key) throws
open func encode(_ value: Double?, forKey key: Key) throws
open func encode(_ value: String?, forKey key: Key) throws
open func encode(_ value: Data?, forKey key: Key) throws
Wait, first, a digression for another issue: I'm concerned that, if
you look at the `decode` calls, there are plain `decode(…)` calls
which throw if a `nil` was originally encoded and `decodeIfPresent`
calls which return optional. The result is, essentially, that the
encoding system eats a level of optionality for its own
purposes—seemingly good, straightforward-looking code like this:
struct MyRecord: Codable {
    var id: Int?
    …

    func encode(to encoder: Encoder) throws {
        let container = encoder.container(keyedBy: CodingKey.self)
        try container.encode(id, forKey: .id)
        …
    }

    init(from decoder: Decoder) throws {
        let container = decoder.container(keyedBy: CodingKey.self)
        id = try container.decode(Int.self, forKey: .id)
        …
    }
}
Will crash. (At least, I assume that's what will happen.)
The return type of `decode(Int.self, forKey: .id)` is `Int`. I’m not
convinced that it’s possible to misconstrue that as the correct thing
to do here. How would that return a `nil` value if the value was `nil`
to begin with?
The only other method that would be appropriate is
`decodeIfPresent(Int.self, forKey: .id)`, which is exactly what you
want.
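Concretely, the `MyRecord` example decodes cleanly once the optional property is paired with `decodeIfPresent`. This is a sketch; the container-acquisition spellings may differ slightly from the draft above.

```swift
import Foundation

// MyRecord, adjusted so an absent or null "id" decodes to nil
// instead of throwing: the optional property pairs with
// decodeIfPresent on the way in and encodeIfPresent on the way out.
struct MyRecord : Codable {
    var id: Int?

    private enum CodingKeys : String, CodingKey {
        case id
    }

    func encode(to encoder: Encoder) throws {
        var container = encoder.container(keyedBy: CodingKeys.self)
        try container.encodeIfPresent(id, forKey: .id)
    }

    init(from decoder: Decoder) throws {
        let container = try decoder.container(keyedBy: CodingKeys.self)
        id = try container.decodeIfPresent(Int.self, forKey: .id)
    }
}

// Decoding "{}" succeeds; record.id is nil rather than a thrown error.
let record = try! JSONDecoder().decode(MyRecord.self, from: "{}".data(using: .utf8)!)
```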
I think we'd be better off having `encode(_:forKey:)` not take an
optional; instead, we should have `Optional` conform to `Codable` and
behave in some appropriate way. Exactly how to implement it might be a
little tricky because of nested optionals; I suppose a `none` would
have to measure how many levels of optionality there are between it
and a concrete value, and then encode that information into the data.
I think our `NSNull` bridging is doing something broadly similar right
now.
`Optional` cannot encode to `Codable` for the reasons given above. It is
a primitive type much like `Int` and `String`, and it’s up to the
encoder and the format to represent it.
How would `Optional` encode `nil`?
I know that this is not the design you would use in Objective-C, but
Swift uses `Optional` differently from how Objective-C uses `nil`.
Swift APIs consider `nil` and absent to be different things; where
they can both occur, good Swift APIs use doubled-up Optionals to be
precise about the situation. I think the design needs to be a little
different to accommodate that.
Now, back to the `SingleValue*Container`/`Keyed*Container` issue. The
list above is, frankly, gigantic. You specify a *lot* of primitives in
`Keyed*Container`; there's a lot to implement here. And then you have
to implement it all *again* in `SingleValue*Container`:
func encode(_ value: Bool) throws
func encode(_ value: Int) throws
func encode(_ value: Int8) throws
func encode(_ value: Int16) throws
func encode(_ value: Int32) throws
func encode(_ value: Int64) throws
func encode(_ value: UInt) throws
func encode(_ value: UInt8) throws
func encode(_ value: UInt16) throws
func encode(_ value: UInt32) throws
func encode(_ value: UInt64) throws
func encode(_ value: Float) throws
func encode(_ value: Double) throws
func encode(_ value: String) throws
func encode(_ value: Data) throws
This is madness.
Look, here's what we do. You have two types: `Keyed*Container` and
`Value*Container`. `Keyed*Container` looks something like this:
final public class KeyedEncodingContainer<EncoderType: Encoder, Key: RawRepresentable> where Key.RawValue == String {
    public let encoder: EncoderType

    public let codingKeyContext: [RawRepresentable where RawValue == String]
    // Hmm, we might need a CodingKey protocol after all.
    // Still, it could just be `protocol CodingKey: RawRepresentable where RawValue == String {}`

    subscript (key: Key) -> ValueEncodingContainer {
        return encoder.makeValueEncodingContainer(forKey: key)
    }
}
It's so simple, it doesn't even need to be specialized. You might even
be able to get away with combining the encoding and decoding variants
if the subscript comes from a conditional extension. `Value*Container`
*does* need to be specialized; it looks like this (modulo the
`Optional` issue I mentioned above):
Sure, let’s go with this for a moment. Presumably, then, `Encoder`
would be able to vend out both `KeyedEncodingContainer`s and
`ValueEncodingContainer`s, correct?
public protocol ValueEncodingContainer {
func encode<Value : Codable>(_ value: Value?, forKey key: Key) throws
I’m assuming that the key here is a typo, correct?
Keep in mind that combining these concepts changes the semantics of how
single-value encoding works. Right now `SingleValueEncodingContainer`
only allows values of primitive types; this would allow you to encode a
value in terms of a different arbitrarily-codable value.
func encode(_ value: Bool?) throws
func encode(_ value: Int?) throws
func encode(_ value: Int8?) throws
func encode(_ value: Int16?) throws
func encode(_ value: Int32?) throws
func encode(_ value: Int64?) throws
func encode(_ value: UInt?) throws
func encode(_ value: UInt8?) throws
func encode(_ value: UInt16?) throws
func encode(_ value: UInt32?) throws
func encode(_ value: UInt64?) throws
func encode(_ value: Float?) throws
func encode(_ value: Double?) throws
func encode(_ value: String?) throws
func encode(_ value: Data?) throws
func encodeWeak<Object : AnyObject & Codable>(_ object: Object?) throws
Same comment here.
var codingKeyContext: [CodingKey]
}
And use sites would look like:
func encode(to encoder: Encoder) throws {
let container = encoder.container(keyedBy: CodingKey.self)
try container[.id].encode(id)
try container[.name].encode(name)
try container[.birthDate].encode(birthDate)
}
For consumers, this doesn’t seem to make much of a difference. We’ve
turned `try container.encode(id, forKey: .id)` into
`try container[.id].encode(id)`.
Decoding is slightly trickier. You could either make the subscript
`Optional`, which would be more like `Dictionary` but would be
inconsistent with `Encoder` and would give the "never force-unwrap
anything" crowd conniptions, or you could add a `contains()` method to
`ValueDecodingContainer` and make `decode(_:)` throw. Either one
works.
Also, another issue with the many primitives: swiftc doesn't really
like large overload sets very much. Could this set be reduced? I'm not
sure what the logic was in choosing these particular types, but many
of them share protocols in Swift—you might get away with just this:
public protocol ValueEncodingContainer {
    func encode<Value : Codable>(_ value: Value?, forKey key: Key) throws
    func encode(_ value: Bool?, forKey key: Key) throws
    func encode<Integer: SignedInteger>(_ value: Integer?, forKey key: Key) throws
    func encode<UInteger: UnsignedInteger>(_ value: UInteger?, forKey key: Key) throws
    func encode<Floating: FloatingPoint>(_ value: Floating?, forKey key: Key) throws
    func encode(_ value: String?, forKey key: Key) throws
    func encode(_ value: Data?, forKey key: Key) throws
    func encodeWeak<Object : AnyObject & Codable>(_ object: Object?, forKey key: Key) throws

    var codingKeyContext: [CodingKey]
}
These types were chosen because we want the API to make static
guarantees about concrete types which all `Encoder`s and `Decoder`s
should support. This is somewhat less relevant for JSON, but more
relevant for binary formats where the difference between `Int16` and
`Int64` is critical.
This turns the concrete type check into a runtime check that `Encoder`
authors need to keep in mind. What’s more, any type can conform to
`SignedInteger` or `UnsignedInteger` as long as it fulfills the protocol
requirements. I can write an `Int37` type, but no encoder could make
sense of that type, and that failure is a runtime failure. If you want a
concrete example, `Float80` conforms to `FloatingPoint`; no popular
binary format I’ve seen supports 80-bit floats, though — we cannot
prevent that call statically…
Instead, we want to offer a static, concrete list of types that
`Encoder`s and `Decoder`s must be aware of, and that consumers have
guarantees about support for.
To accommodate my previous suggestion of using arrays to represent
ordered encoded data, I would add one more primitive:
func encode(_ values: [Codable]) throws
Collection types are purposefully not primitives here:
* If `Array` is a primitive, but does not conform to `Codable`, then you
cannot encode `Array<Array<Codable>>`.
* If `Array` is a primitive, and conforms to `Codable`, then there may
be ambiguity between `encode(_ values: [Codable])` and `encode(_ value:
Codable)`.
* Even in cases where there are not, inside of `encode(_ values:
[Codable])`, if I call `encode([[1,2],[3,4]])`, you’ve lost type
information about what’s contained in the array — all you see is
`Codable`.
* If you change it to `encode<Value : Codable>(_ values: [Value])` to
compensate for that, you still cannot infinitely recurse on what type
`Value` is. Try it with `encode([[[[1]]]])` and you’ll see what I
mean; at some point the inner types are no longer preserved.
(Also, is there any sense in adding `Date` to this set, since it needs
special treatment in many of our formats?)
We’ve considered adding `Date` to this list. However, this means that
any format that is a part of this system needs to be able to make a
decision about how to format dates. Many binary formats have no native
representations of dates, so this is not necessarily a guarantee that
all formats can make.
Looking for additional opinions on this one.
Encoding Container Types
For some types, the container into which they encode has meaning.
Especially when coding for a specific output format (e.g. when
communicating with a JSON API), a type may wish to explicitly encode
as an array or a dictionary:
// Continuing from before
public protocol Encoder {
    func container<Key : CodingKey>(keyedBy keyType: Key.Type, type containerType: EncodingContainerType) -> KeyedEncodingContainer<Key>
}

/// An `EncodingContainerType` specifies the type of container an `Encoder` should use to store values.
public enum EncodingContainerType {
    /// The `Encoder`'s preferred container type; equivalent to either `.array` or `.dictionary` as appropriate for the encoder.
    case `default`

    /// Explicitly requests the use of an array to store encoded values.
    case array

    /// Explicitly requests the use of a dictionary to store encoded values.
    case dictionary
}
I see what you're getting at here, but I don't think this is fit for
purpose, because arrays are not simply dictionaries with integer
keys—their elements are adjacent and ordered. See my discussion
earlier about treating inherently ordered containers as simply
single-value `Array`s.
You’re right in that arrays are not simply dictionaries with integer
keys, but I don’t see where we make that assertion here.
If an `Encoder` is asked for an array and is provided with integer keys,
it can use those keys as indices. If the keys are non-contiguous, the
intervening spaces can be filled with null values (if appropriate for
the format; if not, the operation can error out).
The way these containers are handled is completely up to the `Encoder`.
An `Encoder` producing an array may choose to ignore keys altogether and
simply produce an array from the values given to it sequentially. (This
is not recommended, but possible.)
Nesting
In practice, some types may also need to control how data is nested
within their container, or potentially nest other containers within
their container. Keyed containers allow this by returning nested
containers of differing key types:
[snip]
This can be common when coding against specific external data
representations:
// User type for interfacing with a specific JSON API. JSON API
// expects encoding as {"id": ..., "properties": {"name": ...,
// "timestamp": ...}}. Swift type differs from encoded type, and
// encoding needs to match a spec:
This comes very close to addressing—but doesn't quite address—something
else I'm concerned about. What's the preferred way to handle differences
in serialization to different formats?
Here's what I mean: Suppose I have a BlogPost model, and I can both
fetch and post BlogPosts to a cross-platform web service, and store
them locally. But when I fetch and post remotely, I need to conform to
the web service's formats; when I store an instance locally, I have a
freer hand in designing my storage, and perhaps need to store some
extra metadata. How do you imagine handling that sort of situation? Is
the answer simply that I should use two different types?
This is a valid concern, and one that should likely be addressed.
Perhaps the solution is to offer a `userInfo : [UserInfoKey : Any]`
(`UserInfoKey` being a `String`-`RawRepresentable` struct or similar) on
`Encoder` and `Decoder` set at the top-level to allow passing this type
of contextual information from the top level down.
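A sketch of what that could look like at a use site. The key name and the "audience" convention are entirely hypothetical, spelled here with a `CodingUserInfoKey`-style key:

```swift
import Foundation

// Contextual userInfo sketch: the same type encodes differently
// depending on a hypothetical "audience" value set at the top level.
struct BlogPost : Codable {
    let title: String
    let localDraftPath: String?

    // Hypothetical key for passing context from the top level down.
    static let audienceKey = CodingUserInfoKey(rawValue: "audience")!

    private enum CodingKeys : String, CodingKey {
        case title
        case localDraftPath
    }

    func encode(to encoder: Encoder) throws {
        var container = encoder.container(keyedBy: CodingKeys.self)
        try container.encode(title, forKey: .title)
        // Only include local-only metadata when encoding for local storage.
        if encoder.userInfo[BlogPost.audienceKey] as? String == "local" {
            try container.encodeIfPresent(localDraftPath, forKey: .localDraftPath)
        }
    }
}

let post = BlogPost(title: "Hi", localDraftPath: "/drafts/1")

let localEncoder = JSONEncoder()
localEncoder.userInfo[BlogPost.audienceKey] = "local"
let localJSON = String(data: try! localEncoder.encode(post), encoding: .utf8)!

let remoteJSON = String(data: try! JSONEncoder().encode(post), encoding: .utf8)!
```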
To remedy both of these points, we adopt a new convention for
inheritance-based coding — encoding super as a sub-object of self:
[snip]
try super.encode(to: container.superEncoder())
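A concrete sketch of this convention (class names are hypothetical; the decoding side uses the matching `superDecoder()` for symmetry). The payload comes out shaped like {"id": …, "super": {"name": …}}.

```swift
import Foundation

// Person encodes its own properties normally.
class Person : Codable {
    var name: String
    init(name: String) { self.name = name }

    private enum CodingKeys : String, CodingKey { case name }

    required init(from decoder: Decoder) throws {
        let container = try decoder.container(keyedBy: CodingKeys.self)
        name = try container.decode(String.self, forKey: .name)
    }

    func encode(to encoder: Encoder) throws {
        var container = encoder.container(keyedBy: CodingKeys.self)
        try container.encode(name, forKey: .name)
    }
}

// Employee encodes its own properties, then asks super to encode
// into a nested sub-object via superEncoder().
class Employee : Person {
    var employeeID: Int
    init(name: String, employeeID: Int) {
        self.employeeID = employeeID
        super.init(name: name)
    }

    private enum CodingKeys : String, CodingKey {
        case employeeID = "id"
    }

    required init(from decoder: Decoder) throws {
        let container = try decoder.container(keyedBy: CodingKeys.self)
        employeeID = try container.decode(Int.self, forKey: .employeeID)
        try super.init(from: container.superDecoder())
    }

    override func encode(to encoder: Encoder) throws {
        var container = encoder.container(keyedBy: CodingKeys.self)
        try container.encode(employeeID, forKey: .employeeID)
        try super.encode(to: container.superEncoder())
    }
}

let payload = try! JSONEncoder().encode(Employee(name: "Joe", employeeID: 42))
```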
This seems like a good idea to me. However, it brings up another
point: What happens if you specify a superclass of the originally
encoded class? In other words:
let joe = Employee(…)
let payload = try SomeEncoder().encode(joe)
…
let someone = try SomeDecoder().decode(Person.self, from: payload)
print(type(of: someone)) // Person, Employee, or does `decode(_:from:)` fail?
We don’t support this type of polymorphic decoding. Because no type
information is written into the payload (there’s no safe way to do
this that is not currently brittle), there’s no way to tell what’s
in there prior to decoding it (and there wouldn’t be a reasonable way
to trust what’s in the payload to begin with).
We’ve thought through this a lot, but in the end we’re willing to
make this tradeoff for security primarily, and simplicity secondarily.
The encoding container types offer overloads for working with and
processing the API's primitive types (String, Int, Double, etc.).
However, for ease of implementation (both in this API and others), it
can be helpful for these types to conform to Codable themselves.
Thus, along with these overloads, we will offer Codable conformance
on these types:
[snip]
Since Swift's function overload rules prefer more specific functions
over generic functions, the specific overloads are chosen where
possible (e.g. encode("Hello, world!", forKey: .greeting) will choose
encode(_: String, forKey: Key) over encode<T : Codable>(_: T, forKey:
Key)). This maintains performance over dispatching through the
Codable existential, while allowing for the flexibility of fewer
overloads where applicable.
How important is this performance? If the answer is "eh, not really
that much", I could imagine a setup where every "primitive" type
eventually represents itself as `String` or `Data`, and each
`Encoder`/`Decoder` can use dynamic type checks in
`encode(_:)`/`decode(_:)` to define whatever "primitives" it wants for
its own format.
Does this imply that `Int32` should decide how it’s represented as
`Data`? What if an encoder forgets to implement that?
Again, we want to provide a static list of types that `Encoder`s know
they _must_ handle, and thus, consumers have _guarantees_ that those
types are supported.
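The overload-resolution behavior described above is easy to see in miniature; this toy example is independent of the encoding API:

```swift
// Swift's overload resolution prefers a more specific function over a
// generic one at compile time, so the String call below binds to the
// concrete overload and the Int call falls back to the generic.
func describe(_ value: String) -> String { return "specific" }
func describe<T>(_ value: T) -> String { return "generic" }

let a = describe("Hello, world!")  // picks the String overload
let b = describe(42)               // falls back to the generic
```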
* * *
One more thing. In Alternatives Considered, you present two
designs—#2 and #3—where you generate a separate instance which
represents the type in a fairly standardized way for the encoder to
examine.
This design struck me as remarkably similar to the reflection system
and its `Mirror` type, which is also a separate type describing an
original instance. My question was: Did you look at the reflection
system when you were building this design? Do you think there might be
anything that can be usefully shared between them?
We did, quite a bit, and spent a lot of time considering reflection and
its place in our design. Ultimately, the reflection system does not
currently have the features we would need, and although the Swift team
has expressed desire to improve the system considerably, it’s not
currently a top priority, AFAIK.
Thank you for your attention. I hope this was helpful!
Thanks for all of these comments! Looking to respond to your other email
soon.
--
Brent Royal-Gordon
Architechies
_______________________________________________
swift-evolution mailing list
swift-evolution@swift.org
https://lists.swift.org/mailman/listinfo/swift-evolution