Episode #129: Parsing and Performance: Protocols

Parsing and Performance: Protocols

Episode #129 • Dec 14, 2020 • Subscriber-Only

The performance gains we have made with the parser type have already been super impressive, but we can take things even further. We will explore the performance characteristics of closures using the time profiler and make some changes to how we define parsers to unlock even more speed.

Previous episode

Parsing and Performance: Combinators

Parsing and Performance: Protocols

Introduction
0:05
Escaping closure overhead
1:30
Eliminating overhead via protocols
8:37
A parser protocol
14:19
Parser protocol benchmarks
25:14
Next time: comparing a complex parser
37:00

References

Downloads

Next episode

Parsing and Performance: The Point

Locked

Unlock This Episode

Our Free plan includes 1 subscriber-only episode of your choice, plus weekly updates from our newsletter.

Sign in with GitHub

Introduction

Pretty incredible. More than 10 times faster than the substring parser. This really does show that the performance gains to be had by working on UTF-8 can be truly substantial. This is the difference of being able to process more than 50 megabytes of logs per second, or being able to process a measly 4 megabytes of logs per second.

So this is pretty huge. We have found that our parser type is so general and so composable that we are not only able to parse at the low UTF-8 level in order to get huge performance gains, but we can also fluidly move between abstraction levels allowing us to choose the right balance of correctness and speed. This is absolutely incredible, and honestly it’s not something we’ve really ever seen in other parsing libraries.

And as hard as it may be to believe, it gets even better. There is even one more change we can make to our parser library that unlocks the final level of performance, and this will bring the performance of our parsers within a very slim margin of hand-rolled, ad hoc, imperative parsers, which are widely believed to be the fast way to write parsers even if they are a bit messy.

To see where this last performance gain could be hiding, let’s run one of our benchmarks in the time profile instrument and see what it exposes to us.

Escaping closure overhead

References

Why Combine has so many Publisher types
Thomas Visser • Jul 4, 2019
A detailed article on the technique of “operator fusion” that Combine employs.
https://www.thomasvisser.me/2019/07/04/combine-types/
An operator fusion primer
Jasdev Singh • Apr 1, 2020
A detailed article on the technique of “operator fusion” that Combine employs.
https://jasdev.me/fusion-primer
swift-benchmark
Google • Mar 13, 2020
A Swift library for benchmarking code snippets, similar to google/benchmark.
http://github.com/google/swift-benchmark
UTF-8
Michael Ilseman • Mar 20, 2019
Swift 5 made a fundamental change to the String API, making the preferred encoding UTF-8 instead of UTF-16. This brings many usability and performance improves to Swift strings.
https://swift.org/blog/utf8-string/
Strings in Swift 4
Ole Begemann • Nov 27, 2017
An excerpt from the Advanced Swift that provides a deep discussion of the low-level representations of Swift strings. Although it pre-dates the transition of strings to UTF-8 in Swift 5 it is still a factually correct accounting of how to work with code units in strings.
https://oleb.net/blog/2017/11/swift-4-strings/
Improve performance of Collection.removeFirst(_:) where Self == SubSequence
Stephen Celis • Jul 28, 2020
While researching the string APIs for this episode we stumbled upon a massive inefficiency in how Swift implements removeFirst on certain collections. This PR fixes the problem and turns the method from an O(n) operation (where n is the length of the array) to an O(k) operation (where k is the number of elements being removed).
https://github.com/apple/swift/pull/32451

Downloads

Sample code

0129-parsing-performance-pt3

Get started with our free plan

Our free plan includes 1 subscriber-only episode of your choice, access to 72 free episodes with transcripts and code samples, and weekly updates from our newsletter.

Sign up for free →

View plans and pricing