Stream directory downloads to reduce latency and save RAM
Here's a tricky one, but it has the potential to save time and memory.
Right now, the `tor-dirclient` API downloads the entire requested object to RAM, decompressing it as it goes. That's fine for objects like consensus documents, where we need the whole thing decompressed anyway, but it's less good for objects like microdescriptors, where we'd like to handle each one as soon as we receive it, and where we get a lot of them in a single document. As things stand, we keep something like 10MB of temporary string data around even though everything but the most recent 3-4KB is already parsable.
It's also bad for latency, since we can end up in a position where the information we need to become bootstrapped is sitting in a download buffer, waiting for the rest of the download to complete.
We could save intermediate memory and latency by refactoring our downloader code to (optionally) return a bytestream of downloaded information, and then writing code to convert that bytestream into a `Stream` of `Microdesc` or `AuthCert` objects.
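A hedged sketch of the shape this might take, using the `futures` crate. Every name here (`DirRequest`, `Microdesc`, `download_streaming`, `parse_microdescs`, the chunk type) is invented for illustration and is not tor-dirclient's actual API:

```rust
// Hypothetical API sketch; none of these names exist in tor-dirclient today.
use futures::stream::{self, Stream};
use std::io;

/// Stand-ins for real arti types.
struct DirRequest;
struct Microdesc;

/// Instead of buffering the whole response, the downloader could
/// (optionally) hand back the decompressed body as chunks arrive.
fn download_streaming(_req: DirRequest) -> impl Stream<Item = io::Result<Vec<u8>>> {
    stream::empty() // placeholder body
}

/// A separate layer would turn that bytestream into parsed objects,
/// yielding each `Microdesc` as soon as its text is complete.
fn parse_microdescs(
    _bytes: impl Stream<Item = io::Result<Vec<u8>>>,
) -> impl Stream<Item = Microdesc> {
    stream::empty() // placeholder body
}
```

Keeping the bytestream layer separate from the parsing layer would let the same downloader serve consensuses (buffer everything) and microdescriptors (parse incrementally) without duplicating the transfer logic.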
This would require significant refactoring in `bootstrap.rs`.
Found while doing #87
Edited to add: One caveat here. Many prefixes of a microdescriptor are themselves valid microdescriptors. Thus, when parsing a stream of microdescriptors, you can't safely parse the last one until the stream is finished.
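Here's a minimal sketch of a converter that respects this caveat, again with invented names; the naive split-on-`onion-key` logic is a stand-in for real parsing. Complete microdescriptors are yielded as soon as the start of the next one arrives, and the final buffered item is flushed only at end-of-input:

```rust
use futures::stream::{self, Stream, StreamExt};

struct Microdesc {
    text: String,
}

/// Convert decompressed byte chunks into microdescriptors. A document is
/// treated as complete only when the start of the *next* one ("onion-key"
/// at the beginning of a line) appears, or when the input ends.
fn microdescs_from_chunks(
    chunks: impl Stream<Item = Vec<u8>>,
) -> impl Stream<Item = Microdesc> {
    // Wrap each chunk in Some(..) and append a None sentinel, so the scan
    // below can tell when the input is exhausted and flush the last item.
    let chunks = chunks.map(Some).chain(stream::once(async { None::<Vec<u8>> }));
    chunks
        .scan(String::new(), |buf, chunk| {
            let mut complete = Vec::new();
            match chunk {
                Some(bytes) => {
                    buf.push_str(&String::from_utf8_lossy(&bytes));
                    // Split off every microdescriptor whose successor has
                    // started to arrive; those are safe to parse now.
                    while let Some(pos) = buf.find("\nonion-key") {
                        let rest = buf.split_off(pos + 1);
                        let text = std::mem::replace(buf, rest);
                        complete.push(Microdesc { text });
                    }
                }
                None => {
                    // End of input: only now is the last item complete.
                    if !buf.is_empty() {
                        complete.push(Microdesc {
                            text: std::mem::take(buf),
                        });
                    }
                }
            }
            futures::future::ready(Some(complete))
        })
        .flat_map(stream::iter)
}
```

The `None` sentinel is what makes the caveat tractable: without an explicit end-of-input signal, the scan has no way to know whether the bytes in its buffer are the whole last microdescriptor or merely a valid prefix of it.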
Edited to add: Another application of this approach: we have some interest in being able to reject consensus documents early if their first 1KB describes a consensus we wouldn't use.
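A hedged sketch of what that check might look like, with invented names; the only real detail assumed is that a v3 consensus begins with a `network-status-version 3` line:

```rust
use futures::stream::{Stream, StreamExt};

/// Buffer roughly the first 1KB of a consensus download and bail out
/// early if the preamble isn't something we could use. (A real version
/// would also check flavor, validity dates, and so on, and would hand
/// the buffered prefix on to the parser rather than just returning it.)
async fn reject_consensus_early(
    chunks: &mut (impl Stream<Item = Vec<u8>> + Unpin),
) -> Result<String, &'static str> {
    let mut head = String::new();
    while head.len() < 1024 {
        match chunks.next().await {
            Some(chunk) => head.push_str(&String::from_utf8_lossy(&chunk)),
            None => break,
        }
    }
    if !head.starts_with("network-status-version 3") {
        return Err("doesn't look like a v3 consensus; abandon the download");
    }
    Ok(head)
}
```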