Parallelize bootstrapping download attempts

changed milestone to %Arti 1.0.0: Ready for production use

changed time estimate to 8h

added Icebox label

added 1 deleted label

added Sponsor 119 label and removed 1 deleted label

Could I be assigned this?

If yes, any pointers on where to start would be great.

Sure, done!

Here's the idea: right now, when we start from zero trying to download a consensus, we launch a single attempt from a single fallback directory. (The actual launching happens in the tor-dirmgr crate.) Instead we should perhaps launch multiple parallel attempts from different fallbacks. As soon as one starts to give us data, then we can cancel the others.

Our current downloading logic happens from a state machine driver in tor_dirmgr::bootstrap, which asks a state object for a list of document IDs that it's missing, and then tries to find them in the cache, and then tries to download what it's missing. The downloads get launched from the fetch_multiple() function in that module.
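To make that flow concrete, here is a rough sketch of one pass of such a driver: ask the state what is missing, fill what we can from the cache, then download the rest. This is not the real tor_dirmgr::bootstrap code. `DocId`, `State`, `missing_docs`, and `advance` are hypothetical names invented for illustration, and the download step is a plain closure standing in for whatever fetch_multiple() does.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical stand-in for a document identifier.
type DocId = String;

/// Hypothetical stand-in for the state object: it knows which
/// documents it wants and which ones it already holds.
struct State {
    wanted: HashSet<DocId>,
    have: HashMap<DocId, String>,
}

impl State {
    /// List the IDs of documents we want but do not have yet (sorted).
    fn missing_docs(&self) -> Vec<DocId> {
        let mut ids: Vec<DocId> = self
            .wanted
            .iter()
            .filter(|id| !self.have.contains_key(*id))
            .cloned()
            .collect();
        ids.sort();
        ids
    }

    /// Record a document that we loaded or downloaded.
    fn add(&mut self, id: DocId, body: String) {
        self.have.insert(id, body);
    }
}

/// One pass of the driver: try the cache first, then download
/// whatever is still missing. Returns how many documents the
/// download step delivered.
fn advance<F>(state: &mut State, cache: &HashMap<DocId, String>, mut download: F) -> usize
where
    F: FnMut(&[DocId]) -> Vec<(DocId, String)>,
{
    // Step 1: satisfy as much as possible from the local cache.
    for id in state.missing_docs() {
        if let Some(body) = cache.get(&id) {
            state.add(id, body.clone());
        }
    }
    // Step 2: download only what the cache could not provide.
    let still_missing = state.missing_docs();
    if still_missing.is_empty() {
        return 0;
    }
    let fetched = download(&still_missing);
    let count = fetched.len();
    for (id, body) in fetched {
        state.add(id, body);
    }
    count
}
```

The point of the sketch is only the ordering (state, then cache, then network); the parallelism proposed here would live inside the download step.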

I think we'd want to do this change as follows:

Add a new field to DocQuery::LatestConsensus to indicate the desired amount of parallelism. This should probably be 1 if we already have a verified consensus, and a configurable value otherwise.
Have this field get set when DocQuery::LatestConsensus is generated.
Alter query_into_requests so that it marks which request should be attempted in parallel.
Write a function that launches a directory request in parallel to different independent directory caches, waits till one succeeds, and cancels the others. (Perhaps it should cancel the others once the first one has reached a given level of completion.)
Tie all these functions together, and write a bunch of tests.
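The steps above could look roughly like the following. Treat it as a sketch under heavy assumptions: `QuerySketch`, `consensus_query`, and `race_download` are made-up names, not existing Arti types or functions, and the real code is async while this uses plain threads plus a shared cancel flag so that it runs with only the standard library.

```rust
use std::sync::atomic::{AtomicBool, Ordering};
use std::sync::{mpsc, Arc};
use std::thread;

/// Hypothetical, simplified query type (NOT the real DocQuery):
/// the consensus variant carries the desired parallelism.
#[derive(Debug, Clone, PartialEq)]
#[allow(dead_code)]
enum QuerySketch {
    LatestConsensus { parallelism: usize },
    Other,
}

/// Choose the parallelism when the query is generated: 1 if we
/// already have a verified consensus, else the configured value.
fn consensus_query(have_verified_consensus: bool, configured: usize) -> QuerySketch {
    let parallelism = if have_verified_consensus {
        1
    } else {
        configured.max(1)
    };
    QuerySketch::LatestConsensus { parallelism }
}

/// Launch the same request at up to `parallelism` caches, return the
/// first (cache, body) that succeeds, and tell the rest to stop.
/// `fetch` is an assumed callback that should poll `cancel` and give
/// up early once it is set.
fn race_download<F>(caches: &[&str], parallelism: usize, fetch: F) -> Option<(String, String)>
where
    F: Fn(&str, &AtomicBool) -> Option<String> + Send + Sync + 'static,
{
    let fetch = Arc::new(fetch);
    let cancel = Arc::new(AtomicBool::new(false));
    let (tx, rx) = mpsc::channel::<(String, String)>();
    let mut handles = Vec::new();

    for cache in caches.iter().take(parallelism.max(1)) {
        let fetch = Arc::clone(&fetch);
        let cancel = Arc::clone(&cancel);
        let tx = tx.clone();
        let cache: String = (*cache).to_owned();
        handles.push(thread::spawn(move || {
            if let Some(body) = (*fetch)(cache.as_str(), &*cancel) {
                // The receiver may already be gone; that is fine.
                let _ = tx.send((cache, body));
            }
        }));
    }
    // Drop our own sender so recv() fails once every attempt has failed.
    drop(tx);

    let winner = rx.recv().ok();
    // Ask the remaining attempts to stop, then wait for them to exit.
    cancel.store(true, Ordering::SeqCst);
    for handle in handles {
        let _ = handle.join();
    }
    winner
}
```

Note that this version cancels only after one attempt has fully succeeded. Cancelling once the first attempt reaches some level of completion would need the fetch callback to report progress, which the sketch does not model.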

Happy hacking!

assigned to @dagon

This could be related to #88

added Performance Impact label

added Roadmap::Future label and removed Icebox label
