Draft: Simplify BridgeLine struct and make it variable length

assigned to @cohosh

requested review from @meskio

Looks good. I like how protobuf simplifies the encoding/decoding. I left a comment, but I think is good to merge it if you prefer to ignore my comment.

approved this merge request

I have one more change I want to make to this. We got some feedback from Ian and Vecna that it would be nice if the uid_fingerprint was in fact the full bridge fingerprint. If we go that route, I want to make the change before merging/deploying this MR to avoid creating and tagging a new protobug entry.

Just added one commit to deal with this (the rest is unchanged): 287ee238

I'd like @onyinyang's opinion on this, since I don't recall the reason for going with a shortened hash of the fingerprint in the first place. Any reason not to just stick with the original fingerprint string we get from the resource parser?

The fingerprint should be 20 bytes, right? (So a 40-character hex string?) Should they be generated as 20-byte arrays converted to hex strings in the random() functions in bridge_table and request_handler?

The way this is implemented, there are no assumptions on how long the fingerprint is. And since random() is used for testing purposes, I don't think it matters too much.

That being said, I was just being lazy here

I'm going to just double check this but if I recall correctly, I was trying to avoid using strings/changing the bridgeline structure too much at the time and also ran into issues translating the fingerprint from go to rust, but it didn't matter for the uid_fingerprint if it was an exact match, as long as it would be unique for each bridge. The uid_fingerprint is important for identifying bridges in the bridgetable for updates/replacements/blockages but as long as this works as expected, I'm pretty sure it's fine to use the original fingerprint string as the uid_fingerprint.

resolved all threads

added 9 commits

90b0228c...c7aa2d2e - 2 commits from branch tpo/anti-censorship:main
20f90313 - Add tests for parsing bridgelines
80d146da - Decouple bucket encoding from table entry preparation
65cfdd2b - Simplify BridgeLine struct
0bb9603a - Modify BridgeLine and EncryptedTableEntry for protobufs
9359e266 - Use protobufs for encrypted bridge table
8135fb0f - Add test to check bridge table encryption
1a76fd79 - Use Vec<(_,_)> instead of HashMap<_,_> for bridge params

Compare with previous version

added 1 commit

287ee238 - Keep uid_fingerprint as String from resource parser

Compare with previous version

added 11 commits

287ee238...39531bd9 - 3 commits from branch tpo/anti-censorship:main
f1fc5f45 - Add tests for parsing bridgelines
cd43dd9a - Decouple bucket encoding from table entry preparation
c49cafe3 - Simplify BridgeLine struct
d8842075 - Modify BridgeLine and EncryptedTableEntry for protobufs
29ba0956 - Use protobufs for encrypted bridge table
d7e568d6 - Add test to check bridge table encryption
aaf0972d - Use Vec<(_,_)> instead of HashMap<_,_> for bridge params
cc734bed - Keep uid_fingerprint as String from resource parser

Compare with previous version

I wasn't able to give feedback on the MR earlier due to being afk for a while. In general, I support this idea and think the simplification of the bridge line and move to prost are good changes, however, I'm a bit concerned about the amount of information that will be leaked about Lox's bridge allocation and bucket status with the change from fixed length bridge lines.

Ian mentioned (in the aforementioned feedback) that this change would reveal the types of bridges in the bridge table, how they are allocated across buckets in the bridge table and the distribution of open-entry/trusted buckets. This will still not allow an adversary to learn which buckets are distributed to which users, but certainly leaks more information than we had originally considered and opens some additional side channels that we hadn't previously considered the security impacts of.

The way this is currently implemented, anyone that receives the encrypted bridge table will also be able to see the status of each bucket since the bucket reachability token is appended only to unblocked buckets. A patient adversary could collect this information from the lox authority on a regular basis to see how reachability of buckets, as well as the number of bridges/bucket, changes over time. This could provide useful feedback to an adversary about how their blocking strategy is working, as well as how long it takes us to notice and update a blockage. I think at least this issue could be mitigated by appending a dummy reachability token to each blocked bucket but we may also want to consider a different strategy for the bucket encoding itself to osbfuscate the makeup of buckets.

This may not be the extent of what an adversary can do with this information but I think we probably would want to have a bit more control over how we publicly share this information.

Thanks, I won't merge it in the current state. I've been thinking about how to make the encoding fixed length while keeping the flexibility to allow this length to change.

added 184 commits

cc734bed...ca30ff27 - 171 commits from branch tpo/anti-censorship:main
ca30ff27...6cd83025 - 3 earlier commits
d9cfd937 - Early continue to remove else block indentation
d8d6489b - De-duplicate some code in bridge_blocked
05aabdd6 - Add tests for parsing bridgelines
9023f53f - Decouple bucket encoding from table entry preparation
46fa4b6b - Simplify BridgeLine struct
507fd0b3 - Modify BridgeLine and EncryptedTableEntry for protobufs
01cc50ae - Use protobufs for encrypted bridge table
f865c813 - Add test to check bridge table encryption
034d8b67 - Use Vec<(_,_)> instead of HashMap<_,_> for bridge params
a240d14b - Keep uid_fingerprint as String from resource parser

Compare with previous version

requested review from @onyinyang and removed review request for @meskio

marked this merge request as draft

mentioned in issue #84

mentioned in merge request !305

Draft: Simplify BridgeLine struct and make it variable length

Merge request reports

Activity