These are research project ideas relating to anti-censorship work at Tor. If you're interested in working on any of them, feel free to reach out to us!
Snowflake enumeration attempts
Question: If an adversary were to try to enumerate snowflake proxies, how many would they see? How much churn is there in Snowflake proxies? How effectively can they block Snowflake this way?
Some relevant discussion/links:
- Discussion during anti-censorship meeting: http://meetbot.debian.net/tor-meeting/2021/tor-meeting.2021-02-04-15.58.html
- Ticket for implementing Snowflake churn metrics: tpo/anti-censorship/pluggable-transports/snowflake#34075 (closed)
Calibrate bridge users estimation with on-bridge socket counts
Tor Metrics bridge user graphs depict not unique IP addresses, but rather an average number of concurrently connected users per day. Simplifying slightly, the number of concurrent users is estimated by taking the number of directory requests and dividing by 10. The constant of 10 is somewhat arbitrary, reflecting an educated guess that an average Tor users remains connected for 2.4 hours per day.
The constant 10 is effectively a scaling factor for the user graphs. Its exact value does not matter when, for example, you want to compare two graphs to see which is bigger. But it would be nice if it were calibrated to match reality as closely as possible.
The idea is to repeatedly sample the number of sockets that are connected to the localhost ExtORPort to get an average per day, then compare that locally computed average with what Tor Metrics reports. If the currently used ExtOrPort is 127.0.0.1:1234, then you can sample the number of sockets currently connected to the ExtOrPort with a command like
ss -n state established 'dport = 1234' 'dst 127.0.0.1' | wc -l
(Subtract 1 to account for the header in the output.)
@dcf ran a socket counting script on two snowflake bridges for a few weeks in June–July 2023 and compared the sockets counts to Tor Metrics calculations. The snowflake-01 bridge was off by about a factor of 2.0, and the snowflake-02 bridge was off by a factor of about 1.5. The difference may be attributable to the fact that, at the time, the snowflake-02 bridge had been included in Tor Browser since version 12.0.3, but Orbot 17 (the first version to contain the snowflake-02 bridge) had not been fully released yet.
- https://www.bamsoftware.com/talks/pets-2023-metrics/
- https://www.bamsoftware.com/talks/pets-2023-metrics/pets-2023-metrics.zip
Compare results with "Understanding Tor Usage with Privacy-Preserving Measurement".