Skip to content

O.1. Enhance data collection methods for the metrics pipeline

We want to adopt a more proactive stance in regard to addressing threats impacting the network and users overall. In order to do this we need to review our current data collection tools, considering how we serve, store and retrieve our historical data, and if there are other datasets that we would like to ingest in our pipeline.

Desired outputs:

  • List of information and data about the network that is available.
  • Documented user needs discovery.
  • Updated data collection tools.
  • Historical network data that can be queried and analyzed.

Desired Outcomes:

  • Tor network data is stored in such a way that can be served and queried much faster and with less friction.
  • Tor network data collection system is more robust; data is less likely to be lost.
  • We are prepared for Objective 2, in which we need to be able to easily query large data sets from historical network data.
Edited by Georg Koppen
To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information