Consider disabling dynamic intro point formula (numerology)
This ticket was repurposed and it's now about discussing the dynamic formula for determining the number of introduction points of a hidden service.
The formula leaks the popularity of the hidden service on an hourly basis. Furthermore, it's memoryless which causes the hidden service to use a much bigger number of introduction points than normal (comment:15:ticket:15513).
----
In legacy/trac#3825 rransom proposed to tune the number of IP to rebuilt once one expires based on the history of the Tor HS usage. I believe these numbers are not optimal and there should be a better way to do this.
These are the first numbers that I will try to estimate:
I = number of intro points
C = Connections made to an IP in it's lifetime
T = Total number of connections made to the HS in 24 hours
What we are interested in estimating is the value of NUM_INTRO_POINTS_MAX and this is based on the estimation of C. To determine this we will consider this equation:
`I = T/C`
I believe it is reasonable to suppose that a very busy HS will at most have 1M connections in 24 hours. This means that:
`I = 1'000'000/C`
Currently C is set to 16384. I am not sure why this number was chosen, but if this number is good we would need to set the value of I to 61.
Lets take for granted that 61 is a good value for NUM_INTRO_POINTS_MAX, this means that the number of active IP at a given time should be in the range of 3-61.
For this reason I believe it would be good to have the number of IP to recreate to be in the range of 1-20 dependent on the history of a Tor HS.
The basic thing we can do is use a linear function to determine this number x. We want a linear function that has these properties:
`f(0) = 1``f(4/3) = NUM_INTRO_POINTS_MAX ``(supposing that for lifetime of IP tends to end the fraction (time_since_publishing/IP_MIN_LT)*(accepted_ip_connection)/(IP_CON_LT) -> 4/3)`
This leads us to this:
`x = (1 - NUM_INTRO_POINTS_MAX)*((time_since_publishing/IP_MIN_LT)*(accepted_ip_connection)/(IP_CON_LT)) + 1`
in the case of NUM_INTRO_POINTS_MAX = 20 this means:
`x = 25.3333333 * (time_since_publishing/IP_MIN_LT)*(accepted_ip_connection)/(IP_CON_LT) + 1`
A better way to do this is have an exponential function that converges asymptotically to 20.
Does this seem sane?
issue