tor spends a lot of time in malloc/free

on my Fast Guard, Tor spends about 25% (!) of its user CPU time in _int_malloc and _int_free. I tried switching to jemalloc, but I just got significantly worse memory fragmentation.