Rewrite the UTF-8 conformity section of prop#285

I rewrote the UTF-8 conformity section of prop#285 to be more specific, use terminology from The Unicode Standard, and ban byte-swapped byte order marks.
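
To illustrate the kind of check this implies, here is a minimal Python sketch. It assumes "conforming" means well-formed UTF-8 as defined by The Unicode Standard, plus rejection of U+FFFE, the code point a byte order mark (U+FEFF) turns into when read with the wrong byte order. The helper name `is_conforming_utf8` is hypothetical and is not taken from prop#285 or the Tor source; the exact rules are in the branch.

```python
# Hypothetical helper for illustration only; not part of prop#285 or Tor.

# U+FFFE is what U+FEFF (the byte order mark) becomes when its two
# bytes are swapped. In UTF-8 it is encoded as EF BF BE.
BYTE_SWAPPED_BOM = "\ufffe"


def is_conforming_utf8(data: bytes) -> bool:
    """Return True if data is well-formed UTF-8 with no byte-swapped BOM."""
    try:
        # Strict decoding rejects ill-formed sequences such as overlong
        # forms, encoded surrogates, and truncated multi-byte sequences.
        text = data.decode("utf-8", errors="strict")
    except UnicodeDecodeError:
        return False
    # Assumption: a regular BOM (U+FEFF) is not rejected here; only the
    # byte-swapped form is banned by this sketch.
    return BYTE_SWAPPED_BOM not in text
```

With this sketch, `is_conforming_utf8(b"\xef\xbf\xbe")` is rejected because the bytes decode to U+FFFE, and `is_conforming_utf8(b"\xc0\x80")` is rejected because it is an overlong (ill-formed) sequence.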

Please see my branch utf-8-extra on https://github.com/teor2345/torspec.git