But this can cause a load-distribution problem. Under the algorithm, if
two servers hash to points very close together on the unit circle, one
of them is left with only the tiny arc between the two points, and the
load ends up badly skewed.
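
To make the skew concrete, here is a small sketch that computes the
fraction of the circle each server owns. It assumes the common
convention that a key goes to the first server at or after its point
(wrapping around), so each server owns the arc that ends at its own
point. The name arc_shares and the sample positions are made up for
illustration.

```python
def arc_shares(positions):
    """Map each server to the fraction of the unit circle it owns.

    positions: dict of server name -> point in [0, 1).
    Assumption: a server owns the arc from the previous server's
    point up to its own point.
    """
    if len(positions) == 1:
        # A lone server owns the whole circle.
        return {name: 1.0 for name in positions}
    ordered = sorted(positions.items(), key=lambda kv: kv[1])
    shares = {}
    for i, (name, pt) in enumerate(ordered):
        prev_pt = ordered[i - 1][1]  # index -1 wraps to the last server
        shares[name] = (pt - prev_pt) % 1.0
    return shares


# Hypothetical positions: A and B landed almost on top of each other.
shares = arc_shares({"A": 0.10, "B": 0.12, "C": 0.60})
for name in sorted(shares):
    print(name, round(shares[name], 2))
```

Here B ends up with about 2% of the circle while A and C split the
rest roughly evenly.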

Possible solution: don't use hashing to map servers to the unit
circle. Instead, maintain a map of server positions and place each new
server in the longest arc.
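
A minimal sketch of that idea, assuming the new server goes at the
midpoint of the longest arc (the midpoint choice and the name
place_in_longest_arc are my own assumptions; the idea above only says
to pick the longest arc). The "map" here is just a sorted list of
points.

```python
import bisect


def place_in_longest_arc(points):
    """Insert a new point into the sorted list `points` and return it.

    points: sorted list of floats in [0, 1), modified in place.
    Assumption: the new server is placed at the midpoint of the
    longest arc between two neighbouring servers.
    """
    if not points:
        points.append(0.0)  # first server: any point will do
        return 0.0
    n = len(points)
    best_start, best_len = points[0], -1.0
    for i in range(n):
        start = points[i]
        end = points[(i + 1) % n]  # wrap around the circle
        # With a single server the only arc is the whole circle.
        length = 1.0 if n == 1 else (end - start) % 1.0
        if length > best_len:
            best_start, best_len = start, length
    new_point = (best_start + best_len / 2) % 1.0
    bisect.insort(points, new_point)
    return new_point


ring = []
for _ in range(4):
    place_in_longest_arc(ring)
print(ring)
```

The catch is that every client has to see the same map, which gives up
the stateless property you get from hashing the server name.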

The paper also addresses this in section 2.2:
For technical reasons detailed in [6], it is quite important to make a
small number of copies of each cache point--that is, to map several
copies of each cache to different ``random'' points on the unit circle.
This produces a more uniform distribution of URLs to caches.
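
The quoted passage can be sketched roughly as follows. This is not the
paper's code: the MD5-based point function, the "server#i" labelling of
copies, and the names point, build_ring and lookup are all assumptions
made for illustration.

```python
import bisect
import hashlib


def point(label):
    """Hash a string to a point in [0, 1) on the unit circle."""
    digest = hashlib.md5(label.encode("utf-8")).digest()
    # Use 48 bits so the result is an exact float strictly below 1.
    return int.from_bytes(digest[:6], "big") / 2**48


def build_ring(servers, copies=4):
    """Return a sorted list of (point, server), `copies` points per server."""
    return sorted(
        (point(f"{server}#{i}"), server)
        for server in servers
        for i in range(copies)
    )


def lookup(ring, key):
    """Return the server owning the first point at or after the key's point."""
    points = [p for p, _ in ring]
    i = bisect.bisect_left(points, point(key))
    return ring[i % len(ring)][1]  # wrap past the last point to the first


ring = build_ring(["cache-a", "cache-b", "cache-c"], copies=4)
print(lookup(ring, "http://example.com/index.html"))
```

With several points per server, two servers landing close together at
one spot matters much less, because each server's total share is the
sum of several arcs.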

