Wednesday, September 23, 2009

15. Safe and Effective Fine-grained TCP Retransmissions for Datacenter Communication

This paper deals with the TCP incast problem when inbound data overflows the switch buffers by using high resolution timers for TCP time-outs. When there are multiple end-node sending out queries, the client side cannot make forward progress until they receive responses from every server, this is referred to barrier-synchronized requests in the article.

TCP incast collapse:
  • high-bandwidth, low-latency networks with small buffers in the switches
  • clients issue barrier-sync requests in parallel
  • servers respond with small amount of data per request

By enabling microsecond-granularity retransmission time-outs (RTO), the authors intended to solve the incast problem commonly seen in data centers,
  1. modify TCP implementation to use high-resolution kernel timers, timeout = 2^backoff (RTO+ rand(0.5) * RTO )
  2. prevent TCP incast collapse for up to 47 concurrent senders
  3. recover in data centers do not affect performance
Comment: This is a different approach to solve the problems in data centers. The other papers we read strived for backward-compatibility, while this one modifies the TCP stack. Since data centers are usually confined to a fixed location and managed by normally one company, I guess it is a feasible way to solve TCP incast collapse.

1 comment:

  1. Although modifying the TCP stack might be ok for data centers, it seems like one of the conditions for a solution to the Incast problems is that it should be generalizable to the wide area network, making backward-compatibility more important.

    ReplyDelete