I. Introduction
In a datacenter, hundreds of thousands of servers are interconnected by a network with high link speeds (40–100 Gbps) and low latency (10–100 µs) [1], [2]. Today's datacenters host numerous real-time applications, e.g., web search, retail, and social networking [3]–[6]. These applications generate large numbers of small, latency-sensitive request and response messages. Because the user experience is determined by how quickly an application collects all (or most) of its response messages, datacenter networks face stringent delay requirements.