Rethinking Distributed Query Execution on High-Speed NetworksAbdallah Salama; Carsten Binnig; Tim Kraska; Ansgar Scherp; Tobias Ziegler
In: IEEE Data Engineering Bulletin, Vol. 40, No. 1, Pages 27-37, IEEE, 2017.
In modern high-speed RDMA-capable networks, the bandwidth to transfer data across machines is getting close to the bandwidth of the local memory bus. Recent work has started to investigate how to redesign individual distributed query operators to best leverage RDMA. However, all these novel RDMA-based query operators are still designed for a classical shared-nothing architecture that relies on a shuffle-based execution model to redistribute the data. In this paper, we revisit query execution for distributed database systems on fast networks in a more holistic manner by reconsidering all aspects from the overall database architecture, over the partitioning scheme to the execution model. Our experiments show that in the best case our prototype database system called I-Store, which is designed for fast networks from scratch, provides 3× speed-up over a shuffle-based execution model that was optimized for RDMA.