ISSN 1000-1239 CN 11-1777/TP

Journal of Computer Research and Development ›› 2020, Vol. 57 ›› Issue (12): 2703-2716.doi: 10.7544/issn1000-1239.2020.20190686

Previous Articles    

Causal-Pdh: Causal Consistency Model for NoSQL Distributed Data Storage Using HashGraph

Tian Junfeng, Wang Yanbiao   

  1. (School of Cyber Security and Computer, Hebei University, Baoding, Hebei 071002) (Key Laboratory on High Trusted Information System in Hebei Province (Hebei University), Baoding, Hebei 071002)
  • Online:2020-12-01
  • Supported by: 
    This work was supported by the National Natural Science Foundation of China for Young Scientists (61802106).

Abstract: The causal consistency of data in a distributed environment means that when data with causal dependence is updated, the dependency metadata in other distributed copies must be updated simultaneously, while meeting higher availability and performance requirements. To solve the problem of users put latency and updating visible latency in existing results, based on the data center stable vectors, combined with the principle of hybrid logical clocks and the HashGraph, we propose the Causal-Pdh model. To reduce the communication overhead caused by exchanging data between replicates, partial stabel vectors required by synchronizing data and Hash value as the message signatures are used instead of the whole data center stable vectors. The principle of virtual voting in HashGraph is used to improve the process of synchronizing the latest entries in each data center. Just like Gossip about Gossip: each parent node also randomly exchanges the latest status, and updates the clock regularly. This progress reduces the time of virtual voting between the replicates. Finally, it is verified by experiments that the Causal-Pdh model not only doesnt affect the throughput of the client query, but also reduces the wait latency of users put operation by 20.85% when the clock skew is severe. When the query is amplified in the system, the response time of request is reduced by 23.37%.

Key words: data consistency, causal consistency, distributed storage, HashGraph, hybrid logical clocks

CLC Number: