Wikidata Graph Pattern Benchmark

Wikidata is a free and open knowledge base that can be read and edited by both humans and machines.
Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others.

Wikidata Graph Pattern Benchmark

The Wikidata Graph Pattern Benchmark (WGPB) is a benchmark consisting of 50 instances of 17 different abstract query patterns giving a total of 850 SPARQL queries. The goal of the benchmark is to test the performance of query engines for more complex basic graph patterns. The benchmark was designed for evaluating worst-case optimal join algorithms but also serves as a general-purpose benchmark for evaluating (basic) graph patterns. The queries are provided in SPARQL syntax and all return at least one solution. We limit the number of results returned to a maximum of 1,000.

Data Loaded

https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.nt.gz (mid-April 2022 time frame).

Triple count – 16,882,554,798

Size on Disk – 1,486GB

Load using only 16 cores took <6 hours.



32core (2×16 AMD EPYC 7302, 3.0GHz)

Local SSD


CentOS Linux release 7.9.2009



74% of the queries took <100 milliseconds.

98.5% finished in <1 Second.

There was a single query that took 7.19 seconds which will be optimized in future AllegroGraph releases.




AllegroGraph with FedShard™ delivers Knowledge Graph Solutions