Wikidata is a free and open knowledge base that can be read and edited by both humans and machines.
Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others.
Wikidata Graph Pattern Benchmark
The Wikidata Graph Pattern Benchmark (WGPB) is a benchmark consisting of 50 instances of 17 different abstract query patterns giving a total of 850 SPARQL queries. The goal of the benchmark is to test the performance of query engines for more complex basic graph patterns. The benchmark was designed for evaluating worst-case optimal join algorithms but also serves as a general-purpose benchmark for evaluating (basic) graph patterns. The queries are provided in SPARQL syntax and all return at least one solution. We limit the number of results returned to a maximum of 1,000.
Data Loaded
https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.nt.gz (mid-April 2022 time frame).
Triple count – 16,882,554,798
Size on Disk – 1,486GB
Load using only 16 cores took <6 hours.
Hardware
32core (2×16 AMD EPYC 7302, 3.0GHz)
Local SSD
256GB RAM
CentOS Linux release 7.9.2009
Results
74% of the queries took <100 milliseconds.
98.5% finished in <1 Second.
There was a single query that took 7.19 seconds which will be optimized in future AllegroGraph releases.