Wikidata Graph Pattern Benchmark

Wikidata is a free and open knowledge base that can be read and edited by both humans and machines.
Wikidata acts as central storage for the structured data of its Wikimedia sister projects including Wikipedia, Wikivoyage, Wiktionary, Wikisource, and others.



The Wikidata Graph Pattern Benchmark (WGPB) consists of 50 instances of each of 17 abstract query patterns, for a total of 850 SPARQL queries. Its goal is to test how query engines perform on more complex basic graph patterns. It was designed to evaluate worst-case optimal join algorithms, but it also serves as a general-purpose benchmark for basic graph patterns. The queries are provided in SPARQL syntax, and each returns at least one solution. We limit the number of results returned per query to a maximum of 1,000.
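To make the query shape concrete, here is a minimal Python sketch that assembles a basic graph pattern into a SPARQL query capped at 1,000 results. The helper name `build_bgp_query`, the triangle-shaped pattern, and the choice of Wikidata properties (P31, P279, P361) are illustrative assumptions and are not taken from the benchmark's own query files.

```python
MAX_RESULTS = 1000  # result cap stated in the benchmark description


def build_bgp_query(triple_patterns, limit=MAX_RESULTS):
    """Join triple patterns into one SELECT query with a LIMIT clause."""
    body = " .\n  ".join(" ".join(tp) for tp in triple_patterns)
    return f"SELECT * WHERE {{\n  {body} .\n}} LIMIT {limit}"


# Hypothetical triangle pattern: three variables connected in a cycle.
# The predicates are assumptions chosen only to illustrate the shape.
WDT = "http://www.wikidata.org/prop/direct/"
triangle = [
    ("?x", f"<{WDT}P31>", "?y"),
    ("?y", f"<{WDT}P279>", "?z"),
    ("?z", f"<{WDT}P361>", "?x"),
]

query = build_bgp_query(triangle)
print(query)

# 17 abstract patterns with 50 instances each gives the 850 queries.
total_queries = 17 * 50
```

Each variable appears in two triple patterns, so the engine has to join all three patterns at once. Cyclic shapes like this are the kind of query where worst-case optimal join algorithms are expected to help.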

Data Loaded

https://dumps.wikimedia.org/wikidatawiki/entities/latest-all.nt.gz (dump from mid-April 2022).

Triple count – 16,882,554,798

Size on Disk – 1,486GB

Loading with only 16 cores took under 6 hours.
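The figures above imply a rough load throughput and storage footprint, which the short Python sketch below works out. It assumes that "GB" means decimal gigabytes (10^9 bytes) and treats 6 hours as an upper bound on the load time, so the throughput it prints is a lower bound, not a measured rate.

```python
TRIPLES = 16_882_554_798      # triple count reported above
DISK_BYTES = 1_486 * 10**9    # assumption: decimal gigabytes (10^9 bytes)
LOAD_SECONDS = 6 * 3600       # upper bound: the load took less than this

# The load finished in under 6 hours, so this is a lower bound on throughput.
min_triples_per_second = TRIPLES / LOAD_SECONDS
bytes_per_triple = DISK_BYTES / TRIPLES

print(f"at least {min_triples_per_second:,.0f} triples/s")
print(f"about {bytes_per_triple:.0f} bytes per triple on disk")
```

Under these assumptions the load sustained roughly 780,000 triples per second or more, and the store uses about 88 bytes per triple on disk.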

Hardware

32 cores (2 × 16-core AMD EPYC 7302, 3.0 GHz)

Local SSD

256GB RAM

CentOS Linux release 7.9.2009

Results

74% of the queries took <100 milliseconds.

98.5% finished in <1 second.

A single query took 7.19 seconds; it will be optimized in future AllegroGraph releases.
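Summary figures of this kind can be derived from per-query timings as in the Python sketch below. The function name `fraction_under` and the sample latencies are made up for illustration; they are not the benchmark's measurements.

```python
def fraction_under(latencies_s, threshold_s):
    """Return the share of latencies strictly below the threshold."""
    if not latencies_s:
        raise ValueError("no latencies to summarize")
    return sum(1 for t in latencies_s if t < threshold_s) / len(latencies_s)


# Made-up sample timings in seconds, for illustration only.
sample = [0.004, 0.020, 0.095, 0.250, 0.800, 1.500, 0.060, 0.010]

under_100ms = fraction_under(sample, 0.100)
under_1s = fraction_under(sample, 1.0)
slowest = max(sample)

print(f"{under_100ms:.1%} under 100 ms, {under_1s:.1%} under 1 s, "
      f"slowest {slowest:.2f} s")
```

Reporting the slowest query next to the threshold shares keeps outliers visible, which matters when a few queries dominate total runtime.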

