LDBC SNB Business Intelligence (BI) workload implementations

Implementations for the BI workload of the LDBC Social Network Benchmark. See our VLDB 2023 paper and its presentation for details on the design and implementation of the benchmark.

To get started with the LDBC SNB benchmarks, visit the ldbcouncil.org site.

📜 If you wish to cite the LDBC SNB, please refer to the documentation repository (bib snippet).

Implementations

The repository contains implementations for several database systems, each in its own subdirectory with a README describing its setup; the examples below use the Neo4j and Umbra implementations.

All implementations use Docker containers for ease of setup and execution. However, the setups can be adjusted to use a non-containerized DBMS.

Reproducing SNB BI experiments

Running an SNB BI experiment requires the following steps.

  1. Pick a system, e.g. Umbra. Make sure you have the required binaries and licenses available.

  2. Generate the data sets using the SNB Datagen according to the format described in the system's README.

  3. Generate the substitution parameters using the paramgen tool.

  4. Load the data set: set the required environment variables and run the tool's scripts/load-in-one-step.sh script (steps 4-6 are sketched after this list).

  5. Run the benchmark: set the required environment variables and run the tool's scripts/benchmark.sh script.

  6. Collect the results in the output directory of the tool.
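For orientation, a minimal sketch of steps 4-6 for the Umbra implementation at scale factor 10 is shown below. This is illustrative only: the data directory variable is a hypothetical placeholder, and each tool's README lists the environment variables it actually requires.

export SF=10

cd umbra
# steps 2-3 (Datagen and paramgen) are covered in their respective READMEs
# hypothetical placeholder variable; see the Umbra README for the actual name
export UMBRA_CSV_DIR=/path/to/generated/data-set
scripts/load-in-one-step.sh   # step 4: load the data set
scripts/benchmark.sh          # step 5: run the benchmark
cd ..
# step 6: collect the results from the tool's output directory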

⚠️ Note that deriving official LDBC results requires commissioning an audited benchmark, which is a more complex process as it entails code review, cross-validation, etc. For details, see LDBC's auditing process, the specification's Auditing chapter and the audit questionnaire.

Cross-validation

To cross-validate the results of two implementations, load the data set into both systems, run the benchmark in validation mode for each, and compare their outputs. For example, to cross-validate the Neo4j and Umbra results, run:

# run both implementations in validation mode at the same scale factor
export SF=10

cd neo4j
scripts/benchmark.sh --validate
cd ..

cd umbra
scripts/benchmark.sh --validate
cd ..

# compare the outputs of the two runs
scripts/cross-validate.sh neo4j umbra

Usage

See .circleci/config.yml for an up-to-date example of how to use the projects in this repository.

Data sets

Pre-generated data sets and substitution parameters are available for download; see the ldbcouncil.org site for links.

Scoring

To run the scoring on a full benchmark run, use the scripts/score-full.sh script, e.g. for the Umbra results at scale factor 100:

scripts/score-full.sh umbra 100

The script prints its summary to the standard output and saves the detailed output tables in the scoring directory (as .tex files).