A benchmark for serverless analytic databases.

jcamachor/cab

 
 

Cloud Analytics Benchmark (CAB)

This repository contains the implementation of the CAB benchmark, as described in this paper.

Below is a short description of the individual folders and how to use them.

cab/benchmark-gen

A C++ program that generates the query streams. It can be configured by changing the variables in the main function:

   const uint64_t total_size = 4_TB;
   const uint64_t total_cpu_hours = 40;
   const uint64_t total_duration_in_hours = 1;
   const uint64_t database_count = 20;
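
The `4_TB` value relies on a C++ user-defined literal. The sketch below shows one way such a suffix could be defined and what the default settings work out to on average. The decimal definition of a terabyte and the helpers `average_database_size` and `average_busy_cores` are assumptions made for illustration; the generator's own definitions and the way it spreads data and load across databases may differ.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical definition of the _TB suffix, assuming decimal terabytes
// (10^12 bytes). The definition used in benchmark.cpp may differ.
constexpr std::uint64_t operator""_TB(unsigned long long value) {
    return static_cast<std::uint64_t>(value) * 1000ull * 1000ull * 1000ull * 1000ull;
}

// Illustrative helper: bytes per database if the total size were split
// evenly. The real generator is free to use a skewed split.
inline std::uint64_t average_database_size(std::uint64_t total_size,
                                           std::uint64_t database_count) {
    assert(database_count > 0);
    return total_size / database_count;
}

// Illustrative helper: average number of fully busy cores implied by
// spending total_cpu_hours of CPU time within total_duration_in_hours.
inline double average_busy_cores(std::uint64_t total_cpu_hours,
                                 std::uint64_t total_duration_in_hours) {
    assert(total_duration_in_hours > 0);
    return static_cast<double>(total_cpu_hours) /
           static_cast<double>(total_duration_in_hours);
}
```

Under these assumptions, the defaults above correspond to an average of 200 GB per database and an average load of 40 busy cores over the one-hour run.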

The output is written to benchmark-gen/query_streams. The program can be compiled with:

clang++ -std=c++17 -Wall -Werror=return-type -Werror=non-virtual-dtor -Werror=sequence-point -Wsign-compare -march=native -O2 -Wfatal-errors benchmark.cpp

Note that the random distributions in the C++ standard library are platform dependent, so the generated query streams may differ from one platform to another. The overall distribution and pattern remain the same.
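
To illustrate where the platform dependence comes from: the C++ standard fixes the output sequence of engines such as `std::mt19937`, but leaves the algorithms of the distribution classes (for example `std::uniform_int_distribution`) to each standard library. The sketch below is not taken from the repository. It shows one possible way to draw reproducible values by mapping raw engine output with plain integer arithmetic. The helper name `portable_uniform` is hypothetical, and the modulo mapping carries a small bias that may or may not be acceptable for a given workload.

```cpp
#include <cassert>
#include <cstdint>
#include <random>

// Hypothetical helper: returns a value in [0, bound) using only the raw
// output of std::mt19937, whose sequence is specified by the standard.
// Plain modulo introduces a slight bias when bound does not divide 2^32.
inline std::uint32_t portable_uniform(std::mt19937& engine, std::uint32_t bound) {
    assert(bound > 0);
    return static_cast<std::uint32_t>(engine() % bound);
}
```

With the same seed, this helper yields the same values on every conforming platform, whereas the standard distribution classes give no such guarantee.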

cab/benchmark-query-streams

Contains pre-generated query streams. These are the ones we used in the experiments in the paper.

cab/benchmark-results

Contains the results we obtained by running the experiments as described in the paper.

cab/benchmark-run

A set of JavaScript programs for running the benchmark and analyzing its results.

cab/snowset-analysis

All R scripts we used to analyze the Snowset dataset and to plot the results of the benchmark run.
