Big data benchmark tool
Big data benchmark tool – BigDataBench – supports measuring, benchmarking, and evaluating hardware systems, software systems, and business systems. It provides full life-cycle benchmarking services for the design, selection, acceptance review, expansion, and optimization of the big data systems. BigDataBench includes diverse workloads implemented with mainstream big data systems like Hadoop, Spark, Flink, and covered various types like search engine, e-commerce, social network, cognitive science, and medicine. From the perspective of dataset, BigDataBench uses real-world datasets and covers multiple data sources, e.g., text, table, graph, and image, and data types, e.g., structural, semi-structural, and un-structural. In addition, to support the scale-up and scale-out scalability, BigDataBench provides big data generator suite (BDGS) which supports user defined large-scale data size and preserves the characteristics of the real-world datasets.