Example: Loading TPC-H Data into SingleStore

This guide will show you how to load data into your SingleStore cluster using Pipelines. Pipelines are a powerful feature that can extract, transform, and load external data without the need for third-party tools or middleware. This guide will show you how to load the TPC-H dataset and then query it to get business-critical metrics.

What is TPC-H?

The TPC Benchmark™H (TPC-H) is a decision support benchmark. It consists of a suite of business oriented ad-hoc queries and concurrent data modifications. The queries and the data populating the database have been chosen to have broad industry-wide relevance. This benchmark illustrates decision support systems that examine large volumes of data, execute queries with a high degree of complexity, and give answers to critical business questions.

Before You Begin

You need a SingleStore cluster or a self-managed cluster running on Amazon Web Services.

For a SingleStore cluster, see the SingleStore Dev Image deployment option for more details.

In this section

Last modified: July 29, 2024

Was this article helpful?