This guide will show you how to load data into your SingleStore cluster using Pipelines. Pipelines are a powerful feature that can extract, transform, and load external data without the need for third-party tools or middleware. This guide will show you how to load the TPC-H dataset and then query it to get business-critical metrics.
What is TPC-H?
The TPC Benchmark™H (TPC-H) is a decision support benchmark. It consists of a suite of business oriented ad-hoc queries and concurrent data modifications. The queries and the data populating the database have been chosen to have broad industry-wide relevance. This benchmark illustrates decision support systems that examine large volumes of data, execute queries with a high degree of complexity, and give answers to critical business questions.
Before You Begin
You need a SingleStore cluster or a self-managed cluster running on Amazon Web Services.
For a SingleStore cluster, see the SingleStore Dev Image deployment option for more details.