# Load Data with Pipelines

This part of the tutorial shows how to ingest TPC-H data from a public AWS S3 bucket into the SingleStore database using pipelines.

> **📝 Note**: The **SQL Editor** runs only the queries that are selected, so select all of the queries in a step before selecting **Run**.

1. Run the following SQL commands to create the pipelines:
   ```sql
   USE tpch;

   CREATE OR REPLACE PIPELINE tpch_100_lineitem
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/lineitem/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE lineitem
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_customer
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/customer/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE customer
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_nation
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/nation/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE nation
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_orders
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/orders/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE orders
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_part
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/part/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE part
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_partsupp
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/partsupp/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE partsupp
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_region
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/region/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE region
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';

   CREATE OR REPLACE PIPELINE tpch_100_supplier
       AS LOAD DATA S3 'memsql-tpch-dataset/sf_100/supplier/'
       CONFIG '{"region":"us-east-1"}'
       SKIP DUPLICATE KEY ERRORS
       INTO TABLE supplier
       FIELDS TERMINATED BY '|'
       LINES TERMINATED BY '|\n';
   ```

2. Run the following SQL commands to start the pipelines:
   ```sql
   USE tpch;
   START ALL PIPELINES;
   ```
   Once the **Success** message is returned for all the created pipelines, SingleStore starts ingesting the data from the S3 bucket.
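
To follow the ingest after the pipelines start, a few monitoring queries can help. The following is a minimal sketch rather than part of the official tutorial steps. It assumes the `information_schema.PIPELINES_FILES` and `information_schema.PIPELINES_ERRORS` views, and the column names shown, are available in your SingleStore version; adjust the names if your version differs.

```sql
USE tpch;

-- List the pipelines in the current database together with their state.
SHOW PIPELINES;

-- Count the S3 files for each pipeline, grouped by load state.
-- Assumption: the view exposes database_name, pipeline_name, and file_state.
SELECT pipeline_name, file_state, COUNT(*) AS file_count
FROM information_schema.PIPELINES_FILES
WHERE database_name = 'tpch'
GROUP BY pipeline_name, file_state
ORDER BY pipeline_name, file_state;

-- Show any errors that the pipelines have reported so far.
-- Assumption: the view exposes database_name, pipeline_name, and error_message.
SELECT pipeline_name, error_message
FROM information_schema.PIPELINES_ERRORS
WHERE database_name = 'tpch';
```

Rerunning the second query shows files moving out of the unloaded state as the load progresses. An empty result from the third query suggests that no pipeline errors have been recorded.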

***

Modified at: October 28, 2024

Source: [/db/v9.1/introduction/sample-data/load-tpc-h-data-into-singlestore/load-data-with-pipelines/](https://docs.singlestore.com/db/v9.1/introduction/sample-data/load-tpc-h-data-into-singlestore/load-data-with-pipelines/)

