Load Data with Pipelines
Note
The SQL Editor only runs the queries you have selected, so make sure you have them all selected before selecting Run.
-
Run the following SQL commands to create the pipelines:
USE tpch;CREATE OR REPLACE PIPELINE tpch_100_lineitemAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/lineitem/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE lineitemFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_customerAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/customer/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE customerFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_nationAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/nation/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE nationFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_ordersAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/orders/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE ordersFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_partAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/part/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE partFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_partsuppAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/partsupp/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE partsuppFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_regionAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/region/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE regionFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n';CREATE OR REPLACE PIPELINE tpch_100_supplierAS LOAD DATA S3 'memsql-tpch-dataset/sf_100/supplier/'config '{"region":"us-east-1"}'SKIP DUPLICATE KEY ERRORSINTO TABLE supplierFIELDS TERMINATED BY '|'LINES TERMINATED BY '|\n'; -
Run the following SQL commands to start the pipelines:
USE tpch;START ALL PIPELINES;Once the Success message is returned for all the created pipelines, SingleStore starts ingesting the data from the S3 bucket.
Last modified: October 17, 2024