Pipeline Overview
A pipeline is a mechanism for continuously loading data into a SingleStore database from external sources.
Users can monitor the status and performance of pipelines in SingleStore via the dashboard feature.
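As a rough illustration of how a pipeline that continuously loads data is defined and started, the sketch below assembles the two SQL statements involved for a Kafka source. It is a minimal sketch, not a definitive implementation: the helper `create_pipeline_sql`, the pipeline name `events_pipeline`, the table `events`, and the broker address `kafka-host:9092/events` are all hypothetical, and the helper does no identifier quoting or escaping.

```python
def create_pipeline_sql(name: str, source_clause: str, table: str) -> str:
    """Build a CREATE PIPELINE statement as plain text.

    Hypothetical helper for illustration only: the arguments are
    interpolated as-is, so pass trusted values.
    """
    return (
        f"CREATE PIPELINE {name} AS\n"
        f"LOAD DATA {source_clause}\n"
        f"INTO TABLE {table};"
    )


# Assumed names: the pipeline, table, and Kafka endpoint are placeholders.
statements = [
    create_pipeline_sql(
        name="events_pipeline",
        source_clause="KAFKA 'kafka-host:9092/events'",
        table="events",
    ),
    # A pipeline does not load data until it is started.
    "START PIPELINE events_pipeline;",
]

for statement in statements:
    print(statement)
    print()
```

The generated statements would be run through any SQL client connected to the database. Keeping the source clause as a separate argument lets the same helper describe other data sources listed on this page.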
Pipelines and Resource Pools
Resource pools are used to group queries so that non-critical workloads do not overburden the system. Setting MAX_ for a resource pool limits the number of SQL statements that run simultaneously, thereby reducing the burden placed on the system.
Consult the Release Notes for updated information regarding MAX_.
Pipelines waiting in the queue will use the thread pool slots reserved for background pipelines based on the settings of MAX_ and MAX_. pipelines_ engine variable.
Supported Data Sources
Data Source | Data Source Version | MemSQL/SingleStore Version |
---|---|---|
Apache Kafka | 0. | 5. |
Amazon S3 | N/A | 5. |
Filesystem Extractor | N/A | 5. |
Azure Blob | N/A | 5. |
HDFS | 2. | 6. |
Google Cloud Storage | N/A | 7. |
Supported File Formats
Pipelines support the following file formats:
- JSON
- Avro
- Parquet
- CSV
Last modified: October 24, 2023