Skip to main content

Load Data from HDFS using a Pipeline

When you use HDFS pipelines, you extract data from an HDFS file path, optionally transform the data, and load it to a SingleStoreDB table.

With Enabling Wire Encryption and Kerberos on HDFS Pipelines, you can encrypt your pipeline’s connection to HDFS and you can authenticate your pipeline using Kerberos. SingleStoreDB supports Data Transfer Protocol (DTP), which encrypts your pipeline’s connection to HDFS.

This topic assumes that you have set up HDFS in your cluster and you are familiar with how HDFS works.

Note

HDFS Pipelines cannot run Hadoop jobs.