Load Data from HDFS using a Pipeline

An HDFS pipeline extracts data from an HDFS file path, optionally transforms the data, and loads it into a SingleStoreDB table.
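The following is a minimal sketch of that extract-and-load flow, not a definitive setup. The table name `my_table`, its columns, the pipeline name `my_pipeline`, the NameNode host and port `hadoop-namenode:8020`, the path `/path/to/files`, and the tab-delimited file format are all placeholder assumptions; replace them with values from your own cluster.

```sql
-- Hypothetical destination table; the columns are illustrative only.
CREATE TABLE my_table (
  id INT,
  payload TEXT
);

-- Define a pipeline that reads files under an HDFS path.
-- The host, port, and path are placeholders, and the files are
-- assumed to be tab-delimited text.
CREATE PIPELINE my_pipeline AS
LOAD DATA HDFS 'hdfs://hadoop-namenode:8020/path/to/files'
INTO TABLE my_table
FIELDS TERMINATED BY '\t';

-- Start loading data in the background.
START PIPELINE my_pipeline;
```

Creating a pipeline only defines it; no data is loaded until the pipeline is started.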

You can encrypt your pipeline’s connection to HDFS and authenticate the pipeline using Kerberos, as described in Enabling Wire Encryption and Kerberos on HDFS Pipelines. For wire encryption, SingleStoreDB supports Data Transfer Protocol (DTP).

This topic assumes that you have set up HDFS in your cluster and are familiar with how HDFS works.


HDFS Pipelines cannot run Hadoop jobs.


Last modified: April 12, 2023
