Load Data from Spark

You can use SingleStore and Spark together to accelerate workloads by combining the computational power of Spark with the fast ingest and persistent storage that SingleStore offers. The SingleStore Spark Connector connects your Spark and SingleStore environments. It supports both loading data from Spark DataFrames into database tables and extracting data from database tables into Spark DataFrames.

The connector is implemented as a native Spark SQL plugin and supports Spark’s DataSource API. Spark SQL operates on a variety of data sources through the DataFrame interface, and the DataFrame API is the most widely used way for Spark to interact with other systems.
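Because the connector plugs into the DataFrame interface, reading and writing follow the usual Spark reader and writer pattern. The following is a minimal Python sketch, not a definitive implementation. It assumes the data source is registered under the name `singlestore` and accepts the options `ddlEndpoint`, `user`, and `password`; verify these names against the connector version you run. The helper names, host, and credentials are hypothetical placeholders.

```python
from typing import Any, Dict

# Assumption: the connector registers itself under this data source name.
SINGLESTORE_FORMAT = "singlestore"


def connection_options(host: str, user: str, password: str, port: int = 3306) -> Dict[str, str]:
    """Build connection options for a read or write (option names are assumptions)."""
    return {
        "ddlEndpoint": f"{host}:{port}",
        "user": user,
        "password": password,
    }


def read_table(spark: Any, table: str, options: Dict[str, str]) -> Any:
    """Extract a SingleStore table (for example "db.table") into a Spark DataFrame."""
    return spark.read.format(SINGLESTORE_FORMAT).options(**options).load(table)


def write_table(df: Any, table: str, options: Dict[str, str], mode: str = "append") -> None:
    """Load a Spark DataFrame into a SingleStore table."""
    df.write.format(SINGLESTORE_FORMAT).options(**options).mode(mode).save(table)
```

With an active `SparkSession` named `spark` and the connector on the classpath, `read_table(spark, "db.table", connection_options("host.example.com", "admin", "secret"))` would return a DataFrame, and `write_table` would append a DataFrame to the named table.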

In addition, the connector is a true Spark data source: it integrates with the Catalyst query optimizer, supports robust SQL pushdown, and uses SingleStore LOAD DATA with compression to accelerate ingest from Spark.
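Pushdown and ingest compression are typically controlled through data source options. The sketch below shows one way to layer those settings onto an existing options dictionary. It assumes the connector exposes `loadDataCompression` (with the values `GZip`, `LZ4`, and `Skip`) and `disablePushdown`; treat both option names and the value list as assumptions to check against the connector's configuration reference. The helper functions are hypothetical.

```python
from typing import Dict

# Assumption: codec names accepted by the connector's loadDataCompression option.
COMPRESSION_CODECS = ("GZip", "LZ4", "Skip")


def with_ingest_compression(options: Dict[str, str], codec: str = "GZip") -> Dict[str, str]:
    """Return a copy of the options with the LOAD DATA compression codec set."""
    if codec not in COMPRESSION_CODECS:
        raise ValueError(f"codec must be one of {COMPRESSION_CODECS}, got {codec!r}")
    return {**options, "loadDataCompression": codec}


def with_pushdown(options: Dict[str, str], enabled: bool = True) -> Dict[str, str]:
    """Return a copy of the options with SQL pushdown switched on or off."""
    return {**options, "disablePushdown": "false" if enabled else "true"}
```

Both helpers return new dictionaries rather than mutating their input, so one base set of connection options can be reused for reads and writes with different settings. Turning pushdown off can help when checking whether a query result differs between Spark and pushed-down execution.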

Last modified: September 27, 2023