Prerequisites
On this page
The following are the prerequisites for launching Ingest on Amazon EC2:
-
Selection of the Ingest volume.
-
Selection of the EC2 instance type.
-
Ensure connectivity between the server/EC2 hosting the Ingest software and the source.
Additionally, ensure connectivity to DynamoDB if the high availability option is required.
To create the necessary AWS services, refer to Environment Setup.
The following are the steps to take before launching Ingest in AWS via custom installation on an EC2:
-
Create a policy with a relevant name for EC2, such as
FlowEC2Policy
.Refer to the Define custom IAM permissions with customer managed policies for creating policies. -
Refer to AWS Identity and Access Management (IAM) for SingleStore Flow for JSON policy.
-
Create an IAM role called
FlowEC2Role
.Refer to Create a role to delegate permissions to an IAM user for creating roles. -
Attach the
FlowEC2Policy
to the role. -
Create a Lambda policy for disk checks and attach the Lambda policy JSON.
Refer to AWS Recovery for Lambda Policy JSON.
The following are the recommended EC2 options for replicating source data volumes.
Total Data Volume |
EC2 Recommended |
---|---|
< 100 GB |
t2. |
100GB – 300GB |
t2. |
300GB – 1TB |
t2. |
> 1TB |
Contact SingleStore Support |
These recommendations serve as a starting point.
The following are the system requirements when not using the Amazon EC2:
-
Port
8081
must be open on the server hosting the Ingest software. -
Google Chrome is required as the internet browser on the server hosting Ingest software.
-
Java version 21 or higher is required.
-
If using Microsoft SQL Server as a source, download and install the BCP utility.
-
Ensure connectivity between the server hosting the Ingest software and the source, and DynamoDB (if the high availability option is required).
Recommended Hardware Configuration
The following describes the hardware configuration for a Windows server, assuming that there are a few sources and target combinations (3 medium, ideally).
The following describes the hardware configuration for a Windows server; similar configuration is recommended for a Linux or Ubuntu based server.
Component |
Specification |
---|---|
Processor |
4 cores |
Memory |
16 GB |
Disk requirements |
Varies based on the data being extracted, with a minimum of 300 GB |
Network performance |
High |
Prerequisites for Software on Server
The following software must be installed on the server:
-
64-bit Open JDK 21: Amazon Corretto 21 JRE
-
For SQL Server sources only, install the following tools and drivers:
-
Note
BCP may not work on Linux.
If BCP is not available or cannot be used, use JDBC. To disable BCP, see workaround in SingleStore Operational FAQ.
-
-
For a MySQL server only, install
mysqlbinlog.
on the server and include it in the system path.exe
Required Skills
Flow is a suite of robust applications that makes seamless data replication to the cloud.
-
AWS Cloud Fundamentals
-
Basic database skills, including writing and executing database queries (for RDBMS endpoints)
-
Familiarity with using Microsoft Windows or Linux-based systems
Installation
For details on how to install Ingest and other Flow components, refer to Install SingleStore Flow.
Last modified: September 1, 2025