S3 Pipeline Errors
On this page
For more information about creating pipelines with S3, refer to Load Data from Amazon Web Services (AWS) S3.
S3 Authentication Errors
You may receive authentication errors if you attempt to create an S3 pipeline without providing credentials or if the provided credentials are invalid.
NoCredentialProviders: no valid providers in chain.
This error is caused by one or more of the following conditions:
-
No
CREDENTIALS
were specified in theCREATE PIPELINE
statement or the JSON was malformed. -
An IAM role was specified, but your EC2 instance was not configured with an instance profile.
"aws_ access_ key_ id" specified, but not "aws_ secret_ access_ key"
This error is caused by a missing aws_
key in the CREDENTIALS
JSON of your CREATE PIPELINE
statement, or if the JSON key is malformed.
"aws_ secret_ access_ key" specified, but not "aws_ access_ key_ id"
This error is caused by a missing aws_
key in the CREDENTIALS
JSON of your CREATE PIPELINE
statement, or if the JSON is malformed.
InvalidAccessKeyID: The access key ID you provided does not exist in our records
This error is caused by specifying an Access Key ID that does not exist.
SignatureDoesNotMatch: The request signature we calculated does not match the signature you provided. Check your key and signing method
This error is caused by specifying an invalid combination of an Access Key ID and a Secret Access Key.
High Memory Usage for S3 Pipeline
When using an S3 pipeline, over time, an increase in the memory used by the md_
table may occur.ENABLE OFFSETS METADATA GC
.
By default, the pipeline garbage collector (GC) for S3 is not enabled.ENABLE OFFSETS METADATA GC
should be added to the CREATE PIPELINE
query block to enable pipeline garbage collection on new pipelines.ALTER PIPELINE
statement with the ENABLE OFFSETS METADATA GC
clause.
See the S3 Pipeline Using Metadata Garbage Collection (GC) section in the CREATE PIPELINE or the ALTER PIPELINE topics.
To check the memory usage, use the query below:
SELECT * FROM information_schema.INTERNAL_TABLE_STATISTICS WHERE table_name LIKE "md_extractors_offsets" ORDER BY memory_use DESC;
+---------------+-----------------------+---------+-----------+------+------------+----------------+------+------------+-------------------+----------------+
| DATABASE_NAME | TABLE_NAME | ORDINAL | HOST | PORT | NODE_TYPE | PARTITION_TYPE | ROWS | MEMORY_USE | STORAGE_TYPE | ROWS_IN_MEMORY |
+---------------+-----------------------+---------+-----------+------+------------+----------------+------+------------+-------------------+----------------+
| ticket_test | md_extractors_offsets | NULL | 127.0.0.1 | 3306 | Aggregator | Reference | 2 | 524544 | INTERNAL_METADATA | 2 |
| ticket_test | md_extractors_offsets | NULL | 127.0.0.1 | 3307 | Leaf | Reference | 2 | 524544 | INTERNAL_METADATA | 2 |
| ticket_test | md_extractors_offsets | 1 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 7 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 6 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 5 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 4 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 3 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 2 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
| ticket_test | md_extractors_offsets | 0 | 127.0.0.1 | 3307 | Leaf | Master | 0 | 0 | INTERNAL_METADATA | 0 |
+---------------+-----------------------+---------+-----------+------+------------+----------------+------+------------+-------------------+----------------+
Last modified: October 8, 2024