Pipeline Troubleshooting
On this page
Concepts
This topic requires an understanding of pipeline batches, which are explained in The Lifecycle of a Pipeline.
Address specific errors
The following table lists errors that can occur when running a pipeline, such as CREATE PIPELINE
, and errors that can occur while a pipeline is extracting, shaping, or loading data.
Error |
Resolution |
---|---|
Syntax error when running |
Both |
Error |
The master aggregator is likely unable to connect to the pipeline's data source. |
|
The bucket name is case-sensitive. |
Error |
This error can occur when a pipeline attempts to run a transform.
|
|
An incorrect path to the transform was likely specified. |
Error: |
This error may occur when the heap memory usage exceeds the value of |
A parsing error occurs in your transform. |
To debug your transform, you can run |
Error: An error that isn't associated with any specific source partition of the pipeline occurred during the batch loading process. The whole batch will be failed.
Issue
The batch loading process was able to load the data from the source, but it failed to ingest the data into the SingleStore database.
Solution
Address the secondary error to solve the issue.
For example, consider the following error:
Error,2790,"An error that isn't associated with any specific source partition of the pipeline
occurred during the batch loading process. The whole batch will be failed.
Error 1205 : ""Leaf Error (svchost:3306): Lock wait timeout exceeded; try restarting transaction.
Unique key Row Value lock owned by connection id xxxx, query `open idle transaction`"""
In this case, the pipelines failed because the query was unable to acquire row locks for ingesting data.
Rename a table referenced by a pipeline
Trying to rename a table that is referenced by a pipeline returns the following error:
ERROR 1945 ER_CANNOT_DROP_REFERENCED_BY_PIPELINE: Cannot rename table because it is referenced by pipeline <pipeline_name>
The following sequence demonstrates how to rename a pipeline referenced table:
-
Save the pipeline settings.
SHOW CREATE PIPELINE <pipeline_name> EXTENDED; -
Stop the pipeline.
STOP PIPELINE <pipeline_name>; -
Drop the pipeline.
DROP PIPELINE <pipeline_name>; -
Change the name of the table.
ALTER TABLE <old_table_name> RENAME <new_table_name>; -
Recreate the pipeline with the required configuration options, and change the table name to reflect the new table name.
-
Start the pipeline.
START PIPELINE <pipeline_name>;
Pipeline errors that are handled automatically
Typical error handing scenario
In most situations, an error that occurs while a pipeline is running is handled in this way:
If an error occurs while a batch b
is running, then b
will fail and b
's transaction rolls back.b
is retried at most pipelines_
times.pipelines_
is set to ON
, the pipeline stops.nb
,which processes the same files or objects that b
attempted to process, excluding any files or objects that may have caused the error.
The following table lists events, which may or may not cause errors, and how the events are handled.
Event |
How the Event is Handled |
---|---|
The pipeline cannot access a file or object. |
|
The pipeline cannot read a file or object because it is corrupted. |
After fixing the issue with the corrupted file/object, you can have the pipeline reprocess the file/object by running |
A file or object is removed from the filesystem after the batch has started processing the file/object. |
The batch does not fail; the file or object is processed. |
A file is removed from the filesystem (or an object is removed from an object store) after the pipeline registers the file/object in |
|
The cluster restarts while the batch is being processed. |
The typical error handling scenario (mentioned earlier in this topic) applies. Once the cluster is online, |
A leaf node is unavailable before the pipeline starts. |
This does not cause the pipeline to fail. |
A leaf node fails while the pipeline is running. |
The batch fails. |
An aggregator fails while the pipeline is running |
The batch fails. |
The pipeline reaches the allocated storage space for errors. |
The pipeline pauses.
|
Additional Information
For information on troubleshooting pipeline errors and performance issues, refer to Pipeline Dashboards.
In this section
Last modified: October 8, 2024