[adapters] Delta input: revamp error handling and retry logic. #6015
Conversation
Force-pushed 0f16560 to 03479af
mythical-fred
left a comment
One blocker: please squash the commit. Dirty history is still a hard no for ready PRs.
mythical-fred
left a comment
To be explicit: the commit is 2f741c2f with subject [ci] apply automatic fixes. Please squash it into the main commit before merge.
Force-pushed 2f741c2 to 6b18d0e
It's literally a single commit 🤷‍♂️
Force-pushed 6b18d0e to e0be55e
swanandx
left a comment
I do not understand this thoroughly, but LGTM
| `checkpoint_interval` | <p>Checkpoint interval (the number of commits after which a new checkpoint should be created) for newly created Delta tables.</p><p>The option is only available when creating the Delta table (`mode = append` and there is no existing table at the target location, or `mode = truncate`). It configures the `checkpointInterval` table property.</p><p>0 means no checkpoints are created.</p><p>Default: 10.</p> |
| `max_retries` | <p>Maximum number of retries for failed Delta Lake operations, such as writing Parquet files and committing transactions.</p><p>The connector performs retries on several levels: individual S3 operations, Delta Lake transaction commits, and overall operation retries. This setting controls the overall operation retries. When a write to the table fails because of an S3 timeout, or for any other reason not resolved by lower-level retries, the connector retries the entire operation.</p><p>When not specified, the connector retries indefinitely. When set to 0, the connector doesn't retry failed operations.</p> |
| `threads` | Number of parallel threads used by the connector. Increasing this value can improve Delta Lake write throughput by enabling concurrent writes. Default: `1`. |
| | `max_retries`| | |
this seems to be a mistake
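The `max_retries` semantics described in the table above (unset means retry forever, 0 means fail immediately) could be sketched roughly as follows. This is a hypothetical helper for illustration, not the connector's actual code; the function name and shape are assumptions:

```rust
/// Retry `op` according to the `max_retries` semantics described above:
/// `None` retries forever, `Some(0)` fails on the first error, and
/// `Some(n)` allows up to `n` retries after the initial attempt.
fn retry_op<T, E>(
    max_retries: Option<u32>,
    mut op: impl FnMut() -> Result<T, E>,
) -> Result<T, E> {
    let mut attempts: u32 = 0;
    loop {
        match op() {
            Ok(v) => return Ok(v),
            Err(e) => match max_retries {
                // Retry budget exhausted: surface the last error.
                Some(limit) if attempts >= limit => return Err(e),
                // A real connector would also set health status to
                // UNHEALTHY and back off before the next attempt.
                _ => attempts += 1,
            },
        }
    }
}
```

The real connector layers this kind of loop on top of per-operation S3 retries and delta-rs commit retries, so this setting only bounds the outermost level.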
    input_stream: &mut dyn ArrowStream,
    receiver: &mut Receiver<PipelineState>,
    transaction: &Option<Option<String>>,
) -> Result<usize, String> {
why return String here? AnyError doesn't work?
AnyError would work, but we only use it to form an error message one level up the stack, so String seems more straightforward.
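The trade-off discussed in this thread (a plain `String` error versus a richer error type) could look like the sketch below. The function names are hypothetical, and `Box<dyn Error>` stands in for `AnyError`, which is assumed to be an anyhow-style boxed error:

```rust
// Hypothetical inner function: returns Result<_, String> because the
// error is only ever used to form a message one level up the stack.
fn ingest_batch() -> Result<usize, String> {
    Err("S3 timeout while reading Delta log entry".to_string())
}

// Hypothetical caller: wraps the String into a boxed error with added
// context, which is all the extra machinery an AnyError would buy here.
fn run() -> Result<usize, Box<dyn std::error::Error>> {
    ingest_batch().map_err(|msg| format!("delta input connector error: {msg}").into())
}
```

Since no caller inspects the error structurally, the `String` keeps the inner signature simpler at no loss of information.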
result.sort();
result.dedup();
can there be unexpected duplicates in result that we fail to catch now due to calling dedup?
If yes, shall we call dedup only when `inject_failure` was used?
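For context on the snippet under discussion: `Vec::dedup` only removes *consecutive* duplicates, which is why the sort comes first. A minimal sketch (the function name is hypothetical, not the connector's):

```rust
/// Sort, then dedup: after sorting, all duplicates are adjacent,
/// so dedup removes every repeated value.
fn sort_and_dedup(mut result: Vec<u64>) -> Vec<u64> {
    result.sort();
    result.dedup();
    result
}
```

The flip side, and the reviewer's concern, is that this silently absorbs *any* duplicates, expected or not, so a genuinely unexpected duplicate would no longer be observable after this call.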
Force-pushed e0be55e to 4df22f2
The connector already had retry logic in some places, but mostly relied on delta-rs for retries. This wasn't always enough and we saw timeouts and expired token errors bubbling up.

This commit adds retry loops around all object store accesses. The loops are controlled by the new `max_retries` setting, similar to the output connector. By default, it will retry forever. The retry loops set health status to UNHEALTHY while retrying. If the pipeline is stopped and restarted during a retry, the connector resumes from the last successfully ingested table version. After exhausting retry attempts the connector fails permanently with a fatal error, which eliminates the possibility of data loss.

There is an important caveat: because retries may occur after partial progress (e.g., after partially processing a Delta log entry), the same data may be ingested more than once. This is consistent with the connector's at-least-once delivery guarantee.

Signed-off-by: Leonid Ryzhyk <ryzhyk@gmail.com>
Force-pushed 4df22f2 to 544e802
The connector already had retry logic in some places, but mostly relied on delta-rs for retries. This wasn't always enough and we saw timeouts and expired token errors bubbling up.
This commit adds retry loops around all object store accesses. The loops are controlled by the new `max_retries` setting, similar to the output connector. By default, it will retry forever. The retry loops set health status to UNHEALTHY while retrying.

If the pipeline is stopped and restarted during a retry, the connector resumes from the last successfully ingested table version. After exhausting retry attempts the connector fails permanently with a fatal error, which eliminates the possibility of data loss.
There is an important caveat:
Because retries may occur after partial progress (e.g., after partially processing a Delta log entry), the same data may be ingested more than once. This is consistent with the connector’s at-least-once delivery guarantee.
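The caveat above can be illustrated with a toy model. This is not the connector's code; the function and file names are hypothetical. If a Delta log entry lists several data files and the operation fails partway through, the retry re-processes the whole entry, so rows from files read before the failure are delivered twice:

```rust
/// Toy model: ingest the data files listed in one Delta log entry,
/// optionally failing before file number `fail_after`. Successfully
/// read files are pushed to `out`, modeling rows reaching the pipeline.
fn ingest_log_entry(
    files: &[&str],
    fail_after: Option<usize>,
    out: &mut Vec<String>,
) -> Result<(), String> {
    for (i, f) in files.iter().enumerate() {
        if Some(i) == fail_after {
            // Partial progress: earlier files were already ingested.
            return Err(format!("transient error while reading {f}"));
        }
        out.push(f.to_string());
    }
    Ok(())
}
```

Retrying the entry after such a failure duplicates the files ingested before the error, which is exactly the at-least-once behavior described above.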
Describe Manual Test Plan
Tested using @anandbraman's e-commerce demo pipeline.
Checklist
Breaking Changes?
Mark if you think the answer is yes for any of these components:
Describe Incompatible Changes
In the past, if the connector wasn't able to read a table version, it signaled an error and moved on to the next version. This could cause data loss. With this change, the connector will either retry forever or fail and stop producing input after exhausting retry attempts.
The second behavioral change is that the connector can now produce duplicate inputs even without a pipeline restart.