
[Feature Request]: Support draining Spanner Change Stream connectors #30167

Open
ianb-pomelo opened this issue Jan 31, 2024 · 11 comments

@ianb-pomelo

What would you like to happen?

Right now, one of the known limitations of the Spanner change stream source is that it can't be drained. Is there a way to allow draining this connector?

Our current use case is a job that consumes change stream values, but the structure of this job changes frequently. To handle this, we try to do in-place updates and, if those fail, drain and start a new job. This works with Pub/Sub sources, but to get around the fact that change streams can't be drained, we have an intermediate job that converts the Spanner changes into Pub/Sub messages, which the frequently changing job then consumes. However, this has caused a huge increase in latency: the commit time -> change stream read latency is pretty consistently ~200 ms, but adding the Pub/Sub layer increases it to ~5 s.
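
For reference, a minimal sketch of the kind of change stream consumer described above, using the Beam Java SDK (project, instance, database, metadata database, and stream names are all placeholders):

```java
// Minimal sketch of a Spanner change stream pipeline; all IDs are placeholders.
import com.google.cloud.Timestamp;
import org.apache.beam.sdk.Pipeline;
import org.apache.beam.sdk.io.gcp.spanner.SpannerConfig;
import org.apache.beam.sdk.io.gcp.spanner.SpannerIO;
import org.apache.beam.sdk.options.PipelineOptionsFactory;

public class ChangeStreamPipeline {
  public static void main(String[] args) {
    Pipeline pipeline = Pipeline.create(PipelineOptionsFactory.fromArgs(args).create());

    SpannerConfig spannerConfig =
        SpannerConfig.create()
            .withProjectId("my-project")
            .withInstanceId("my-instance")
            .withDatabaseId("my-database");

    pipeline.apply(
        "ReadChangeStream",
        SpannerIO.readChangeStream()
            .withSpannerConfig(spannerConfig)
            .withChangeStreamName("my_change_stream")
            .withMetadataDatabase("my-metadata-database")
            .withInclusiveStartAt(Timestamp.now()));
    // ... downstream transforms and sink (the part of the job that changes often) ...

    pipeline.run();
  }
}
```

Draining a pipeline built around this read is what currently isn't supported; cancel or in-place update are the only options.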

Issue Priority

Priority: 2 (default / most feature requests should be filed as P2)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam YAML
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
@liferoad
Collaborator

liferoad commented Feb 1, 2024

cc @nielm

@Abacn
Contributor

Abacn commented Feb 1, 2024

Draining streaming pipelines should work in general. If this source isn't draining, something is missing in the SDF; a fix similar to #25716 may work.
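
For context (illustrative only, not SpannerIO code and not the actual change in #25716): Dataflow's drain relies on splittable DoFns being able to truncate their remaining restrictions, roughly like this:

```java
// Illustrative sketch of the drain hook on a splittable DoFn.
import org.apache.beam.sdk.io.range.OffsetRange;
import org.apache.beam.sdk.transforms.DoFn;
import org.apache.beam.sdk.transforms.splittabledofn.RestrictionTracker;

class ExampleSdf extends DoFn<String, String> {

  @GetInitialRestriction
  public OffsetRange getInitialRestriction(@Element String element) {
    return new OffsetRange(0, Long.MAX_VALUE); // effectively unbounded work
  }

  // Invoked when the pipeline is being drained: returning an empty range lets
  // the SDF finish instead of running forever, which is what drain needs.
  @TruncateRestriction
  public RestrictionTracker.TruncateResult<OffsetRange> truncateRestriction(
      @Restriction OffsetRange range) {
    return RestrictionTracker.TruncateResult.of(
        new OffsetRange(range.getFrom(), range.getFrom()));
  }

  @ProcessElement
  public ProcessContinuation processElement(
      @Element String element,
      RestrictionTracker<OffsetRange, Long> tracker,
      OutputReceiver<String> out) {
    // ... repeatedly tracker.tryClaim(offset) and out.output(...) ...
    return ProcessContinuation.resume();
  }
}
```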

@nielm
Contributor

nielm commented Feb 1, 2024

cc @thiagotnunes (change streams author) for comment, but I believe ChangeStreams does not use SDFs, which is probably why the drain is not working.

The partitions are generated by Spanner itself and are read by a normal DoFn. (SpannerIO:1751)

@thiagotnunes
Contributor

cc @nancyxu123 , current owner here

@efalkenberg

Hey @ianb-pomelo

Thanks for the feedback!
Draining is something that we have in our backlog, but not prioritized yet.
I really appreciate the context you provided; I'll add it to our internal ticket, and we'll update here when this gets prioritized.

Thanks!

Eike

@ianb-pomelo
Author

Thanks for the update, looking forward to seeing it prioritized!

@bangau1

bangau1 commented Sep 27, 2024

Hi all,

I also recently experimented a bit with SpannerIO's change stream to Cloud Storage (using the template provided by Google: https://cloud.google.com/dataflow/docs/guides/templates/provided/cloud-spanner-change-streams-to-cloud-storage). I dug into whatever documentation I could find and realized that the drain operation isn't supported, but I can confirm that updating in place works.
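
For anyone following the same path, the in-place update that does work is a regular Dataflow streaming update, i.e. relaunching the pipeline with the same job name and the update flag set. A rough sketch (job name is a placeholder):

```java
// Rough sketch of an in-place Dataflow update; the job name is a placeholder.
import org.apache.beam.runners.dataflow.options.DataflowPipelineOptions;
import org.apache.beam.sdk.Pipeline;
import org.apache.beam.sdk.options.PipelineOptionsFactory;

public class UpdateInPlace {
  public static void main(String[] args) {
    DataflowPipelineOptions options =
        PipelineOptionsFactory.fromArgs(args).as(DataflowPipelineOptions.class);
    options.setJobName("spanner-changestream-to-gcs"); // must match the running job
    options.setUpdate(true); // replace the running job in place instead of starting a new one

    Pipeline pipeline = Pipeline.create(options);
    // ... same change stream pipeline definition as the running job ...
    pipeline.run();
  }
}
```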

The other thing I found is that while cancelling the job works, submitting another job with the same job name and the same metadata table name doesn't work. I expected it to be able to continue ingesting the change stream from the previous checkpoint (that's what the metadata table is for, correct me if I'm wrong?).

I asked about the details on Stack Overflow here: https://stackoverflow.com/questions/79027920/restarting-the-spannerios-changestream-to-gcs-text-json-pipeline-got-error

@Abacn
Contributor

Abacn commented Oct 1, 2024

submitting another job with the same job name and the same metadata table name doesn't work.

This is working as intended. Dataflow cannot have two jobs with the same job name unless one is in Done status (not running, cancelling, draining, etc)

@bangau1

bangau1 commented Oct 3, 2024

submitting another job with the same job name and the same metadata table name doesn't work.

This is working as intended. Dataflow cannot have two jobs with the same job name unless one is in Done status (not running, cancelling, draining, etc)

@Abacn I meant that I cancelled it (stopped it), then proceeded to submit a new pipeline with the same jobName and metadata table. But it returned an error.

@Abacn
Contributor

Abacn commented Nov 4, 2024

@Abacn I meant that I cancelled it (stopped it), then proceeded to submit a new pipeline with the same jobName and metadata table. But it returned an error.

It takes time for a job to move to "Done" status, usually minutes or longer.

@bangau1

bangau1 commented Nov 5, 2024

@Abacn I meant that I cancelled it (stopped it), then proceeded to submit a new pipeline with the same jobName and metadata table. But it returned an error.

It takes time for a job to move to "Done" status, usually minutes or longer.

@Abacn just to clarify in case my comment wasn't clear: I submitted the second job with the same jobName once the previous job was completely stopped (cancelled). The second job got an error.

Should I submit a separate issue for this? I already asked on Stack Overflow, including the error that shows up, etc.
