Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[marketplace] Kubeflow Pipelines doesn't work on Kubernetes 1.19 or above #5663

Closed
revathijay opened this issue May 18, 2021 · 5 comments
Closed
Labels

Comments

@revathijay
Copy link

What steps did you take

Tried out the default kubeflow pipeline for TFX taxi prediction model in GCP marketplace Kubeflow pipeline version 1.4.1

What happened:

The error message for the csvexamplegen step is as follows.
This step is in Error state with this message: failed to save outputs: Error response from daemon: No such container: b715c336e667d305429003d1ce1c9b795ced595aa76b2bf252c679d2947b3643

Screen Shot 2021-05-18 at 1 07 09 pm

What did you expect to happen:

The pipeline was getting executed end to end earlier . The main container itself for the first step is getting executed successfully. But its failing at the wait container stage.

Environment:

  • How do you deploy Kubeflow Pipelines (KFP)?
  • GCP AI pipeline V(1.4.1)
  • KFP version:
  • KFP SDK version:

Anything else you would like to add:

Labels


Impacted by this bug? Give it a 👍. We prioritise the issues with the most 👍.

@Ark-kun
Copy link
Contributor

Ark-kun commented May 18, 2021

Do any other pipelines run successfully?

@carlosbertoncelli
Copy link

carlosbertoncelli commented May 19, 2021

I had the same issue (with a custom pipeline), this started to happen after i launched a large number of runs and i had to stop the workloads from the GKE panel because the Kubeflow Pipelines dashboard wasnt responding.

Right now i cant run any kind of pipeline, not even the examples

@carlosbertoncelli
Copy link

I tried to reinstall the KFP using GCP Marketplace and didnt worked, i tried to delete the cluster and recreate everything from scratch and the KFP doesnt work anymore (i've also deleted the artifacts from GCP Bucket). Do you have any clues @Ark-kun ?

I've never experienced this issue before

@zijianjoy
Copy link
Collaborator

Hello @cabjr , Kubeflow Pipelines doesn't work on Kubernetes 1.19 or above, would you like to confirm the k8s version of your Kubeflow cluster? If it is the case, you might need to create a new cluster with K8s 1.18, and deploy Kubeflow pipelines on it.

Reference: #1654

@zijianjoy zijianjoy changed the title [bug] <TFX - Taxi tip prediction model trainer Not working anymore > [marketplace] Kubeflow Pipelines doesn't work on Kubernetes 1.19 or above May 21, 2021
@revathijay
Copy link
Author

I finally got it working. it was because the creation of the cluster was blocked from the marketplace page. And I had to tweak the creation of the cluster manually with the rt set of options and the pipeline started working.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants