diff --git a/applications/rag/README.md b/applications/rag/README.md index cd8c3d016..d8b09bfaf 100644 --- a/applications/rag/README.md +++ b/applications/rag/README.md @@ -117,8 +117,8 @@ gcloud container clusters get-credentials ${CLUSTER_NAME} --location=${CLUSTER_L * `KAGGLE_USERNAME` * `KAGGLE_KEY` -4. Generate vector embeddings: Run all the cells in the notebook to generate vector embeddings for the Netflix shows dataset (https://www.kaggle.com/datasets/shivamb/netflix-shows) and store them in the `pgvector` CloudSQL instance via a Ray job. - * When the last cell says the job has succeeded (eg: `Job 'raysubmit_APungAw6TyB55qxk' succeeded`), the vector embeddings have been generated and we can launch the frontend chat interface. Note that running the job can take up to 10 minutes. +4. Generate vector embeddings: Run all the cells in the notebook to generate vector embeddings for the Netflix shows dataset (https://www.kaggle.com/datasets/shivamb/netflix-shows) via a Ray job and store them in the `pgvector` CloudSQL instance. + * When the last cell succeeded, the vector embeddings have been generated and we can launch the frontend chat interface. Note that the Ray job can take up to 10 minutes to finish. * Ray may take several minutes to create the runtime environment. During this time, the job will appear to be missing (e.g. `Status message: PENDING`). * Connect to the Ray dashboard to check the job status or logs: - If IAP is disabled (`ray_dashboard_add_auth = false`): diff --git a/applications/rag/example_notebooks/rag-kaggle-ray-sql-interactive.ipynb b/applications/rag/example_notebooks/rag-kaggle-ray-sql-interactive.ipynb index 2b80e437e..7a7de79a4 100644 --- a/applications/rag/example_notebooks/rag-kaggle-ray-sql-interactive.ipynb +++ b/applications/rag/example_notebooks/rag-kaggle-ray-sql-interactive.ipynb @@ -297,7 +297,8 @@ " \n", " return results\n", " \n", - "results = ray.get(ray_data_task.remote(ds_embed))" + "results = ray.get(ray_data_task.remote(ds_embed))\n", + "ray.shutdown()" ] }, {