Deploy Not Completing #986

davidsmejia · 2023-03-06T21:03:02Z

Context

Recently ran a deploy to production to verify if cron jobs were being correctly initiated on deploy and confirmed they are not.
This is most likely due to the api-server-instance-user-data.tpl.sh script not completing or errorring out at some point during execution. Additionally the api docker image did not start on deploy.

In order to manually fix:

sudo docker remove resources_portal_api
sudo ./start_api_with_migrations.sh
manually add back the cron jobs defined at the bottom of api-server-instance-user-data.sh

Solution or next step

Determine if the api-server-instance-user-data script is indeed erroring out and where
apply a fix and redeploy

The text was updated successfully, but these errors were encountered:

davidsmejia · 2023-05-19T20:48:01Z

This issue is currently happening on staging though now it is erroring out at around apt update.

arkid15r · 2023-06-07T15:02:01Z

After inspecting the logs I came to a conclusion that the issue could be caused by a transient network error (Network is unreachable) or/and deb package mirrors error (Connection refused, Service Unavailable). This resulted in an incomplete package installation making awscli and certbot unavailable.

I ran the following commands to return the box into a usable state:

rm /var/log/cloud-init.log \
&& rm -rf /var/lib/cloud/* \
&& cloud-init -d init \
&& cloud-init -d modules --mode final

arkid15r · 2023-06-08T19:01:29Z

In order to get this closed we need to make sure the deploy process works fine. To do that the 1password issue needs to be resolved first.

davidsmejia added the High Priority label Mar 6, 2023

davidsmejia mentioned this issue Mar 6, 2023

Double check crontab on next deploy #971

Closed

davidsmejia assigned arkid15r May 23, 2023

arkid15r assigned davidsmejia Jun 8, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deploy Not Completing #986

Deploy Not Completing #986

davidsmejia commented Mar 6, 2023

davidsmejia commented May 19, 2023

arkid15r commented Jun 7, 2023

arkid15r commented Jun 8, 2023

Deploy Not Completing #986

Deploy Not Completing #986

Comments

davidsmejia commented Mar 6, 2023

Context

Solution or next step

davidsmejia commented May 19, 2023

arkid15r commented Jun 7, 2023

arkid15r commented Jun 8, 2023