Skip to content
This repository has been archived by the owner on Feb 8, 2024. It is now read-only.

CORTX-33786:CORTX Deployment timeout in statefulset of cortx-server #2152

Merged
merged 2 commits into from
Aug 22, 2022

Conversation

Vinoth2101
Copy link
Contributor

@Vinoth2101 Vinoth2101 commented Aug 18, 2022

Problem:
rgw admin utility in server init containers is stuck or taking
long time around 10mins for a 5 node setup.

Solution:
This could be due to this rgw utility can connect to
the stale hax process of data pods init container.
To make sure that this utility is connected to data pods
run time hax process, added a delay of 30 secs after
"mkfs done" is completed. This improved the timings of
deployments to within 3 mins.

Signed-off-by: Vinoth.V vinoth.v@seagate.com

Testing:
Ran 10 iteration of deployment on 5 node and 9 node using custom build 2.0.0-7596-custom-ci

@cla-bot
Copy link

cla-bot bot commented Aug 18, 2022

Thanks for your contribution!
The CLA bot has flagged your contribution as not having a Contributor License Agreement
in place. Note that this is not needed in the overwhelming majority of instances and this warning will usually be ignored.
The code reviewers will make a determination and may ask you to sign a CLA or may choose to ignore this warning.
More information about this can be found here.

@cla-bot
Copy link

cla-bot bot commented Aug 19, 2022

Thanks for your contribution!
The CLA bot has flagged your contribution as not having a Contributor License Agreement
in place. Note that this is not needed in the overwhelming majority of instances and this warning will usually be ignored.
The code reviewers will make a determination and may ask you to sign a CLA or may choose to ignore this warning.
More information about this can be found here.

@vaibhavparatwar
Copy link
Contributor

retest this please

1 similar comment
@Vinoth2101
Copy link
Contributor Author

retest this please

@vaibhavparatwar
Copy link
Contributor

@Vinoth2101 I see 1N deployment is failing for this PR as well.. could you check?

@cla-bot cla-bot bot added the cla-signed label Aug 22, 2022
Problem:
rgw admin utility in server init containers is stuck or taking
long time around 10mins for a 5 node setup.

Solution:
	This could be due to this rgw utility can connect to
	the stale hax process of data pods init container.
	To make sure that this utility is connected to data pods
	run time hax process, added a delay of 30 secs after
	"mkfs done" is completed. This improved the timings of
	deployments to within 3 mins.

Signed-off-by: Vinoth.V <vinoth.v@seagate.com>
@d-nayak d-nayak merged commit f8eaf12 into Seagate:main Aug 22, 2022
@d-nayak
Copy link
Contributor

d-nayak commented Aug 22, 2022

@Vinoth2101 to raise another ticket and PR for checking for all confd's running.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants