
ECS Service always wants to be recreated due to capacity provider. #22823

Open
spatel96 opened this issue Jan 28, 2022 · 17 comments
Labels
regression Pertains to a degraded workflow resulting from an upstream patch or internal enhancement. service/ecs Issues and PRs that pertain to the ecs service.

Comments


spatel96 commented Jan 28, 2022

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or other comments that do not add relevant new information or questions, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

Terraform CLI and Terraform AWS Provider Version

$ terraform -v
Terraform v0.13.6
+ provider.aws v3.73.0

Affected Resource(s)

  • aws_ecs_service

Terraform Configuration Files

Terraform Plan:

  # module.my_service.aws_ecs_service.ecs_service must be replaced
+/- resource "aws_ecs_service" "ecs_service" {
        cluster                            = "arn:aws:ecs:us-west-1:***:cluster/ecs-related-tapir"
        deployment_maximum_percent         = 200
        deployment_minimum_healthy_percent = 100
        desired_count                      = 2
        enable_ecs_managed_tags            = false
        enable_execute_command             = false
        health_check_grace_period_seconds  = 120
      ~ iam_role                           = "aws-service-role" -> (known after apply)
      ~ id                                 = "arn:aws:ecs:us-west-1:***:service/my-cluster/my-service-5e" -> (known after apply)
      ~ launch_type                        = "EC2" -> (known after apply)
        name                               = "my-service-service-5e"
      + platform_version                   = (known after apply)
      - propagate_tags                     = "NONE" -> null
        scheduling_strategy                = "REPLICA"
      - tags                               = {} -> null
      ~ tags_all                           = {} -> (known after apply)
      ~ task_definition                    = "arn:aws:ecs:us-west-1:***:task-definition/my-service-:23" -> "arn:aws:ecs:us-west-1:***:task-definition/my-service:1"
        wait_for_steady_state              = false

      + capacity_provider_strategy { # forces replacement
          + base              = 0
          + capacity_provider = "ecs-capacity-provider-related-tapir"
          + weight            = 100
        }

        deployment_controller {
            type = "CODE_DEPLOY"
        }

        load_balancer {
            container_name   = "my-service"
            container_port   = 7171
            target_group_arn = "arn:aws:elasticloadbalancing:us-west-1:***:targetgroup/abcdef/abcdef"
        }
    }

Plan: 1 to add, 0 to change, 1 to destroy.

Terraform Apply error:

Error: error creating ECS service (my-service): InvalidParameterException: Creation of service was not idempotent.

Expected Behavior

No infrastructure changes should be made

Actual Behavior

The ECS service resource is planned for recreation, but the apply fails with the error shown above.

Steps to Reproduce

  1. Provision an ECS service with a capacity provider
  2. terraform apply
@github-actions github-actions bot added needs-triage Waiting for first response or review from a maintainer. service/ecs Issues and PRs that pertain to the ecs service. labels Jan 28, 2022
@breathingdust breathingdust added the regression Pertains to a degraded workflow resulting from an upstream patch or internal enhancement. label Feb 3, 2022
@justinretzolk justinretzolk removed the needs-triage Waiting for first response or review from a maintainer. label Mar 15, 2022

gvwirth commented Apr 14, 2022

FYI we are still seeing this bug in the provider version 4.9.

@anGie44 anGie44 self-assigned this Apr 25, 2022

anGie44 commented Apr 25, 2022

Possibly related to existing issue: #2283 (destroy/create behavior)

Correction: since the update was not expected behavior, I'm guessing the capacity_provider_strategy is inherited from the aws_ecs_cluster where it is defined. Do you mind confirming, @spatel96?


a-nych commented May 5, 2022

This issue is very destructive.

When an ECS cluster has a default_capacity_provider_strategy defined, Terraform will mark every service for recreation unless it sets:

  lifecycle {
    ignore_changes = [
      capacity_provider_strategy
    ]
  }
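A minimal sketch of that workaround in context (all resource names here are hypothetical):

```hcl
resource "aws_ecs_service" "app" {
  name            = "my-app"
  cluster         = aws_ecs_cluster.main.id
  task_definition = aws_ecs_task_definition.app.arn
  desired_count   = 2

  # Without this, the strategy inherited from the cluster's
  # default_capacity_provider_strategy surfaces as a new
  # capacity_provider_strategy block and forces replacement.
  lifecycle {
    ignore_changes = [
      capacity_provider_strategy,
    ]
  }
}
```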

@nitrocode
Contributor

The only differences I can see when comparing capacity_provider_strategy and deployment_controller are MaxItems and DiffSuppressFunc. I wonder if that is what's causing this recreation... I would have thought that removing the ForceNew would have also stopped capacity_provider_strategy from forcing recreation...

"deployment_controller": {
    Type:     schema.TypeList,
    Optional: true,
    MaxItems: 1,
    // Ignore missing configuration block
    DiffSuppressFunc: func(k, old, new string, d *schema.ResourceData) bool {
        if old == "1" && new == "0" {
            return true
        }
        return false
    },
    Elem: &schema.Resource{

"capacity_provider_strategy": {
    Type:     schema.TypeSet,
    Optional: true,
    Elem: &schema.Resource{

anGie44 commented May 26, 2022

Hi @nitrocode, thanks for looking through the code! My initial thinking was that @spatel96 is using both the aws_ecs_capacity_provider and aws_ecs_service resources. While capacity_provider_strategy is not explicitly configured in the aws_ecs_service configuration, the value is inherited from the separate aws_ecs_capacity_provider resource after an initial terraform apply, so the next plan or apply shows that diff (though this is still just conjecture, as the original configuration is not yet known). That diff is then handled by this portion of the code:

func capacityProviderStrategyCustomizeDiff(_ context.Context, d *schema.ResourceDiff, meta interface{}) error {
    // to be backward compatible, should ForceNew almost always (previous behavior), unless:
    // force_new_deployment is true and
    // neither the old set nor new set is 0 length
    if v := d.Get("force_new_deployment").(bool); !v {
        return capacityProviderStrategyForceNew(d)
    }
    old, new := d.GetChange("capacity_provider_strategy")
    ol := old.(*schema.Set).Len()
    nl := new.(*schema.Set).Len()
    if (ol == 0 && nl > 0) || (ol > 0 && nl == 0) {
        return capacityProviderStrategyForceNew(d)
    }
    return nil
}
which forces the new resource. The logic needs to either account for cases where the provider strategy is inherited from outside the configuration, or simply mark capacity_provider_strategy as Computed so that the diff is ignored.
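To illustrate the conjecture above, here is a minimal configuration (all names hypothetical, using the provider 3.x-era cluster arguments) that can reproduce the diff:

```hcl
resource "aws_ecs_cluster" "main" {
  name               = "example"
  capacity_providers = [aws_ecs_capacity_provider.ec2.name]

  # Services without an explicit strategy inherit this default, so after
  # the first apply AWS reports it back and Terraform sees a new
  # capacity_provider_strategy block, which forces replacement.
  default_capacity_provider_strategy {
    capacity_provider = aws_ecs_capacity_provider.ec2.name
    weight            = 100
  }
}

# No capacity_provider_strategy configured here.
resource "aws_ecs_service" "app" {
  name            = "example"
  cluster         = aws_ecs_cluster.main.id
  task_definition = aws_ecs_task_definition.app.arn
  desired_count   = 2
}
```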


relsqui commented Aug 16, 2022

I was seeing this same issue and can confirm that adding a capacity_provider_strategy block in my aws_ecs_service, duplicating my default_capacity_provider_strategy, resolved it.
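A sketch of that fix, assuming the cluster's default is a single capacity provider (names hypothetical):

```hcl
resource "aws_ecs_service" "app" {
  name            = "example"
  cluster         = aws_ecs_cluster.main.id
  task_definition = aws_ecs_task_definition.app.arn
  desired_count   = 2

  # Explicitly mirror the cluster's default_capacity_provider_strategy so
  # the value AWS reports back matches the configuration and no diff appears.
  capacity_provider_strategy {
    capacity_provider = aws_ecs_capacity_provider.ec2.name
    base              = 0
    weight            = 100
  }
}
```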

@ericdahl

This has been a big annoyance for us. We have many production ECS Services that are using LaunchType: EC2 and we'd like to convert them to using a newly defined default Capacity Provider strategy on the cluster.

If we simply set the capacity provider, it will force the re-create of the ECS Service leading to temporary disruption/downtime. This isn't necessary as AWS supports the graceful transition of LaunchType: EC2 to Capacity Provider (but not the other way around). It does a "force new deployment" of the ECS Tasks, but it uses the standard ECS rollout mechanism (e.g., minHealthy) so there's no disruption.

Our current workaround is to use ignore_changes as above, plus converting ECS services to the capacity provider via separate CLI-based automation.

(Also, tangentially related is #26533 - for transitioning existing ECS Services to use the Cluster's default capacity provider strategy)

@remil1000

If I may add, supporting an empty capacity_provider_strategy list could also be useful.
It seems this support was added to the AWS CLI and API (aws/containers-roadmap#838 (comment)), so that

$ aws ecs update-service --cluster cluster-name --service service-name --capacity-provider-strategy '[]' --force-new-deployment

removes the strategy from an ECS service (when inherited from the default defined at the ECS cluster level), which is useful if you're planning to remove the default capacity provider strategy from the ECS cluster.

Currently, if no capacity_provider_strategy is defined in the aws_ecs_service resource, the AWS API call will not set any value and the default strategy will be used.


vishwa-trulioo commented Feb 18, 2023

It's sad to see that it's been over a year and this still isn't fixed. :-( AWS has to do a better job than this if they want people to keep using ECS and keep it alive.

@bbratchiv

Any updates on this? I see the PR is still pending.

@rmccarthy-ellevation

Any update on this?


1oglop1 commented Oct 15, 2023

@breathingdust Hi, is this something you can look into? The AWS side has been fixed, and Terraform now incorrectly forces replacement.

@claudiosf

Issue still exists.


Luis-3M commented Nov 23, 2023

Issue still exists.

Yep, we're facing the same problem too.

akuzminsky added a commit to infrahouse/terraform-aws-ecs that referenced this issue Dec 5, 2023
It's a workaround for a bug:
hashicorp/terraform-provider-aws#22823

Terraform never converged and wanted to re-create the service.
```
Terraform used the selected providers to generate the following execution plan. Resource actions are indicated with the following symbols:
-/+ destroy and then create replacement

Terraform will perform the following actions:

  # module.test.aws_ecs_service.ecs must be replaced
-/+ resource "aws_ecs_service" "ecs" {
      - health_check_grace_period_seconds  = 0 -> null
      ~ iam_role                           = "/aws-service-role/ecs.amazonaws.com/AWSServiceRoleForECS" -> (known after apply)
      ~ id                                 = "arn:aws:ecs:us-east-2:303467602807:service/test-terraform-aws-ecs/test-terraform-aws-ecs" -> (known after apply)
      + launch_type                        = (known after apply)
        name                               = "test-terraform-aws-ecs"
      + platform_version                   = (known after apply)
      - propagate_tags                     = "NONE" -> null
      - tags                               = {} -> null
      ~ triggers                           = {} -> (known after apply)
        # (10 unchanged attributes hidden)

      - capacity_provider_strategy { # forces replacement
          - base              = 1 -> null
          - capacity_provider = "test-terraform-aws-ecs" -> null
          - weight            = 100 -> null
        }

      - deployment_circuit_breaker {
          - enable   = false -> null
          - rollback = false -> null
        }

      - deployment_controller {
          - type = "ECS" -> null
        }

        # (1 unchanged block hidden)
    }
```
@harbinder-kleene

When will the fix be released? It is affecting my team too.

@ZilvinasKucinskas

+1

This is a major issue. We are running many FARGATE instances and would like to increase capacity further by adding FARGATE_SPOT instances. However, it is not possible to do this without downtime (it destroys the whole ECS service and recreates it).


dejanzele commented Sep 24, 2024

Hi all,

I am interested in submitting a fix for this issue, as it is impacting our internal usage as well.

Is the community in agreement on the latest requirements for how the update should work? A couple of ideas are mentioned in the comments above.
