EMR recreates resources when `ebs_config.volumes_per_instance` is greater than 1 #10446

software-opal · 2019-10-10T04:02:55Z

Community Note

Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
If you are interested in working on this issue or have submitted a pull request, please leave a comment

Terraform Version

Terraform v0.11.13

Affected Resource(s)

aws_emr_cluster

Terraform Configuration Files

resource "aws_emr_cluster" "emr_cluster" {
  name          = "my-cluster"
  release_label = "emr-5.26.0"
  applications  = ["Hadoop", "Hive", "Pig", "Hue", "Spark"]

  ebs_root_volume_size = 10

  termination_protection            = false
  keep_job_flow_alive_when_no_steps = true
  scale_down_behavior               = "TERMINATE_AT_TASK_COMPLETION"

  master_instance_group {
    instance_type = "m5.xlarge"

    ebs_config {
      size                 = 32
      type                 = "gp2"
      volumes_per_instance = 2
    }
  }

  core_instance_group {
    instance_type  = "m5.xlarge"
    instance_count = 2

    ebs_config {
      size                 = 128
      type                 = "gp2"
      volumes_per_instance = 2
    }
  }
}

Expected Behavior

Subsequent terraform apply calls should not recreate the cluster when the resource hasn't changed.

Actual Behavior

The cluster is recreated every time a terraform apply is run.

-/+ aws_emr_cluster.emr_cluster (new resource required)
...snip...
      core_instance_group.0.ebs_config.2569541827.iops:                   "0" => "0"
      core_instance_group.0.ebs_config.2569541827.size:                   "128" => "0" (forces new resource)
      core_instance_group.0.ebs_config.2569541827.type:                   "gp2" => "" (forces new resource)
      core_instance_group.0.ebs_config.2569541827.volumes_per_instance:   "1" => "0" (forces new resource)
      core_instance_group.0.ebs_config.2986691328.iops:                   "" => ""
      core_instance_group.0.ebs_config.2986691328.size:                   "" => "128" (forces new resource)
      core_instance_group.0.ebs_config.2986691328.type:                   "" => "gp2" (forces new resource)
      core_instance_group.0.ebs_config.2986691328.volumes_per_instance:   "" => "2" (forces new resource)
      core_instance_group.0.id:                                           "ig-1MCB05JHBWE0E" => <computed>
      core_instance_group.0.instance_count:                               "2" => "2"
      core_instance_group.0.instance_type:                                "m5.xlarge" => "m5.xlarge"
      core_instance_type:                                                 "m5.xlarge" => <computed>
...snip...
      master_instance_group.0.ebs_config.2636219798.iops:                 "0" => "0"
      master_instance_group.0.ebs_config.2636219798.size:                 "32" => "0" (forces new resource)
      master_instance_group.0.ebs_config.2636219798.type:                 "gp2" => "" (forces new resource)
      master_instance_group.0.ebs_config.2636219798.volumes_per_instance: "1" => "0" (forces new resource)
      master_instance_group.0.ebs_config.3054294613.iops:                 "" => ""
      master_instance_group.0.ebs_config.3054294613.size:                 "" => "32" (forces new resource)
      master_instance_group.0.ebs_config.3054294613.type:                 "" => "gp2" (forces new resource)
      master_instance_group.0.ebs_config.3054294613.volumes_per_instance: "" => "2" (forces new resource)
      master_instance_group.0.id:                                         "ig-B0L2KTB8QGNY" => <computed>
      master_instance_group.0.instance_count:                             "1" => "1"
      master_instance_group.0.instance_type:                              "m5.xlarge" => "m5.xlarge"
...snip...

Steps to Reproduce

terraform apply
terraform apply

References

Maybe related to aws_emr_cluster "New resource required" while updating instance_count inside instance_groups #5111, EMR instance gets "forces new instance" when no changes made #5075
Terraform docs: https://www.terraform.io/docs/providers/aws/r/emr_cluster.html#ebs_config-3
EMR docs for describe-cluster and list-instances

The text was updated successfully, but these errors were encountered:

admssa · 2020-02-03T14:28:01Z

the same for aws_emr_instance_group resource.
lifecycle ignore_changes has no effect.

joestump · 2020-05-20T21:59:54Z

Just spent a little time looking. Both aws_emr_instance_group and aws_emr_cluster use the flattenEBSConfig to create/set ebs_config.

Looking at the plan output I'm suspicious that scheme.NewSet and resourceAwsEMRClusterEBSConfigHash are conspiring somehow to make Terraform think the ebs_config set is "new" when it is clearly not:

core_instance_group.0.ebs_config.2569541827.iops:                   "0" => "0"
core_instance_group.0.ebs_config.2569541827.size:                   "128" => "0" (forces new resource)
core_instance_group.0.ebs_config.2569541827.type:                   "gp2" => "" (forces new resource)
core_instance_group.0.ebs_config.2569541827.volumes_per_instance:   "1" => "0" (forces new resource)
core_instance_group.0.ebs_config.2986691328.iops:                   "" => ""
core_instance_group.0.ebs_config.2986691328.size:                   "" => "128" (forces new resource)
core_instance_group.0.ebs_config.2986691328.type:                   "" => "gp2" (forces new resource)
core_instance_group.0.ebs_config.2986691328.volumes_per_instance:   "" => "2" (forces new resource)

homiakos · 2020-05-23T21:23:43Z

Is there a fix timeline or workaround?

pmacdougall · 2020-05-27T21:41:51Z

This is happening for me too using:
Terraform v0.12.25
provider.aws v2.63.0

yizhu-wish · 2020-06-03T19:43:40Z

It looks the # of volumes is hard-coded to 1?
https://github.com/terraform-providers/terraform-provider-aws/blob/master/aws/resource_aws_emr_cluster.go#L1745

deedoz · 2020-08-20T08:37:56Z

It looks the # of volumes is hard-coded to 1?
https://github.com/terraform-providers/terraform-provider-aws/blob/master/aws/resource_aws_emr_cluster.go#L1745

Agrees :
`func flattenEBSConfig(ebsBlockDevices []*emr.EbsBlockDevice) *schema.Set {

ebsConfig := make([]interface{}, 0)
for _, ebs := range ebsBlockDevices {
	ebsAttrs := make(map[string]interface{})
	if ebs.VolumeSpecification.Iops != nil {
		ebsAttrs["iops"] = int(*ebs.VolumeSpecification.Iops)
	}
	if ebs.VolumeSpecification.SizeInGB != nil {
		ebsAttrs["size"] = int(*ebs.VolumeSpecification.SizeInGB)
	}
	if ebs.VolumeSpecification.VolumeType != nil {
		ebsAttrs["type"] = *ebs.VolumeSpecification.VolumeType
	}
	ebsAttrs["volumes_per_instance"] = 1

	ebsConfig = append(ebsConfig, ebsAttrs)
}

return schema.NewSet(resourceAwsEMRClusterEBSConfigHash, ebsConfig)

}`

ebsAttrs["volumes_per_instance"] = 1

dusty73 · 2020-08-21T12:18:03Z

It looks the # of volumes is hard-coded to 1?
https://github.com/terraform-providers/terraform-provider-aws/blob/master/aws/resource_aws_emr_cluster.go#L1745

Agrees :
`func flattenEBSConfig(ebsBlockDevices []*emr.EbsBlockDevice) *schema.Set {
ebsConfig := make([]interface{}, 0)
for _, ebs := range ebsBlockDevices {
	ebsAttrs := make(map[string]interface{})
	if ebs.VolumeSpecification.Iops != nil {
		ebsAttrs["iops"] = int(*ebs.VolumeSpecification.Iops)
	}
	if ebs.VolumeSpecification.SizeInGB != nil {
		ebsAttrs["size"] = int(*ebs.VolumeSpecification.SizeInGB)
	}
	if ebs.VolumeSpecification.VolumeType != nil {
		ebsAttrs["type"] = *ebs.VolumeSpecification.VolumeType
	}
	ebsAttrs["volumes_per_instance"] = 1

	ebsConfig = append(ebsConfig, ebsAttrs)
}

return schema.NewSet(resourceAwsEMRClusterEBSConfigHash, ebsConfig)
}`

ebsAttrs["volumes_per_instance"] = 1

Yes, I think it should be:

ebsAttrs["volumes_per_instance"] = len(ebsBlockDevices)

c4po · 2020-08-26T21:58:22Z

I created a PR to fix this bug and add test case to guard this feature.

bflad · 2020-08-28T18:46:55Z

The fix for this has been merged and will release with version 3.5.0 of the Terraform AWS Provider, later next week. Thanks to @c4po for the implementation. 👍

ghost · 2020-09-03T19:32:09Z

This has been released in version 3.5.0 of the Terraform AWS provider. Please see the Terraform documentation on provider versioning or reach out if you need any assistance upgrading.

For further feature requests or bug reports with this functionality, please create a new GitHub issue following the template for triage. Thanks!

ghost · 2020-09-28T17:10:09Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thanks!

…form-provider-aws#10446

ghost added the service/emr Issues and PRs that pertain to the emr service. label Oct 10, 2019

bflad added the needs-triage Waiting for first response or review from a maintainer. label Oct 10, 2019

homiakos mentioned this issue May 23, 2020

EMR recreates resources when master_instance_group_instance_count is greater than 1 cloudposse/terraform-aws-emr-cluster#15

Closed

c4po mentioned this issue Aug 26, 2020

fix issue-10446, add test case for ebs_config.volumes_per_instance check #14858

Merged

bflad self-assigned this Aug 28, 2020

bflad added bug Addresses a defect in current functionality. and removed needs-triage Waiting for first response or review from a maintainer. labels Aug 28, 2020

bflad added this to the v3.5.0 milestone Aug 28, 2020

bflad closed this as completed in #14858 Aug 28, 2020

dusty73 mentioned this issue Sep 9, 2020

Upgrade aws provider version cloudposse/terraform-aws-emr-cluster#26

Closed

ghost locked and limited conversation to collaborators Sep 28, 2020

dusty73 pushed a commit to dusty73/terraform-aws-emr-cluster that referenced this issue Oct 15, 2020

Forced aws provider to version 3.5.0 or higher, fixes hashicorp/terra…

c42adec

…form-provider-aws#10446

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EMR recreates resources when `ebs_config.volumes_per_instance` is greater than 1 #10446

EMR recreates resources when `ebs_config.volumes_per_instance` is greater than 1 #10446

software-opal commented Oct 10, 2019

admssa commented Feb 3, 2020 •

edited

Loading

joestump commented May 20, 2020

homiakos commented May 23, 2020

pmacdougall commented May 27, 2020

yizhu-wish commented Jun 3, 2020

deedoz commented Aug 20, 2020

dusty73 commented Aug 21, 2020

c4po commented Aug 26, 2020

bflad commented Aug 28, 2020

ghost commented Sep 3, 2020

ghost commented Sep 28, 2020

EMR recreates resources when ebs_config.volumes_per_instance is greater than 1 #10446

EMR recreates resources when ebs_config.volumes_per_instance is greater than 1 #10446

Comments

software-opal commented Oct 10, 2019

Community Note

Terraform Version

Affected Resource(s)

Terraform Configuration Files

Expected Behavior

Actual Behavior

Steps to Reproduce

References

admssa commented Feb 3, 2020 • edited Loading

joestump commented May 20, 2020

homiakos commented May 23, 2020

pmacdougall commented May 27, 2020

yizhu-wish commented Jun 3, 2020

deedoz commented Aug 20, 2020

dusty73 commented Aug 21, 2020

c4po commented Aug 26, 2020

bflad commented Aug 28, 2020

ghost commented Sep 3, 2020

ghost commented Sep 28, 2020

EMR recreates resources when `ebs_config.volumes_per_instance` is greater than 1 #10446

EMR recreates resources when `ebs_config.volumes_per_instance` is greater than 1 #10446

admssa commented Feb 3, 2020 •

edited

Loading