Add map and array attribute value type support #1656

kbrockhoff · 2020-08-27T21:50:48Z

Description: Map and array attribute value types which were recently added to the specification had not been implemented in the collector yet. This PR implements across all tracing receivers and exporters.

Link to tracking Issue: #1590

Testing: Added map and array attribute values to golden dataset. All unit and correctness tests continue to pass.

Documentation: Public methods have comments.

codecov · 2020-08-27T21:57:33Z

Codecov Report

Merging #1656 into master will increase coverage by 0.22%.
The diff coverage is 95.52%.

@@            Coverage Diff             @@
##           master    #1656      +/-   ##
==========================================
+ Coverage   91.67%   91.89%   +0.22%     
==========================================
  Files         262      261       -1     
  Lines       18602    18681      +79     
==========================================
+ Hits        17053    17167     +114     
+ Misses       1117     1082      -35     
  Partials      432      432

Impacted Files	Coverage Δ
translator/trace/protospan_translation.go	`92.02% <92.36%> (+72.02%)`	⬆️
internal/goldendataset/generator_commons.go	`90.00% <100.00%> (+1.53%)`	⬆️
internal/goldendataset/resource_generator.go	`98.41% <100.00%> (+0.18%)`	⬆️
internal/goldendataset/span_generator.go	`98.95% <100.00%> (+0.02%)`	⬆️
translator/conventions/opentelemetry.go	`100.00% <100.00%> (ø)`
translator/internaldata/oc_to_resource.go	`100.00% <100.00%> (ø)`
translator/internaldata/resource_to_oc.go	`87.87% <100.00%> (-4.72%)`	⬇️
translator/internaldata/traces_to_oc.go	`87.87% <100.00%> (+5.65%)`	⬆️
translator/trace/jaeger/traces_to_jaegerproto.go	`88.36% <100.00%> (+0.12%)`	⬆️
translator/trace/zipkin/traces_to_zipkinv2.go	`93.51% <100.00%> (+0.95%)`	⬆️
... and 15 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6b4dce0...18f5ce4. Read the comment docs.

… conversions

testbed/testbed/test_case.go

translator/conventions/opentelemetry.go

translator/internaldata/oc_to_resource.go

tigrannajaryan · 2020-09-02T20:43:25Z

translator/trace/protospan_translation.go

+}
+
+// DetermineValueType returns the native OTLP attribute type the string translates to.
+func DetermineValueType(value string, omitSimpleTypes bool) pdata.AttributeValueType {


I am not sure I understand why this smartness exists in our translation. I was not aware we had this for Zipkin, and I don't understand why we extend this to other translations either.
Do you know why we had this for Zipkin and why we would want it for other formats? Jaeger has proper support for all data types we need, I don't know why we would need a smart conversion like this.

tigrannajaryan · 2020-09-02T20:51:35Z

translator/trace/protospan_translation.go

+		dest.UpsertBool(key, bVal)
+	case pdata.AttributeValueMAP:
+		var attrs map[string]interface{}
+		err := json.Unmarshal([]byte(val), &attrs)


I don't think we should try to treat every string that looks like a JSON as a JSON and convert it to a map. This is can be an unexpected behavior. The recommendation in the spec is for emitters to use JSON if they have no other way to represent structured data. But there is no recommendation in the spec for readers to try to treat it as a JSON. I do not think this is Collector's business. If backends want to do this they are free to but we are overloading Collector by doing this here.

tigrannajaryan · 2020-09-02T21:18:16Z

@kbrockhoff OK, I think I understand where the problem is coming from.

You extended the correctness tests to include array and map values and for such values to pass correctly through our translations you had to make these autodetecting translators for formats that don't support arrays and maps, which is not only Zipkin but virtually everything else.

I understand why you did it, but I believe it is based on a slightly false premise: that no matter what combination of receiver and exporter you use the Collector will guarantee that the mirroring translations are always loseless. So if you generate OTLP data, translate to Zipking, send to Collector, which will internally translate it to OTLP again, then to Zipkin again, and the testbed will translate that one last time to OTLP to compare then we expect that the data is intact.

This is only partially correct premise: we only expect this to be to for representible data. If the particular format is not capable to represent the particular data type that OTLP can then we should not have an expectation of loseless translation.

In this particular case arrays and maps are not representible anywhere expect OTLP so we do not need to test for such data for any other format except OTLP.

I believe we need to slightly modify the correctness tests to account for this and generate data that contains values that are only needed for the particular format. For OTLP we will generate the full set of possible values, for other formats we will generate a subset.

I do not suggest to do this it in this PR since it may be a bit of bigger change. What I suggest to do in this PR is to avoid generating array and map values in the correctness tests for now. Just add support for arrays and maps elsewhere (as you did) but do not make them part of the correctness test goldensets.

In a later PR we can add an extended goldenset for OTLP only.

What do you think?

tigrannajaryan · 2020-09-02T21:22:46Z

Just to clarify: I believe it is not necessary to do these complicated data type conversions for Jaeger and OC since they already have their own type system for attributes which is although not as rich as OTLP is good enough to limit our support to.

Seems the the smartness in Zipkin translations was driven by the bug fixes which I did not look into, but I am assuming it was necessary so we can keep it to avoid breaking stuff that we fixed.

kbrockhoff · 2020-09-05T17:00:53Z

It was relatively simple to change validator to support array and map comparisons with attributes where it has been converted to strings so I left the array and map attributes in the golden dataset.

kbrockhoff · 2020-09-05T17:55:35Z

I think we should support conversion of Zipkin string values to their respective types. It does make sense for it to be configurable. This can be done with a configuration parameter on the Zipkin receiver. Parameter would specify whether to leave all values as strings or do automatic conversions. If this is an agreeable approach, I will implement as part of this PR.

tigrannajaryan · 2020-09-09T16:38:16Z

I think we should support conversion of Zipkin string values to their respective types. It does make sense for it to be configurable. This can be done with a configuration parameter on the Zipkin receiver. Parameter would specify whether to leave all values as strings or do automatic conversions. If this is an agreeable approach, I will implement as part of this PR.

I am not completely sure we want to do this, but even if we do I'd prefer it to be a separate PR. Large PRs are difficult to review.

Can we go with the simpler approach where we don't have the complicated automatic type conversions first and if we see the need to have them we add them in the future?

We can also discuss in the SIG meeting.

kbrockhoff · 2020-09-09T21:31:49Z

I will change to the simpler approach

…rings

tigrannajaryan

Sorry, forgot to discuss this in the SIG meeting. I will continue replying to the comments offline but if you want to discuss this live we can setup a meeting before the next week's SIG meeting.

translator/internaldata/oc_to_resource.go

translator/internaldata/oc_to_traces.go

tigrannajaryan

Looks good to me now.
Please fix the conflict and we can merge.

tigrannajaryan · 2020-09-16T01:20:58Z

Thanks a lot @kbrockhoff !

to provide consistent naming across the code base, deprecate pusher in favor of exporter naming convention. Signed-off-by: ldelossa <ldelossa@redhat.com> Co-authored-by: Tyler Yahn <MrAlias@users.noreply.github.com>

kbrockhoff added 5 commits August 22, 2020 16:07

added support for attribute value map types

a9081ec

Merge branch 'master' into array-map-support

6db5930

fix merge error

9bc0007

Merge branch 'master' into array-map-support

fb9218d

added array attribute value support

f91d3b9

kbrockhoff requested review from bogdandrutu, dmitryax, james-bebbington, nilebox, owais, pjanotti and tigrannajaryan as code owners August 27, 2020 21:50

kbrockhoff added 8 commits August 28, 2020 08:03

improved sub map and array support

a1d9bd8

added new resource semantic attributes

2904995

Merge branch 'master' into array-map-support

0256346

added more logging to test setup failures

6380e0e

added more max time for test completion

a38ce00

added resource test for array attribute and fixed internal to OC lang…

1e807a5

… conversions

added resource tests for map attribute and large number of attributes

b89b3b8

Merge branch 'master' into array-map-support

72998eb

kbrockhoff mentioned this pull request Sep 1, 2020

Bump github.com/openzipkin/zipkin-go from 0.2.2 to 0.2.3 #1562

Closed

removed multi-level attribute arrays as per spec

bc97085

tigrannajaryan self-assigned this Sep 2, 2020

tigrannajaryan reviewed Sep 2, 2020

View reviewed changes

kbrockhoff added 2 commits September 3, 2020 07:38

merge from master

c60882e

fix PR requested changes

11acdd8

kbrockhoff added 2 commits September 5, 2020 10:10

Merge branch 'master' into array-map-support

69f4d27

revert Jaeger to OTLP translators to master branch behavior

96b0de1

kbrockhoff requested a review from tigrannajaryan September 5, 2020 17:56

kbrockhoff added 5 commits September 9, 2020 16:35

Merge branch 'master' into array-map-support

311f48e

Change Zipkin to OTLP translators to leave all attribute values as st…

b7bab37

…rings

add more tests

9e4ea6b

add more tests

54d69b8

fix asserts which should be requires

9c43dd9

tigrannajaryan reviewed Sep 10, 2020

View reviewed changes

translator/internaldata/oc_to_resource.go Outdated Show resolved Hide resolved

removed oc to otlp smart conversions

a4244c4

tigrannajaryan reviewed Sep 14, 2020

View reviewed changes

translator/internaldata/oc_to_resource.go Outdated Show resolved Hide resolved

translator/internaldata/oc_to_traces.go Outdated Show resolved Hide resolved

kbrockhoff added 2 commits September 14, 2020 15:29

Merge branch 'master' into array-map-support

98c51b5

removed oc to otlp smart conversions

7fa3ad2

tigrannajaryan approved these changes Sep 15, 2020

View reviewed changes

Merge branch 'master' into array-map-support

18f5ce4

tigrannajaryan merged commit 471c4a6 into open-telemetry:master Sep 16, 2020

chris-smith-zocdoc mentioned this pull request Oct 1, 2020

Zipkin receiver no longer converts strings to Int/Bool/Floats #1888

Closed

hughesjj pushed a commit to hughesjj/opentelemetry-collector that referenced this pull request Apr 27, 2023

Update core/contrib deps to v0.53.0 (open-telemetry#1656)

9d952a8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add map and array attribute value type support #1656

Add map and array attribute value type support #1656

kbrockhoff commented Aug 27, 2020

codecov bot commented Aug 27, 2020 •

edited

Loading

tigrannajaryan Sep 2, 2020

tigrannajaryan Sep 2, 2020

tigrannajaryan commented Sep 2, 2020

tigrannajaryan commented Sep 2, 2020

kbrockhoff commented Sep 5, 2020

kbrockhoff commented Sep 5, 2020

tigrannajaryan commented Sep 9, 2020

kbrockhoff commented Sep 9, 2020

tigrannajaryan left a comment

tigrannajaryan left a comment

tigrannajaryan commented Sep 16, 2020

Add map and array attribute value type support #1656

Add map and array attribute value type support #1656

Conversation

kbrockhoff commented Aug 27, 2020

codecov bot commented Aug 27, 2020 • edited Loading

Codecov Report

tigrannajaryan Sep 2, 2020

Choose a reason for hiding this comment

tigrannajaryan Sep 2, 2020

Choose a reason for hiding this comment

tigrannajaryan commented Sep 2, 2020

tigrannajaryan commented Sep 2, 2020

kbrockhoff commented Sep 5, 2020

kbrockhoff commented Sep 5, 2020

tigrannajaryan commented Sep 9, 2020

kbrockhoff commented Sep 9, 2020

tigrannajaryan left a comment

Choose a reason for hiding this comment

tigrannajaryan left a comment

Choose a reason for hiding this comment

tigrannajaryan commented Sep 16, 2020

codecov bot commented Aug 27, 2020 •

edited

Loading