Add bounding box #219

peterdesmet · 2022-08-19T14:25:01Z

See discussion in #203. Best solution was to add bounding box as a property of the observation. What isn't defined is the expected format.

ddachs · 2022-09-28T09:40:57Z

I just want to confirm the need for the bounding box info in the observations.csv table. I guess the format is rather secondary, as long as it is defined, because you can easily transform the coordinates.

peterdesmet · 2023-01-26T13:28:37Z

Discussed with @kbubnicki:

- Name term boundingBox (most recognizable)
- Insert right after mediaID (to zoom in further)
- Only use it for media-observations table (not event-observations)
- Definition to be provided
- Recommended format to be provided

ddachs · 2023-01-26T13:40:34Z

I recommend using the YOLO format.
That way the coordinates are independent of the image size (which can vary).
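
To illustrate why normalized coordinates are independent of image size, here is a minimal sketch that converts a pixel-based box to YOLO-style values. The function name `to_yolo` and the pixel inputs are assumptions for illustration and are not part of any proposal in this thread.

```python
def to_yolo(x_min, y_min, box_w, box_h, img_w, img_h):
    """Convert a pixel box (top-left corner + size) to YOLO-style
    (x_center, y_center, width, height), each relative to the image size."""
    x_center = (x_min + box_w / 2) / img_w
    y_center = (y_min + box_h / 2) / img_h
    return (x_center, y_center, box_w / img_w, box_h / img_h)


# The same relative box at two different resolutions yields identical values.
small = to_yolo(100, 50, 200, 100, img_w=800, img_h=400)
large = to_yolo(200, 100, 400, 200, img_w=1600, img_h=800)
print(small, large)
```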

peterdesmet · 2023-02-06T09:57:09Z

@kbubnicki for Agouti, we would like the bounding box field to also support the [x,y] position of animals. I guess that should be possible in YOLO format ([x_center, y_center, width, height]) by expressing it as x, y, 0, 0?

peterdesmet · 2023-05-24T17:14:55Z

@danstowell in reply to #314 (comment), if you want to classify a media file containing 3 sparrows with bounding boxes, you would have the following 3 observations:

| observationID | mediaID | scientificName | start | end | boundingBox |
| --- | --- | --- | --- | --- | --- |
| obs1 | med1 | Passer domesticus | 2020-08-02T05:00:15Z | 2020-08-02T05:00:15Z | `[x1, y1, width1, height1]` |
| obs2 | med1 | Passer domesticus | 2020-08-02T05:00:15Z | 2020-08-02T05:00:15Z | `[x2, y2, width2, height2]` |
| obs3 | med1 | Passer domesticus | 2020-08-02T05:00:15Z | 2020-08-02T05:00:15Z | `[x3, y3, width3, height3]` |

kbubnicki · 2023-05-25T07:26:33Z

Alternatively, we could store the bounding box data in 4 separate columns, thus enforcing exactly one bounding box per observation row:

| observationID | mediaID | scientificName | start | end | bboxX | bboxY | bboxWidth | bboxHeight |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| obs1 | med1 | Passer domesticus | 2020-08-02T05:00:15Z | 2020-08-02T05:00:15Z | x1 | y1 | width1 | height1 |
| obs2 | med1 | Passer domesticus | 2020-08-02T05:00:15Z | 2020-08-02T05:00:15Z | x2 | y2 | width2 | height2 |
| obs3 | med1 | Passer domesticus | 2020-08-02T05:00:15Z | 2020-08-02T05:00:15Z | x3 | y3 | width3 | height3 |

@danstowell I remember your comment about storing structured data within a CSV cell. What do you think?

kbubnicki · 2023-05-25T10:00:08Z

The format would be:

[
    {
        "name": "bboxX",
        "description": "The relative X coordinate of a bounding box center, normalized to the image width.",
        "type": "number",
        "constraints": {
            "required": false,
            "minimum": 0,
            "maximum": 1
        },
        "example": 0.5
    },
    {
        "name": "bboxY",
        "description": "The relative Y coordinate of a bounding box center, normalized to the image height.",
        "type": "number",
        "constraints": {
            "required": false,
            "minimum": 0,
            "maximum": 1
        },
        "example": 0.5
    },
    {
        "name": "bboxWidth",
        "description": "The relative width of a bounding box, normalized to the image width.",
        "type": "number",
        "constraints": {
            "required": false,
            "minimum": 0,
            "maximum": 1
        },
        "example": 0.5
    },
    {
        "name": "bboxHeight",
        "description": "The relative height of a bounding box, normalized to the image height.",
        "type": "number",
        "constraints": {
            "required": false,
            "minimum": 0,
            "maximum": 1
        },
        "example": 0.5
    }
]

This is the YOLO format (also suggested by @ddachs). The advantage of this format (i.e. coordinates of the center instead of e.g. the upper-left corner) is that the bboxX and bboxY columns can store the relative position of an animal in an image (e.g. estimated using image-calibration methods for distance sampling applications) without defining an entire bounding box. In that case bboxWidth and bboxHeight are simply zero.
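
As a rough sketch of how a consumer might tell position-only rows from full boxes under this center-based proposal, the snippet below reads a small CSV and checks for zero width and height. The sample data and the `is_point` helper are hypothetical and only serve as illustration.

```python
import csv
import io

# Hypothetical sample data using the four proposed columns (center-based).
SAMPLE = """observationID,bboxX,bboxY,bboxWidth,bboxHeight
obs1,0.5,0.5,0.2,0.4
obs2,0.3,0.7,0,0
"""


def is_point(row):
    """True when the row stores only a position (zero width and height)."""
    return float(row["bboxWidth"]) == 0 and float(row["bboxHeight"]) == 0


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
kinds = {row["observationID"]: "point" if is_point(row) else "box" for row in rows}
print(kinds)
```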

peterdesmet · 2023-05-25T11:06:30Z

I like that approach.

danstowell · 2023-05-25T11:37:05Z

Yes, this is indeed a bit clearer. I wasn't planning to comment on that aspect though, because I don't know which of those two options (i.e. single compound column, or separated into columns) will be easier for your target users to produce/consume. If it matches YOLO format then that's an argument in support of it.

Within AudioVisual Core we specified something similar except it was a top-left corner. I rather wish the centrepoint had been an option we considered, since it has some handy properties. (I note also that in AC, zero-sized rectangles are explicitly disallowed, though zero-sized circles are to be used instead! So that's compatible.)

peterdesmet · 2023-05-25T16:23:00Z

Thanks @danstowell! Given that AudioVisual Core adopted top-left corner we might consider that too ... so we can reference the terms?

bboxX -- skos:exactMatch --> http://rs.tdwg.org/ac/terms/xFrac
bboxY -- skos:exactMatch --> http://rs.tdwg.org/ac/terms/yFrac

@danstowell @kbubnicki Or would you advise against that?

Note: the advantage of splitting into columns is that validation is easier to write (e.g. x should be between 0 and 1).
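
A minimal sketch of what such per-column validation could look like, assuming the four column names proposed earlier in this thread and values read as strings from a CSV. The `validate_bbox` helper is hypothetical; in practice a Table Schema validator would apply the `minimum`/`maximum` constraints instead.

```python
def validate_bbox(row):
    """Return a list of error messages for invalid bounding box values."""
    errors = []
    for field in ("bboxX", "bboxY", "bboxWidth", "bboxHeight"):
        value = row.get(field)
        if value is None or value == "":
            continue  # the fields are not required, so empty is fine
        try:
            number = float(value)
        except ValueError:
            errors.append(f"{field}: not a number ({value!r})")
            continue
        if not 0 <= number <= 1:
            errors.append(f"{field}: {number} is outside [0, 1]")
    return errors


print(validate_bbox({"bboxX": "0.5", "bboxY": "1.2", "bboxWidth": "", "bboxHeight": "abc"}))
```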

peterdesmet · 2023-05-26T09:40:55Z

@danstowell @baskaufs I'd like to know how we should reference the AC terms and how important the AC Notes are.

For example, our bboxWidth follows the definition of http://rs.tdwg.org/ac/terms/widthFrac exactly:

The width of the bounding rectangle, expressed as a decimal fraction of the width of the media item.

But we might allow 0 widths, which contradicts the notes of http://rs.tdwg.org/ac/terms/widthFrac:

Zero-sized bounding rectangles are not allowed. To designate a point, use the radius option with a zero value.

Is our bboxWidth then still an exact match or is it broader (because we allow more)?

peterdesmet · 2023-05-26T12:37:27Z

Update based on #323

- We have now adopted the top-left corner rather than the center. This aligns with the MegaDetector format and AC.
- We no longer allow 0 values.
- AC terms are broader than Camtrap DP terms, because the bounding boxes should encompass observed individuals, not just any object.
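
For data already stored in the center-based (YOLO-style) form discussed earlier, converting to the top-left convention is a small shift by half the width and height. The sketch below assumes normalized values; the function name `center_to_top_left` is hypothetical.

```python
def center_to_top_left(x_center, y_center, width, height):
    """Convert a normalized center-based box to a normalized top-left-based box.

    Width and height are unchanged; only the reference point moves.
    """
    return (x_center - width / 2, y_center - height / 2, width, height)


print(center_to_top_left(0.5, 0.5, 0.25, 0.5))
```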

baskaufs · 2023-05-26T20:33:44Z

@peterdesmet Cool. Prior to adopting the AC terms, we looked at a number of systems for defining bounding boxes. Most (nearly all?) had 0,0 as the upper left corner. So following that convention simplifies the conversion to other systems.
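
As an illustration of that conversion, the sketch below turns a normalized top-left box into pixel corner coordinates, with 0,0 as the upper left corner of the image. The function name `to_pixel_corners` and the rounding to whole pixels are assumptions; the (left, top, right, bottom) order is the one many imaging tools expect, for example Pillow's `Image.crop`.

```python
def to_pixel_corners(bbox_x, bbox_y, bbox_w, bbox_h, img_w, img_h):
    """Convert a normalized top-left box to pixel (left, top, right, bottom)."""
    left = round(bbox_x * img_w)
    top = round(bbox_y * img_h)
    right = round((bbox_x + bbox_w) * img_w)
    bottom = round((bbox_y + bbox_h) * img_h)
    return (left, top, right, bottom)


corners = to_pixel_corners(0.25, 0.5, 0.5, 0.25, img_w=800, img_h=600)
print(corners)
```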

peterdesmet added level:observations term:new labels Aug 19, 2022

peterdesmet added this to the 1.0 milestone Aug 19, 2022

peterdesmet assigned kbubnicki Aug 19, 2022

peterdesmet mentioned this issue Aug 22, 2022

classificationMethod Terms #224

Closed

peterdesmet mentioned this issue Jan 26, 2023

Add time range to observations #273

Closed

This was referenced Feb 6, 2023

Add speed, distance and angle for observations #210

Closed

Add tags term to observations #287

Closed

peterdesmet mentioned this issue Feb 23, 2023

Camtrap DP 0.6 #297

Merged

peterdesmet mentioned this issue May 24, 2023

Single observation table #314

Merged

peterdesmet closed this as completed May 26, 2023
