Contact - srinivasksun@google.com
The RICO Semantics dataset consists of around 500k human annotations on the RICO dataset. The annotations identify various icons based on their shapes and semantics, and record associations between selected general UI elements (like icons, form fields, radio buttons, text inputs) and their text labels. They also include human-annotated bounding boxes, which are more accurate and cover more UI elements than bounding boxes taken from the view hierarchy. This dataset enables building models for understanding the contents of user interfaces and hence can act as a stepping stone for improving the accessibility of user interfaces.
The datasets are provided "AS IS" without any warranty, express or implied. Google disclaims all liability for any damages, direct or indirect, resulting from the use of this dataset.
- Paper - Towards Better Semantic Understanding of Mobile Interfaces
- Data processing and modeling code
- RICO dataset and paper.
The RICO Semantics dataset augments the RICO dataset with high-quality human
annotations aimed at increasing the semantic understanding of various UI
elements. We provide annotations for 3 different tasks: Icon Shape, consisting
of icon annotations based on their shape; Icon Semantics, consisting of icon
annotations based on their functionality; and Label Association, identifying
various UI elements and associating these UI elements with their text labels.
Compared to previous approaches which rely on bounding boxes from the View
Hierarchy, we release bounding boxes manually annotated by human raters. We
observe that this results in significantly higher coverage of UI elements. In
total, the annotations include 350k annotations for Icon Shape, 78k annotations
for Icon Semantics and 66k UI elements with associated text labels. Please
refer to [our paper] for more details.
The annotations are released as JSON files. Each entry in the JSON file contains
the ID of the raw RICO data in the field `screen_id`. All the annotations are
present in the field `screen_elements`. This field contains a list of
annotations for the UI elements present. The annotation data includes the
relative coordinates (normalized to the range [0, 1]) of the bounding box for
each label and the class associated with the bounding box. The screen elements
are not in any particular order. The annotations are also split into `train`,
`val` and `test` splits. These splits correspond to the numbers reported in
Section 4 of our paper.
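As a rough illustration of the format described above, the following Python sketch reads a set of entries, counts the annotated elements per screen, and scales a normalized bounding box to pixels. Only `screen_id` and `screen_elements` come from the description above. The top-level JSON list, the per-element keys `box` and `label`, the `(x_min, y_min, x_max, y_max)` coordinate order, and the 1440x2560 screen size are all assumptions made for the example, so check them against the released files before relying on this.

```python
import json
from collections import Counter


def parse_entries(json_text):
    """Parse annotation text, assuming the top level is a JSON list of entries."""
    entries = json.loads(json_text)
    if not isinstance(entries, list):
        raise ValueError("expected a JSON list of screen entries")
    return entries


def elements_per_screen(entries):
    """Map each screen_id to the number of annotated elements on that screen."""
    counts = Counter()
    for entry in entries:
        counts[entry["screen_id"]] += len(entry["screen_elements"])
    return counts


def to_pixels(box, width, height):
    """Scale a normalized (x_min, y_min, x_max, y_max) box to pixel coordinates."""
    x_min, y_min, x_max, y_max = box
    return (round(x_min * width), round(y_min * height),
            round(x_max * width), round(y_max * height))


# Hypothetical sample entry: the "box" and "label" keys, the label value and
# the coordinate order are illustrative assumptions, not the released schema.
sample = (
    '[{"screen_id": 1, "screen_elements": '
    '[{"box": [0.1, 0.2, 0.5, 0.4], "label": "ICON_X"}]}]'
)

entries = parse_entries(sample)
counts = elements_per_screen(entries)

# 1440x2560 is an assumed screen size used only to demonstrate the scaling.
pixel_box = to_pixels(entries[0]["screen_elements"][0]["box"], 1440, 2560)
print(counts, pixel_box)
```

Because the coordinates are normalized, the same box can be mapped onto a screenshot of any resolution by passing that image's width and height to `to_pixels`.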
The RICO Semantics dataset is released under the CC BY-SA 4.0 license. For the full license, see LICENSE. Please cite the following paper if you use the dataset in your work:
@article{https://doi.org/10.48550/arxiv.2210.02663,
author = {Srinivas Sunkara and
Maria Wang and
Lijuan Liu and
Gilles Baechler and
Yu-Chung Hsiao and
Jindong Chen and
Abhanshu Sharma and
James Stout},
title = {Towards Better Semantic Understanding of Mobile Interfaces},
journal = {CoRR},
volume = {abs/2210.02663},
year = {2022},
url = {https://arxiv.org/abs/2210.02663},
eprinttype = {arXiv},
eprint = {2210.02663},
timestamp = {Thu, 06 Oct 2022 15:00:01 +0100},
}
The following table is necessary for this dataset to be indexed by search engines such as Google Dataset Search.
property | value
---|---
name | Rico Semantics dataset
alternateName | Rico Semantic Annotations dataset
url | https://github.com/google-research-datasets/rico-semantics
description | The dataset adds annotations to the RICO dataset to improve the semantic understanding of UIs. This dataset consists of human annotated bounding boxes and class labels for 3 tasks: Icon Shape, Icon Semantics and Label Association.
provider |
citation | https://identifiers.org/arxiv:2201.12409