Skip to content

ESoWC 2021: Comparing Atmospheric Composition Datasets

License

Notifications You must be signed in to change notification settings

albavilanova/adc-toolbox

 
 

Repository files navigation

ADC Toolbox

Comparing Atmospheric Composition Datasets


2021 ESoWC Project Image - ADC Toolbox

-----------------------------------------------------

Contents

Contents
  1. Description
  2. Datasets
  3. Requirements
  4. Organization
  5. References
  6. Feedback
  7. Author
  8. Support

-----------------------------------------------------

1. Description

Atmospheric Datasets Comparison (ADC) Toolbox is aimed to have a set of tools that allows users to compare atmospheric composition datasets from different sources. Currently, it is possible to compare:

  • Forecast data from CAMS model vs. L2 near real-time and offline data from TROPOMI (NO2, O3, CO, SO2, HCHO), IASI (O3, CO, SO2) and GOME-2 (NO2, O3, SO2, HCHO) sensors.
  • Reanalysis monthly data from CAMS model vs. L3 monthly data from TROPOMI (NO2), IASI (O3, CO) and GOME-2 (NO2) sensors.

A full explanation of the initial version of this software and its theoretical background can be read in Research Gate.

Users can install the software in their desktop by following the steps described in Requirements. A case study was created to show the usability of the toolbox for deriving long-term trends in air pollution.

-----------------------------------------------------

2. Datasets

ADC Toolbox facilitates the retrieval of all the datasets presented in Table 1, since the dates they became available to the public. As an exception, the retrieval of IASI L2 data is currently available only since May 14, 2019.

Table 1. Temporal availability (start date - end date) by data source.

Source Type Platform NO2 O3 CO SO2 HCHO
CAMS Forecast ADS 01.2015 - Present 01.2015 - Present 01.2015 - Present 01.2015 - Present 01.2015 - Present
CAMS Reanalysis ADS 01.2003 - 06.2021 01.2003 - 06.2021 01.2003 - 06.2021 01.2003 - 06.2021 01.2003 - 06.2021
TROPOMI L2 S5-P hub 07.2018 - Present 07.2018 - Present 07.2018 - Present 10.2018 - Present 07.2018 - Present
TROPOMI L3 TEMIS 07.2018 - Present - - - -
IASI L2 AERIS Portal - 01.2008 - Present 10.2007 - Present 10.2007 - Present -
IASI L3 AERIS Portal - 01.2008 - Present 10.2007 - Present - -
GOME-2 L2 AC SAF 01.2007 - Present 01.2007 - Present - 01.2007 - Present 01.2007 - Present
GOME-2 L3 AC SAF 02.2007 - 11.2017 - - - -
GOME-2 L3 TEMIS 02.2007 - Present - - - -

The temporal availability of each dataset differs from others. CAMS forecasts became available in 2015 and the reanalysis datasets are accessible since 2003. Besides, the satellites that carry the instruments were launched in different years. Sentinel 5-P was launched in 2017, whereas Metop-A, the first European polar-orbiting satellite, was launched in 2006 and de-orbited in 2021. Metop-B and Metop-C have been operational since 2012 and 2018, respectively.

-----------------------------------------------------

3. Requirements

3.1. General

The needed packages to run this software can be found in requirements.txt. The virtual environment environment.yml was generated to simplify the installation process, users just need to clone the project, create the environment and activate it with:

$ conda config --set channel_priority false
$ conda env create -f environment.yml
$ conda activate ADC Toolbox

If this takes too long, they can manually create it instead (recommended):

$ conda create --name ADC Toolbox
$ conda activate ADC Toolbox
$ conda install -c conda-forge/label/cartopy_dev cartopy
$ pip install -r requirements.txt

To download data from CAMS, users will need to create an account, get the ADS API key, create a file with the name keys.txt under the folder data, and write their API key in one line.

It is important to know that this code only runs in Linux operative systems due to the fact that CAMS model data must be downloaded in GRIB format. This can only be done using ecCodes, a package developed by ECMWF that is only available for Linux. In order to install this library, refer to this installation guide for clear instructions.

3.2. Maps visualization

Lately, there have been problems retrieving the data from NACIS Natural Earth to generate the maps. It has also been found out that there are compatibility issues between the libraries Cartopy and Shapely. It is recommended that Shapely is reinstalled after installing Cartopy and that the maps data are downloaded manually with the following commands:

!pip install cartopy
!pip uninstall -y shapely
!pip install shapely --no-binary shapely
!wget https://raw.githubusercontent.com/SciTools/cartopy/master/tools/cartopy_feature_download.py
!python cartopy_feature_download.py physical
!python cartopy_feature_download.py cultural

3.3. Scatter plots by country

In case users want to see the distribution of the components by countries, they will need a Google API Key, Client User ID and Secret. The library Geocoder is used to get the data from Google, but it also allows the users of the ADC Toolbox to use other APIs to do reverse geocoding (retrieving location information by coordinates). In the function scatter_plot, they will find the following line. You can rewrite as they wish. The list of providers can be found in Geocoder's Github repository.

>>> merge_df['Country'] = merge_df.apply(lambda row: geocoder.google([row['latitude'], row['longitude']], 
                                                     method = 'reverse', key = google_api_key).country_long, 
                                                     axis = 1)

If they do not want to edit anything and prefer to run the code using Google API, then they should edit the file keys.txt under the folder data, and write three lines, under their ADS API key. They should contain, in this order, the Google API Key, Client User ID and Secret. They can get their credentials in Google Cloud Platform.

-----------------------------------------------------

4. Organization

├── README.md                               <- Details about project
├── LICENSE                                 <- Details about license
├── apt.txt                                 <- Details about requirements (apt-get)
├── requirements.txt                        <- Details about requirements (pip)
├── environment.yml                         <- Details about virtual environment
├── data                                    <- Folder where datasets are stored
│   │
│   └── cams                                <- Folder where CAMS datasets are stored
│       │
│       ├── 137-levels-definition.csv       <- Details about CAMS L137 levels
│       └── 60-levels-definition.csv        <- Details about CAMS L60 levels
│
├── functions                               <- Folder that contains all functions
│   │
│   ├── functions_general.ipynb             <- Functions that apply to all datasets
│   ├── functions_cams.ipynb                <- Functions that apply to CAMS datasets
│   ├── functions_iasi.ipynb                <- Functions that apply to IASI datasets
│   ├── functions_tropomi.ipynb             <- Functions that apply to TROPOMI datasets
│   └── functions_gome.ipynb                <- Functions that apply to GOME-2 datasets
│
├── report                                  <- Folder where the images of the README.md are stored
│
├── thesis                                  <- Folder for case study (master's thesis)
│
├── trainings                               <- Folder for trainings material 
│   │
│   └── 2021-12-atmospheric-composition     <- 3rd EUMETSAT/ESA/ECMWF Joint Training in AC
│     │
│     ├── images                            <- Folder where the images of the training are stored
│     ├── data-comparison-practical.ipynb   <- Data comparison practical
│     └── keys-example.txt                  <- File format example of keys.txt (hidden)
│
├── main_cams_gome_L2.ipynb                 <- Comparison between CAMS and GOME-2 L2 datasets
├── main_cams_gome_L3.ipynb                 <- Comparison between CAMS and GOME-2 L3 datasets
├── main_cams_iasi_L2.ipynb                 <- Comparison between CAMS and IASI L2 datasets
├── main_cams_iasi_L3.ipynb                 <- Comparison between CAMS and IASI L3 datasets
├── main_cams_tropomi_L2.ipynb              <- Comparison between CAMS and TROPOMI L2 datasets
└── main_cams_tropomi_L3.ipynb              <- Comparison between CAMS and TROPOMI L3 datasets

-----------------------------------------------------

5. References

5.1. Data sources

5.2. Documentation

5.3. Other useful references

-----------------------------------------------------

6. User feeback form

If you are interested in the ADC toolbox and would like it to have other functionalities or datasets, you can write your feedback in this form. Thank you very much!

-----------------------------------------------------

7. Author

Alba Vilanova Cortezón
Alba Vilanova Cortezón
Research Engineer at Barcelona Supercomputing Center
BEng in Mechanical Engineering at University of Lleida and Inha University
MSc in Geospatial Technologies at NOVA University of Lisbon, WWU Münster and UJI
GitHub: @albavilanova
LinkedIn: @albavilanova
Website: https://albavilanova.es/

-----------------------------------------------------

8. Support

ADC Toolbox has been developed under the framework set by ECMWF Summer of Weather Code:

Logo of ESoWC

A special thanks to my mentors Federico Fierli, Antje Inness and Miha Razinger for their support and guidance.

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%