Skip to content

Luigi workflow for finding ard products in the CEDA archive.

License

Notifications You must be signed in to change notification settings

jncc/ceda_ard_finder

Repository files navigation

CEDA ARD finder

Find ARD in CEDA by searching using their Elasticsearch API.

The CEDA data portal ARD data

Installation and setup

Create virtual env

cd cd_ard_finder

python3 -m venv .venv

source .venv/bin/activate

pip install -r requirements.txt

Integrating this workflow into another using a python package

Installing the specific version from tags (Recommended)

pip install git+https://github.com/jncc/ceda_ard_finder.git@v0.0.1

Then import the required tasks into your workflow and use them:

...
from ceda_ard_finder import CreateSymlinks

@requires(CreateSymlinks)
class GetArdProducts(luigi.Task):
    ...

Add to requirements.txt of your workflow

...
ceda_ard_finder @ git+https://git@github.com/jncc/ceda_ard_finder.git@v0.0.1
...

Setup and runnning

All search filters are optional except startDate and endDate. See ceda-ard-finder-luigi.cfg.example for more filter examples.

Generate state file with search results:

LUIGI_CONFIG_PATH=ceda-ard-finder-luigi.cfg PYTHONPATH='.' luigi --module ceda_ard_finder SearchForProducts --startDate=2021-01-01 --endDate=2021-01-31 --local-scheduler

Output results as symlinks to CEDA locations:

LUIGI_CONFIG_PATH=ceda_ard_finder/ceda-ard-finder-luigi.cfg PYTHONPATH='.' luigi --module ceda_ard_finder CreateSymlinks --startDate=2021-01-01 --endDate=2021-01-31 --local-scheduler

Output results in a text file instead of symlinks. The date range queried is included in the filename:

LUIGI_CONFIG_PATH=ceda_ard_finder/ceda-ard-finder-luigi.cfg PYTHONPATH='.' luigi --module ceda_ard_finder CreateTextFileList --startDate=2021-01-01 --endDate=2021-01-31 --local-scheduler

About

Luigi workflow for finding ard products in the CEDA archive.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages