Skip to content

Commit

Permalink
eradicate our confounding of release and submission
Browse files Browse the repository at this point in the history
Co-authored-by: Cornelius Roemer <cornelius.roemer@gmail.com>
  • Loading branch information
j23414 and corneliusroemer committed Nov 7, 2023
1 parent f310755 commit ac1ae4a
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
6 changes: 3 additions & 3 deletions ingest/config/config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -10,14 +10,14 @@ transform:
# Fields to rename.
# This is the first step in the pipeline, so any references to field names
# in the configs below should use the new field names
field_map: ['collected=date', 'submitted=date_submitted', 'genbank_accession=accession', 'submitting_organization=institution']
field_map: ['collected=date', 'released=date_released, 'genbank_accession=accession', 'submitting_organization=institution']
# Standardized strain name regex
# Currently accepts any characters because we do not have a clear standard for strain names
strain_regex: '^.+$'
# Back up strain name field if 'strain' doesn't match regex above
strain_backup_fields: ['accession']
# List of date fields to standardize
date_fields: ['date', 'date_submitted']
date_fields: ['date', 'date_released']
# Expected date formats present in date fields
# These date formats should use directives expected by datetime
# See https://docs.python.org/3.9/library/datetime.html#strftime-and-strptime-format-codes
Expand Down Expand Up @@ -63,7 +63,7 @@ transform:
'division',
'location',
'host',
'date_submitted',
'date_released',
'sra_accession',
'abbr_authors',
'authors',
Expand Down
2 changes: 1 addition & 1 deletion ingest/source-data/ncbi-dataset-field-map.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ Isolate Lineage strain
Geographic Region region
Geographic Location location
Isolate Collection date collected
Release date submitted
Release date released
Update date updated
Length length
Host Name host
Expand Down

0 comments on commit ac1ae4a

Please sign in to comment.