Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Long-format table annotation Part 1] download gene names and protein RefSeq IDs #55

Merged
merged 23 commits into from
Jul 21, 2021

Commits on Jul 15, 2021

  1. Configuration menu
    Copy the full SHA
    0b316f4 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    75769b5 View commit details
    Browse the repository at this point in the history
  3. Update README.md

    logstar committed Jul 15, 2021
    Configuration menu
    Copy the full SHA
    b750ed5 View commit details
    Browse the repository at this point in the history

Commits on Jul 16, 2021

  1. Download annotation data

    Download `Gene_full_name` and `Protein_RefSeq_ID` from
    https://mygene.info/ using the mygene package.
    logstar committed Jul 16, 2021
    Configuration menu
    Copy the full SHA
    18ea05a View commit details
    Browse the repository at this point in the history
  2. Clean up ensg-gene-full-name-refseq-protein.tsv

    Remove rows that have both Gene_full_name and Protein_RefSeq_ID values
    missing.
    
    Write NA for missing values rather than "NA" or "", in order to be
    consistent with the data release.
    logstar committed Jul 16, 2021
    Configuration menu
    Copy the full SHA
    f40e997 View commit details
    Browse the repository at this point in the history
  3. Add echo commands in shell scripts

    Print messages after done running shell scripts.
    logstar committed Jul 16, 2021
    Configuration menu
    Copy the full SHA
    f941405 View commit details
    Browse the repository at this point in the history
  4. Sort mygene returned character values

    mygene API may return results in different orders, so sorting the values
    before output is necessary to reproduce previous results.
    logstar committed Jul 16, 2021
    Configuration menu
    Copy the full SHA
    96e8885 View commit details
    Browse the repository at this point in the history
  5. Update README.md

    Add long-format-table-utils module.
    logstar committed Jul 16, 2021
    Configuration menu
    Copy the full SHA
    80b794e View commit details
    Browse the repository at this point in the history
  6. Update README.md

    Describe how to update
    annotator/annotation-data/oncokb-cancer-gene-list.tsv.
    logstar committed Jul 16, 2021
    Configuration menu
    Copy the full SHA
    9784aae View commit details
    Browse the repository at this point in the history

Commits on Jul 17, 2021

  1. Update README.md

    Update OncoKB annotation table source to
    analyses/long-format-table-utils/annotator/annotation-data/oncokb-cancer-gene-list.tsv
    logstar committed Jul 17, 2021
    Configuration menu
    Copy the full SHA
    9817edf View commit details
    Browse the repository at this point in the history

Commits on Jul 20, 2021

  1. Update error message in download-annotation-data.R

    Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>
    logstar and jharenza authored Jul 20, 2021
    Configuration menu
    Copy the full SHA
    909277d View commit details
    Browse the repository at this point in the history
  2. Update error message in download-annotation-data.R

    Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>
    logstar and jharenza authored Jul 20, 2021
    Configuration menu
    Copy the full SHA
    80b6af7 View commit details
    Browse the repository at this point in the history
  3. Update error message in download-annotation-data.R

    Co-authored-by: Jo Lynne Rokita <jharenza@gmail.com>
    logstar and jharenza authored Jul 20, 2021
    Configuration menu
    Copy the full SHA
    a1399bc View commit details
    Browse the repository at this point in the history
  4. Rename update-long-format-table-utils.sh to run-update-long-format-ta…

    …ble-utils.sh
    
    README.md is also updated accordingly.
    
    This is suggested by @jharenza at
    d3b-center#55 (comment)
    , in order to follow the shell script name convention of analysis
    modules.
    logstar committed Jul 20, 2021
    Configuration menu
    Copy the full SHA
    090f080 View commit details
    Browse the repository at this point in the history

Commits on Jul 21, 2021

  1. Specify annotation data versions in README.md

    Add annotation data versions and data of the last update in the "Update
    downloaded data that are used in this module" section, as suggested by
    @jharenza at
    d3b-center#55 (comment)
    
    Combine gene and disease (/cancer_group) annotations into one table.
    
    Add additional notes on annotation data versions to the "Implementation
    of long-format table annotator" section.
    logstar committed Jul 21, 2021
    Configuration menu
    Copy the full SHA
    a183470 View commit details
    Browse the repository at this point in the history
  2. Update README.md

    Change the date of the last update of
    annotator/annotation-data/oncokb-cancer-gene-list.tsv to 07/16/2021.
    
    The 07/16/2021 annotator/annotation-data/oncokb-cancer-gene-list.tsv is
    identical to the previous 06/16/2021 version, even though the website at
    https://www.oncokb.org/cancerGenes has changed last update from
    06/16/2021 to 07/16/2021.
    logstar committed Jul 21, 2021
    Configuration menu
    Copy the full SHA
    5716523 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    03516c9 View commit details
    Browse the repository at this point in the history
  4. Remove test cases in download-annotation-data.R

    As suggested by @jharenza at
    <d3b-center#55 (comment)>,
    test cases should be removed from the source code file.
    logstar committed Jul 21, 2021
    Configuration menu
    Copy the full SHA
    6f76fae View commit details
    Browse the repository at this point in the history
  5. Add unit testing using the testthat package

    Run `bash run-tests.sh` to run all tests.
    
    In order to import a funciton for testing from an R file without running
    the whole file, a helper function import_function is defined at
    tests/helper_import_function.R, and the import_function is also tested
    in the tests/test_helper_import_function.R file.
    logstar committed Jul 21, 2021
    Configuration menu
    Copy the full SHA
    48a131d View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    0773a1e View commit details
    Browse the repository at this point in the history
  7. Update README.md

    Add "Unit testing for long-format table annotator" section to descript
    how to use the unit testing framework.
    logstar committed Jul 21, 2021
    Configuration menu
    Copy the full SHA
    a08cf5f View commit details
    Browse the repository at this point in the history
  8. Update README.md

    Edit.
    logstar committed Jul 21, 2021
    Configuration menu
    Copy the full SHA
    796a26c View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    29325c7 View commit details
    Browse the repository at this point in the history