Target folder structure #14

JXu768 · 2023-11-17T19:51:50Z

JXu768
Nov 17, 2023

Hi community, can someone please clarify the target folder structure in the DA SOP? https://chorus-ai.github.io/data_acq_SOP/docs/Data-Uploading/Data_uploading/
Should we split each structured table by patient and make 13 tables for each patient (total being 13*N in each upload), or should we have 13 tables, each table would have all patients in each upload?

clermontg · 2023-11-30T18:32:13Z

clermontg
Nov 30, 2023
Maintainer

Jenny,
indeed, we want 13 tables for each patient. Many tables will have a single line (person_id, visit_occurence, observation_period). Some will have no lines most of the time (death).
We will also likely suggest that the patient folder should correspond to the visit_occurence_ID rather than person_ID, since the same patient (person_id) may have several encounters (different hospitalizations) within or across years.
We trying to sort out is how to best indicate you are uploading a modified OMOP table. Let's say you mapped more MEASUREMENTs and you want to update that table in the cloud. The cloud has implicit versioning and we should not lose data if you call the file the same. So, our suggestion is just to overwrite the tables/files. A final decision will be made by Tools before this circumstance happens.

2 replies

del42 Nov 30, 2023
Maintainer

@clermontg Could I just suggest, not to enforce visit_occurence_ID on the folders. For instance of continues waveforms, sometimes it can belong to several encounters. Where do we put this waveform? We wouldn't want to break the waveform. Sometime encounters overlap, in the instance of ICU stay and patient get up to get procedure done. There should be one physical continues waveform. You know what I mean?

Sites Directory Structure

/person_id
- /OMOP
  - PERSON.csv
  - OBSERVATION_PERIOD.csv
  - VISIT_OCCURRENCE.csv
  - VISIT_DETAIL.csv
  - CONDITION_OCCURRENCE.csv
  - DRUG_EXPOSURE.csv
  - PROCEDURE_OCCURRENCE.csv
  - DEVICE_EXPOSURE.csv
  - MEASUREMENT.csv
  - OBSERVATION.csv
  - DEATH.csv
  - NOTE.csv
  - NOTE_NLP.csv
  - FACT_RELATIONSHIP.csv
- /IMAGES
  - person_id-study_id.hdf5.
- /WAVEFORMS
  - person_id-starttime1-range.hdf5
  - person_id-starttime2-range.hdf5

With person_id and starttime, we can align the waveform with multiple encounters. -Del

Chesterguan Nov 30, 2023
Maintainer

Here are some of my naïve ideas, and I think our key goal right now is to upload the data and keep all the capabilities to organize the data as we want eventually.

@del42 , I think maybe you guys used visit_occurrence_id as visit_detail_id, and that's fine.

Although UF data is okay to use visit_occurrence_id to link, I still vote for Del's idea. The person_id + start_timestamp will be unique identifier for waveform files and will not lose any information to link with EHR data.

we may need to consider the timezone issue when we used timestamp. UNIX Epoch will be okay.

Besides that, I would suggest each site to provide crosswalk files when uploading unstructured data which indicates:

For waveform:

patientID
visit_occurrence_id or visit_detail_id or other EHR indexer
admissionStartTime -> it can be ICU admission or hospital admission based on data's granularity
admissionEndTime -> same to above
waveFormStartTime
waveFormFilePath or other unique file path identifier
(optional) waveFormEndTime(or it can be any like recording period)

Although the above information can be captured/extracted from target HDF5 or CCDEF waveform files, I believe a simple crosswalk file will be much easier.

For Images: (I am not sure how to use studyID to locate the images, but from my understanding, it's similar to waveform files)

patientID
visit_occurrence_id or visit_detail_id or other EHR indexer
admissionStartTime
admissionEndTime
imageFilePath or other unique file handler

Only if the image was produced during the admission period, It should be fine.

clermontg · 2023-11-30T21:12:03Z

clermontg
Nov 30, 2023
Maintainer

@bold, ***@***.***> Then, we need to be specific as to how different encounters are uploaded for a given person_id. Need a recipe as how to upload distinct hospitalizations. A recipe could be to just append to the OMOP tables? (for example, add a second line to the visit_occurence table to reflect that second admission and modify the observation_period table to reflect the last available data element for that person_id, all the other tables would be straight unions across admissions) Gilles From: Del Bold ***@***.***> Sent: Thursday, November 30, 2023 3:18 PM To: chorus-ai/data_acq_SOP ***@***.***> Cc: Clermont, Gilles ***@***.***>; Mention ***@***.***> Subject: Re: [chorus-ai/data_acq_SOP] Target folder structure (Discussion #14) @clermontg<https://github.com/clermontg> Could I just suggest, not to enforce visit_occurence_ID on the folders. For instance of continues waveforms, sometimes it can belong to several encounters. Where do we put this waveform? We wouldn't want to break the waveform. Sometime encounters overlap, in the instance of ICU stay and patient get up to get procedure done. There should be one physical continues waveform. You know what I mean? Sites Directory Structure * /person_id * /OMOP * PERSON.csv * OBSERVATION_PERIOD.csv * VISIT_OCCURRENCE.csv * VISIT_DETAIL.csv * CONDITION_OCCURRENCE.csv * DRUG_EXPOSURE.csv * PROCEDURE_OCCURRENCE.csv * DEVICE_EXPOSURE.csv * MEASUREMENT.csv * OBSERVATION.csv * DEATH.csv * NOTE.csv * NOTE_NLP.csv * FACT_RELATIONSHIP.csv * /IMAGES * person_id-study_id.hdf5. * /WAVEFORMS * person_id-starttime1-range.hdf5 * person_id-starttime2-range.hdf5 With person_id and starttime, we can align the waveform with multiple encounters. -Del - Reply to this email directly, view it on GitHub<#14 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AL33RKWKOTEKJECZIIV3CK3YHDSV7AVCNFSM6AAAAAA7QI3XHWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TOMRRHE4TS>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

1 reply

del42 Dec 4, 2023
Maintainer

@clermontg I am bit confused with your comment on we needing a recipe as how to upload distinct hospitalizations. Encounters are visit_occurrences in OMOP and person could have several visit_occurrences. We probably don't need to add a second line to the visit_occurence table to reflect that second admission and modify the observation_period table to reflect the last available data element for that person_id. It is already there to my understanding.

clermontg · 2023-11-30T21:17:51Z

clermontg
Nov 30, 2023
Maintainer

We not requesting the NOTE table (yet). Chester, not sure what you mean by your comment about occurrence and detail. Visit_detail reflects all the nursing units a patient has been moved across during a single occurrence. So, it is essential to compute ICU length of stay and datetime of ICU admission and discharge. It is true one can build visit_occurence from visit_detail. Is this what you meant? Gilles From: Chester Guan （Ziyuan Guan） ***@***.***> Sent: Thursday, November 30, 2023 4:05 PM To: chorus-ai/data_acq_SOP ***@***.***> Cc: Clermont, Gilles ***@***.***>; Mention ***@***.***> Subject: Re: [chorus-ai/data_acq_SOP] Target folder structure (Discussion #14) Here are some of my naïve ideas, and I think our key goal right now is to upload the data and keep all the capabilities to organize the data as we want eventually. @del42<https://github.com/del42> , I think maybe you guys used visit_occurrence_id as visit_detail_id, and that's fine. Although UF data is okay to use visit_occurrence_id to link, I still vote for Del's idea. The person_id + start_timestamp will be unique identifier for waveform files and will not lose any information to link with EHR data. we may need to consider the timezone issue when we used timestamp. UNIX Epoch will be okay. Besides that, I would suggest each site to provide crosswalk files when uploading unstructured data which indicates: For waveform: * patientID * visit_occurrence_id or visit_detail_id or other EHR indexer * admissionStartTime -> it can be ICU admission or hospital admission based on data's granularity * admissionEndTime -> same to above * waveFormStartTime * waveFormFilePath or other unique file path identifier * (optional) waveFormEndTime(or it can be any like recording period) Although the above information can be captured/extracted from target HDF5 or CCDEF waveform files, I believe a simple crosswalk file will be much easier. For Images: (I am not sure how to use studyID to locate the images, but from my understanding, it's similar to waveform files) * patientID * visit_occurrence_id or visit_detail_id or other EHR indexer * admissionStartTime * admissionEndTime * imageFilePath or other unique file handler Only if the image was produced during the admission period, It should be fine. — Reply to this email directly, view it on GitHub<#14 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AL33RKX5IXHYYBQRVBSL5F3YHDYH5AVCNFSM6AAAAAA7QI3XHWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TOMRSGQ2DK>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

4 replies

Chesterguan Nov 30, 2023
Maintainer

Hi Gilles, sorry for the misunderstanding. What i tried to say is since VISIT_DETAIL table is an optional table, so site might only use visit_occurrence_id to represent every single encounter.

So the waveform files might be shared between multiple visit_occurrence_ids.

clermontg Nov 30, 2023
Maintainer

True that visit_detail is overall optional in the CDM, but this is CHoRUS's only way to understand ICU admissions, so we will insist that sites produce it.
I am very much supporting that waveform files be encounter-specific. Why not include visit_occurrence_id as suffix instead of start time?

Chesterguan Nov 30, 2023
Maintainer

I mean we could include visit_occurrence_id as suffix for EHR linkage,
but when end users use the waveform files(and also tool module's application), they still need to use the waveform start timestamp, I think that's Del's concern. And also if we want to name waveform files with visit_occurrence_id, we might need to split a single waveform file into multiple files which might add more complexities to it.

and for ICU admission, I am not sure how to define the ICU admission precisely actually.
I tried to use location enter and exit timestamps to map with our waveform recording timestamps, but actually many of them cannot match logically. Some station enter timestamp was after the waveform recording start timestamp, while some waveform recording stop timestamp was after the station exit timestamp. It's hard to define the time intervals, many of them have hours difference.
However, if we used hospital admission start and end time to match, it will be much easier. Although it's not accurate, but make more sense.
I think other sites might have similar issues as well.

But anyway, that's just my ideas and concerns, we will follow your instructions for sure.

del42 Dec 4, 2023
Maintainer

All I am suggesting is that we leave the waveform file name with person_id and timestamp and duration. That is how I managed in the past and linking the raw waveform with those info is much better. This won't require as to break the continues file and also make duplicates on overlapping encounters. Plus, I am bit afraid to link in to wrong encounter and make the file not usable.

del42 · 2023-12-04T18:00:59Z

del42
Dec 4, 2023
Maintainer

Could I suggest how we should name the waveform files on the waveform and image folder?
This is how I would like to submit. Please take it or leave it @clermontg @Chesterguan

Clinical Waveform Data Naming Convention in Source Format

Adjustments to the naming convention for clinical waveform data stored in this case in HDF5 format are made to include the duration in seconds and exclude the modality or type of waveform. The format is as follows:

Patient Identification: A unique person ID, typically a number or numeric digits from OMOP Person Table.
Study Date: The date when the waveform data was recorded, in the format YYYYMMDD.
Start Time: The start time of the recording, in HHMMSS format.
Duration in Seconds: The duration of the recording in seconds.

Example File Name

Person123_20230101_101530_3600.h5

This example represents a recording for patient 123 on January 1, 2023, starting at 10:15:30, with a duration of 3600 seconds, and a unique identifier.

(DICOM) Image Naming Convention

Patient Identification: Typically includes a person ID.
Study Date: The date when the study was conducted, usually in the format YYYYMMDD.
Study Time: The time of the study, often in HHMMSS format.
Modality: Refers to the type of equipment used for the scan, such as MR (Magnetic Resonance), CT (Computed Tomography), or US (Ultrasound).
Series Number: Indicates the sequence of a particular series of images in a study.
Instance Number: Represents the specific image number within a series.

Example of a DICOM Image Name

PersonID_20230101_101530_CT_01_001.dcm

This example would represent a CT scan for a patient, conducted on January 1, 2023, at 10:15:30. This image is the first in its series and the first in that series.

Notes

This naming convention simplifies the retrieval for the APIs and central cloud processing ETLs.

1 reply

Chesterguan Dec 4, 2023
Maintainer

I like the idea!

clermontg · 2023-12-04T18:40:56Z

clermontg
Dec 4, 2023
Maintainer

So, if a recording covers 8 days of data, the date and time of the first entry in the h5 should be used, correct? From: Del Bold ***@***.***> Sent: Monday, December 4, 2023 1:01 PM To: chorus-ai/data_acq_SOP ***@***.***> Cc: Clermont, Gilles ***@***.***>; Mention ***@***.***> Subject: Re: [chorus-ai/data_acq_SOP] Target folder structure (Discussion #14) Could I suggest how we should name the waveform files on the waveform and image folder? This is how I would like to submit. Please take it or leave it. Clinical Waveform Data Naming Convention in Source Format Adjustments to the naming convention for clinical waveform data stored in this case in HDF5 format are made to include the duration in seconds and exclude the modality or type of waveform. The format is as follows: * Patient Identification: A unique person ID, typically a number or numeric digits from OMOP Person Table. * Study Date: The date when the waveform data was recorded, in the format YYYYMMDD. * Start Time: The start time of the recording, in HHMMSS format. * Duration in Seconds: The duration of the recording in seconds. Example File Name Person123_20230101_101530_3600.h5 This example represents a recording for patient 123 on January 1, 2023, starting at 10:15:30, with a duration of 3600 seconds, and a unique identifier. (DICOM) Image Naming Convention * Patient Identification: Typically includes a patient ID, which could be a number or a combination of letters and numbers, assigned by the medical facility. * Study Date: The date when the study was conducted, usually in the format YYYYMMDD. * Study Time: The time of the study, often in HHMMSS format. * Modality: Refers to the type of equipment used for the scan, such as MR (Magnetic Resonance), CT (Computed Tomography), or US (Ultrasound). * Series Number: Indicates the sequence of a particular series of images in a study. * Instance Number: Represents the specific image number within a series. Example of a DICOM Image Name PatientID_20230101_101530_CT_01_001.dcm This example would represent a CT scan for a patient, conducted on January 1, 2023, at 10:15:30. This image is the first in its series and the first in that series. Notes * This naming convention simplifies the retrieval for the APIs and central cloud processing ETLs. - Reply to this email directly, view it on GitHub<#14 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AL33RKTYLDCPZ6S66AFOZWTYHYFWPAVCNFSM6AAAAAA7QI3XHWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TONJWGEYTC>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

0 replies

clermontg · 2023-12-04T18:45:57Z

clermontg
Dec 4, 2023
Maintainer

Del, I will not micromanage this and am fine with any scheme that works. Things that make sense for me usually do not make sense for most other humans 😉 Did you modify the text in the Draft request on do you want me to? Gilles From: Del Bold ***@***.***> Sent: Monday, December 4, 2023 10:51 AM To: chorus-ai/data_acq_SOP ***@***.***> Cc: Clermont, Gilles ***@***.***>; Mention ***@***.***> Subject: Re: [chorus-ai/data_acq_SOP] Target folder structure (Discussion #14) All I am suggesting is that we leave the waveform file name with person_id and timestamp and duration. That is how I managed in the past and linking the raw waveform with those info is much better. This won't require as to break the continues file and also make duplicates on overlapping encounters. Plus, I am bit afraid to link in to wrong encounter and make the file not usable. — Reply to this email directly, view it on GitHub<#14 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AL33RKTFVGETYIU3SYCUUMTYHXWN5AVCNFSM6AAAAAA7QI3XHWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TONJUGY3TQ>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

1 reply

del42 Dec 4, 2023
Maintainer

@clermontg I will work with @Chesterguan to change the documentation. Thank you for your confidence in us!

clermontg · 2023-12-04T18:51:22Z

clermontg
Dec 4, 2023
Maintainer

@del42 OK, this one we do need to talk about. So, how do you suggest representing two distinct admissions in our suggested folder structure? Gilles From: Del Bold ***@***.***> Sent: Monday, December 4, 2023 10:46 AM To: chorus-ai/data_acq_SOP ***@***.***> Cc: Clermont, Gilles ***@***.***>; Mention ***@***.***> Subject: Re: [chorus-ai/data_acq_SOP] Target folder structure (Discussion #14) @clermontg<https://github.com/clermontg> I am bit confused with your comment on we needing a recipe as how to upload distinct hospitalizations. Encounters are visit_occurrences in OMOP and person could have several visit_occurrences. We probably don't need to add a second line to the visit_occurence table to reflect that second admission and modify the observation_period table to reflect the last available data element for that person_id. It is already there to my understanding. - Reply to this email directly, view it on GitHub<#14 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AL33RKQTWCCJOSLMHTICALDYHXV4BAVCNFSM6AAAAAA7QI3XHWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TONJUGYZTC>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

0 replies

AEW0330 · 2023-12-07T16:45:37Z

AEW0330
Dec 7, 2023

Gilles are we sure the inclusion of a per-patient OMOP folder in this structure is required?

Having the OMOP data including all the vocabulary in two places would seem to:

create very large redundancies between the patient-level OMOP and the central OMOP
data integrity challenges

If the rationale is very clear, I don't want to suggest that this approach should be reconsidered.

Is there a source of information that gives a clear understanding of

how those redundancy and integrity risks are managed
the added value the person-level OMOP instance for required functions like defining cohorts or ML feature engineering

Given the ability to link files to the aggregate OMOP structure, we might be wise to make sure we're not taking on large tool development tasks for problems that might already be mostly solved like querying data to define cohorts or generating person-level data structures for feature engineering.

Querying to define cohorts and the generation of data frames (in python or R) are already handled by mature software pipelines by the OHDSI tool stack. If we need different functionality for doing that with waveforms and images, an alternative might be needed. The documentation of the data extraction functions in the PLP package illustrate how that data extraction and transformation to a row-per-person data frame is supported within the OHDSI for ML model development.

Those functions use the FeatureExtraction package to do that work. FeatureExtraction gets data from a relational OMOP DB structure and puts it into a row-per-patient data frame. Data objects output from FeatureExtraction is stored within an object that included associated metadata about meaning of default indicator variables (e.g. condition occurrence) or custom covariates (lab or other numeric values or trends etc.). The OHDSI pipelines like those in PLP use those objects for feature engineering by the many flavors of ML supported in the OHDSI HADES packages like the DeepPatientLevelPrediction package or other approaches depending on use case requirements.

These pipelines are quite full-featured and flexible. They support ensemble models and are tools for clinically informed inspection of data-driven feature selection results, automated displays of results in various standard outputs, linkage with the [OHDSI Prediction Model library](https://delphi.ohdsi.org/] that supports upload and download of models and cumulative collection of model performance metadata at different institutions, etc. In general, they standardize inputs in ways that leverage the OMOP information model and phenotyping resources, allow very flexible adaptation of model types - hyperparameter specification and other design options - or use of completely novel classifiers, and provide standardized outputs that conform to ML best practices for model inspection and sharing.

I sketch this mature OMOP/OHDSI-based support just to help us collectively clarify where we think new approaches and associated tools need to be developed based on new ways of storing and accessing the OMOP data as suggested by this file structure.

It seems fairly urgent to reach a clear conclusion about this approach to prevent sites from having to redo work plans for data contribution and give a clear guidance to people who are working on tool development to meet our project requirements.

That type of concern is the motivation I had for us all to clearly define what the requirements are and the existing support for them. I hope this comment is useful and conveys the deep respect I have for your expertise in working in this space.

0 replies

clermontg · 2023-12-07T17:14:20Z

clermontg
Dec 7, 2023
Maintainer

Andrew, The problem at hand is collection and upload. What you suggest has been discussed as an alternative model with the main argument that sites would not have to deconstruct their OMOP tables into person_id-specific tables. Writing a script to do this though is trivial. A main advantage is that the structure allows all domains of data pertaining to a patient to be easily identified and it is a more natural way to segregate waveform and image files. This is also the model that MIMIC has adopted for their waveform files. If there were only OMOP tables, an argument could be made for all patients to be included in single clinical tables. The person_id-centric arrangement also does favor data integrity as sites add new patients or update individual patients. You know where to look. With the alternative model, you would have to develop procedures as to how to properly append/update/replace data. This seems risky. There is no data redundancy, just a larger number of files and folders. Centrally, all .csv will be assembled in a single database of course, so OHDSI script can be run to construct cohorts, etc. A script to do this is also simple to develop (a script will need to be developed anyway to merge data from different sites). So, this will work. Gilles From: Andrew Williams ***@***.***> Sent: Thursday, December 7, 2023 11:46 AM To: chorus-ai/data_acq_SOP ***@***.***> Cc: Clermont, Gilles ***@***.***>; Mention ***@***.***> Subject: Re: [chorus-ai/data_acq_SOP] Target folder structure (Discussion #14) ***@***.***> are we sure the inclusion of a per-patient OMOP folder in this structure is required? Having the OMOP data including all the vocabulary in two places would seem to: * create very large redundancies between the patient-level OMOP and the central OMOP * data integrity challenges If the rationale is very clear, I don't want to suggest that this approach should be reconsidered. Is there a source of information that gives a clear understanding of * how those redundancy and integrity risks are managed * the added value the person-level OMOP instance for required functions like defining cohorts or ML feature engineering Given the ability to link files to the aggregate OMOP structure, we might be wise to make sure we're not taking on large tool development tasks for problems that might already be mostly solved like querying data to define cohorts or generating person-level data structures for feature engineering. Querying to define cohorts and the generation of data frames (in python or R) are already handled by mature software pipelines by the OHDSI tool stack. If we need different functionality for doing that with waveforms and images, an alternative might be needed. The documentation of the data extraction functions in the PLP package<https://ohdsi.github.io/PatientLevelPrediction/reference/index.html> illistrate how that happens for ML model development. Those functions use the FeatureExtraction<http://ohdsi.github.io/FeatureExtraction/> package to do that work. FeatureExtraction gets data from a relational OMOP DB structure and puts it into a row-per-patient data frame.. Data objects output from FeatureExtraction is stored within an object that included associated metadata about meaning of default indicator variables (e.g. condition occurrence) or custom covariates (lab or other numeric values or trends etc.). The OHDSI pipelines like those in PLP use those objects for feature engineering by the many flavors of ML supported in the OHDSI HADES packages like the DeepPatientLevelPrediction<https://github.com/OHDSI/DeepPatientLevelPrediction> package or other approaches depending on use preferences. It seems fairly urgent to reach a clear conclusion about this approach to prevent sites from having to redo work plans for data contribution and give a clear guidance to people who are working on tool development to meet our project requirements. That type of concern is the motivation I had for us all to clearly define what the requirements are and the existing support for them. I hope this comment is useful and conveys the deep respect I have for your expertise in working in this space. - Reply to this email directly, view it on GitHub<#14 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AL33RKU2EKNUO3FPT7BOSCDYIHXDZAVCNFSM6AAAAAA7QI3XHWVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TOOJRGEYTG>. You are receiving this because you were mentioned.Message ID: ***@***.******@***.***>>

1 reply

JXu768 Dec 15, 2023
Author

@clermontg, we appreciate that having individual files per patient would allow all domains of data pertaining to a patient to be together, however, deconstructing OMOP tables can be done at the site level or at the central site when all OMOP tables are appended into a large database, which is one of the goals anyway.
From the local site perspective, deconstructing the table might not make it easier for future updates. We would have to either check individual tables and patient folders to find the ones that need a replacement/update, or pull the OMOP tables and deconstruct them every time we want to replace/update a few rows, which adds a step to the local site workflow each time we update/append/replace the data.
Thanks for your patience and I apologize if I overlooked some benefits of the current structure or simpler ways of updating patient OMOP tables under the current structure. If there is strong preference of keeping OMOP tables per patient, we will follow the instruction for the Feb submission.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Target folder structure #14

{{title}}

Replies: 9 comments 10 replies

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

Target folder structure #14

JXu768 Nov 17, 2023

Replies: 9 comments · 10 replies

clermontg Nov 30, 2023 Maintainer

del42 Nov 30, 2023 Maintainer

Sites Directory Structure

Chesterguan Nov 30, 2023 Maintainer

clermontg Nov 30, 2023 Maintainer

del42 Dec 4, 2023 Maintainer

clermontg Nov 30, 2023 Maintainer

Chesterguan Nov 30, 2023 Maintainer

clermontg Nov 30, 2023 Maintainer

Chesterguan Nov 30, 2023 Maintainer

del42 Dec 4, 2023 Maintainer

del42 Dec 4, 2023 Maintainer

Clinical Waveform Data Naming Convention in Source Format

Example File Name

(DICOM) Image Naming Convention

Example of a DICOM Image Name

Notes

Chesterguan Dec 4, 2023 Maintainer

clermontg Dec 4, 2023 Maintainer

clermontg Dec 4, 2023 Maintainer

del42 Dec 4, 2023 Maintainer

clermontg Dec 4, 2023 Maintainer

AEW0330 Dec 7, 2023

clermontg Dec 7, 2023 Maintainer

JXu768 Dec 15, 2023 Author

JXu768
Nov 17, 2023

Replies: 9 comments 10 replies

clermontg
Nov 30, 2023
Maintainer

del42 Nov 30, 2023
Maintainer

Chesterguan Nov 30, 2023
Maintainer

clermontg
Nov 30, 2023
Maintainer

del42 Dec 4, 2023
Maintainer

clermontg
Nov 30, 2023
Maintainer

Chesterguan Nov 30, 2023
Maintainer

clermontg Nov 30, 2023
Maintainer

Chesterguan Nov 30, 2023
Maintainer

del42 Dec 4, 2023
Maintainer

del42
Dec 4, 2023
Maintainer

Chesterguan Dec 4, 2023
Maintainer

clermontg
Dec 4, 2023
Maintainer

clermontg
Dec 4, 2023
Maintainer

del42 Dec 4, 2023
Maintainer

clermontg
Dec 4, 2023
Maintainer

AEW0330
Dec 7, 2023

clermontg
Dec 7, 2023
Maintainer

JXu768 Dec 15, 2023
Author