Skip to content

Latest commit

 

History

History
41 lines (29 loc) · 1.89 KB

import.md

File metadata and controls

41 lines (29 loc) · 1.89 KB

Import

Systems

The following systems were used for performance evaluation:

System Provider CPU Cores RAM SSD Heap Mem Block Cache Resource Cache ¹ Background Jobs ² Indexer Threads ³
LEA47 on-prem EPYC 7543P 16 128 GiB 2 TB 32 GiB 32 GiB 10 M 8 16
LEA58 on-prem EPYC 7543P 32 256 GiB 2 TB 64 GiB 64 GiB 20 M 16 32

¹ Size of the resource cache (DB_RESOURCE_CACHE_SIZE) ² The maximum number of the [background jobs][3] used for DB compactions (DB_MAX_BACKGROUND_JOBS) ³ The number threads used for indexing resources (DB_RESOURCE_INDEXER_THREADS)

Datasets

The following datasets were used:

Dataset History # Pat. ¹ # Res. ² # Obs. ³ Disc Size
100k 10 years 100 k 104 M 59 M 202 GiB
100k-fh full 100 k 317 M 191 M 323 GiB
1M 10 years 1 M 1044 M 593 M 1045 GiB

¹ Number of Patients, ² Total Number of Resources, ³ Number of Observations

The creation of the datasets is described in the Synthea section. The disc size is measured after full manual compaction of the database and can be higher during the import.

Import

blazectl --server http://localhost:8080/fhir upload -c8 <dataset>
System Dataset Time (h) Resources/s
LEA47 100k 3,048 9469
LEA47 100k-fh 9,797 8977
LEA58 100k 2,169 13305
LEA58 100k-fh 7,426 11843