Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Warehouse: legacy CPRD DB replication #17624

Closed
1 of 13 tasks
thorwolpert opened this issue Sep 6, 2023 · 0 comments
Closed
1 of 13 tasks

Warehouse: legacy CPRD DB replication #17624

thorwolpert opened this issue Sep 6, 2023 · 0 comments
Labels
analytics 💹 analytics Epic SRE SRE team task

Comments

@thorwolpert
Copy link
Collaborator

thorwolpert commented Sep 6, 2023

Replicate valid CPRD schemas to the Warehouse

To perform analytics, search bulk loads, and as a historical copy, a set of valid schemas and tables will be replicated as an operational data store to the internal warehouse project.

Determine schemas and relevant tables

Create the ODS

  • Determine if ODS is HA or recoverable from backups
  • Create ODS

Add schemas and relevant tables to the existing replication

  • Create schema and tables in CloudSQL postgres for the identified schemas and tables
  • Create bulk loading scripts
  • Create incremental loading scripts

Load extracts to target systems as needed

  • Determine if any systems need copies of the data
  • Load to a max of 2 of those (another DB, BigQuery, etc)

Service accounts for accessing the ODS

  • Create procedure to create service accounts for systems to access ODS
    • Analytics
    • Business Search
    • Person Search
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
analytics 💹 analytics Epic SRE SRE team task
Projects
None yet
Development

No branches or pull requests

3 participants