Skip to content

Latest commit

 

History

History
93 lines (79 loc) · 7.82 KB

data-warehouse-naming-conventions.md

File metadata and controls

93 lines (79 loc) · 7.82 KB

Data Warehouse Naming Conventions

via

[type][subject][name]

  • where type is 'dim' or 'fact' (or 'facts' for aggregates)
  • where subject is the subject area within the warehouse ('comm' for common, 'fw' for firewall, 'ids', etc)
  • where name is ideally a single word name, or abbreviations of dimensions in the case of an aggregate table
  • ex: dim_comm_org for the organizational dimension
  • ex: fact_scan for the scan fact table
  • ex: facts_scan_org_sev_daily - fact scan summary table grouped at the org, sev & day level

Column Names:

don't prefix with the entire table name - that's way too long

do prefix with just a meaningful part of it - this helps tremendously when writing or reading queries.

Sources

via

Type Linked Service Name Linked Service Dataset Full
Databases SQL Server* MSQL_ LS_MSQL_ DS_MSQL_ LS_SQL_Example
Oracle* ORAC_ LS_ORAC_ DS_ORAC_ LS_ORAC_Example
MySQL* MYSQ_ LS_MYSQ_ DS_MYSQ_ LS_MYSQ_Example
DB2* DB2_ LS_DB2_ DS_DB2_ LS_DB2_Example
Teradata* TDAT_ LS_TDAT_ DS_TDAT_ LS_TDAT_Example
PostgreSQL* PG_ LS_PG_ DS_PG_ LS_PG_Example
Sybase* SYBA_ LS_SYBA_ DS_SYBA_ LS_SYBA_Example
Cassandra* CASS_ LS_CASS_ DS_CASS_ LS_CASS_Example
MongoDB* MONG_ LS_MONG_ DS_MONG_ LS_MONG_Example
Amazon Redshift AMRED_ LS_AMRED_ DS_AMRED_ LS_AMRED_Example
File File System* FILE_ LS_FILE_ DS_FILE_ LS_FILE_Example
HDFS* HDFS_ LS_HDFS_ DS_HDFS_ LS_HDFS_Example
Amazon S3 AMS3_ LS_AMS3_ DS_AMS3_ LS_AMS3_Example
FTP FTP_ LS_FTP_ DS_FTP_ LS_FTP_Example
Services Finale FNLE_ LS_FNLE_ DS_FNLE_ LS_FNLE_Example
Airtable AIRT_ LS_AIRT_ DS_AIRT_ LS_AIRT_Example
Klaviyo AIRT_ LS_AIRT_ DS_AIRT_ LS_AIRT_Example
Northbeam NBM_ LS_NBM_ DS_NBM_ LS_NBM_Example
ShipStation AIRT_ LS_AIRT_ DS_AIRT_ LS_AIRT_Example
SOS Inventory SOS_ LS_SOS_ DS_SOS_ LS_SOS_Example
EasyPost AIRT_ LS_AIRT_ DS_AIRT_ LS_AIRT_Example
ZenDesk ZEN_ LS_AIRT_ DS_AIRT_ LS_AIRT_Example
Andlor ANDL_ LS_ANDL_ DS_ANDL_ LS_ANDL_Example
Storage Made Easy SME_ LS_SME_ DS_SME_ LS_SME_Example
Google Google Big Table GBT_ LS_GBT_ DS_GBT_ LS_GBT_Example
Google Tag Manager GTM_ LS_GTM_ DS_GTM_ LS_GTM_Example
Google Analytics GA_ LS_GA_ DS_GA_ LS_GA_Example
Google Sheets GSHT_ LS_GSHT_ DS_GSHT_ LS_GSHT_Example
Google Docs GDOC_ LS_GDOC_ DS_ GDOC_ LS_ GDOC_Example
Azure Azure Blob storage AZBLB_ LS_AZBLB_ DS_AZBLB_ LS_AZBLB_Example
Azure Data Lake Store AZDLS_ LS_AZDLS_ DS_AZDLS_ LS_AZDLS_Example
Azure SQL Database AZSQL_ LS_AZSQL_ DS_AZSQL_ LS_AZSQL_Example
Azure SQL Data Warehouse AZSDW_ LS_AZSDW_ DS_AZSDW_ LS_AZSDW_Example
Azure Table storage AZTBL_ LS_AZTBL_ DS_AZTBL_ LS_AZTBL_Example
Azure DocumentDB AZDOC_ LS_AZDOC_ DS_AZDOC_ LS_AZDOC_Example
Azure Search Index AZSER_ LS_AZSER_ DS_AZSER_ LS_AZSER_Example
Others Salesforce SAFC_ LS_SAFC_ DS_SAFC_ LS_SAFC_Example
Generic ODBC* ODBC_ LS_ODBC_ DS_ODBC_ LS_ODBC_Example
Generic OData ODAT_ LS_ODAT_ DS_ODAT_ LS_ODAT_Example
Web Table (table from HTML) WEBT_ LS_WEBT_ DS_WEBT_ LS_WEBT_Example
GE Historian* GEHI_ LS_GEHI_ DS_GEHI_ LS_GEHI_Example

Pipelines

via

Type Name Action Example
Data movement Activity PL_DATA_ NA PL_DATA_DS_SQL_Person_To_DS_ABLB_Person
Data transformation pipeline PL_TRAN_ SPRC – Stored Procedure PL_TRAN_SPRC_CleanDimAccount
PL_TRAN_ DNET – Script PL_TRAN_DNET_AggregateSales
PL_TRAN_ ADLK – Azure Data Lake PL_TRAN_ADLK_AggregateSales
PL_TRAN_ HIVE – Hive PL_TRAN_HIVE_AggregateSales
PL_TRAN_ PIG – Pig PL_TRAN_PIG_AggregateSales
PL_TRAN_ MAPR – MapReduce PL_TRAN_MAPR_AggregateSales
PL_TRAN_ HADP – Hadoop Stream PL_TRAN_HADP_StreamData
PL_TRAN_ AML – Azure Machine Learning PL_TRAN_AML_CalculateMonthlyChurn
Data Segementation PL_SEG_ OPS manual ops classification PL_SEG_CLAS_UserFirstPurchaseBehavior
Model Training PL_MTRAIN_ SUP supervised training PL_MTRAIN_SUP_UserFirstPurchaseBehavior
Model Training PL_MTRAIN_ UNSUP unsupervised training PL_MTRAIN_UNSUP_UserFirstPurchaseBehavior
Model Training PL_MTRAIN_ SEMISUP semi-supervised training PL_MTRAIN_SEMISUP_UserFirstPurchaseBehavior
Model Training PL_MTRAIN_ REIF reinforcement training PL_MTRAIN_REIF_UserFirstPurchaseBehavior
Model Evaluation PL_MEVAL_ OPS human model eval PL_MEVAL_OPS_UserFirstPurchaseBehavior
Model Evaluation PL_MEVAL_ REPLAY model eval with data PL_MEVAL_AML_UserFirstPurchaseBehavior
Model Deployment PL_MODL_ depoyed model name PL_MODL_UserFirstPurchaseBehavior
Model Performance PL_MPERF_ model perf metrics PL_MPERF_UserFirstPurchaseBehavior