re_data - fix data issues before your users & CEO would discover them 😊
-
Updated
Apr 30, 2024 - HTML
re_data - fix data issues before your users & CEO would discover them 😊
Collection of R scripts to test packages in conducting data quality assessments
This ETL (Extract, Transform, Load) project employs several Python libraries, including Airflow, Soda, Polars, YData Profiling, DuckDB, Requests, Loguru, and Google Cloud to streamline the extraction, transformation, and loading of CSV datasets from the U.S. government's data repository at https://catalog.data.gov.
A collection of Databricks notebooks for testing and learning
Explore the world of European football through comprehensive quantitative analysis, uncovering valuable insights into player attributes, potential, and wage determinants.
collection of Jupyter Notebooks in both English and Spanish, dedicated to performing data quality analysis using the R programming language
Add a description, image, and links to the data-quality-checks topic page so that developers can more easily learn about it.
To associate your repository with the data-quality-checks topic, visit your repo's landing page and select "manage topics."