⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Updated Nov 14, 2024 · Python
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
Data governance and data quality checking/monitoring platform (Django + jQuery + MySQL)
Possibly the fastest DataFrame-agnostic quality check library in town.
Swiple enables you to easily observe, understand, validate and improve the quality of your data
Lightweight library to write, orchestrate and test your SQL ETL. Writing ETL with data integrity in mind.
Code for blog at https://www.startdataengineering.com/post/python-for-de/
hooqu is a library built on top of Pandas-like DataFrames for defining "unit tests for data". It is a spiritual port of Apache Deequ to Python.
Safety net for machine learning pipelines. Plays nice with sklearn and pandas.
🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎
⚡ Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
Final course project at CESAR SCHOOL focused on evaluating data quality tools.
Data quality monitoring library designed for time series data and built for the modern data stack
Validate tabular data in Python
Schedule, automate, and monitor data pipelines using Apache Airflow. Run data quality checks, track data lineage, and work with data pipelines in production.
Automatically validate datasets, poll task status, and display validation results in a GitHub pull request using Swiple.
Framework to Automatically Determine the Quality of Open Data Catalogs
Qalita Public Packs
This application lets a user perform quality checks on their dataset