Skip to content
#

ydata-profiling

Here are 10 public repositories matching this topic...

This ETL (Extract, Transform, Load) project employs several Python libraries, including Airflow, Soda, Polars, YData Profiling, DuckDB, Requests, Loguru, and Google Cloud to streamline the extraction, transformation, and loading of CSV datasets from the U.S. government's data repository at https://catalog.data.gov.

  • Updated Dec 10, 2023
  • HTML

Data profiling y-data profile, Data staging (Staging tables), Talend for ETL jobs, MySQL validations Dimensional model (Target tables), Facts and Dimensions, Mapping document explaining the source column name and where it finally maps to target column, Stage to Target, Document all transformations if any

  • Updated May 22, 2024
  • HTML

Improve this page

Add a description, image, and links to the ydata-profiling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ydata-profiling topic, visit your repo's landing page and select "manage topics."

Learn more