feat: added DAG readme.md
parent
3d230263e2
commit
5ed10fb179
|
@ -1,3 +1,23 @@
|
||||||
# Sustainability score
|
# Sustainability score
|
||||||
|
|
||||||
Placeholder
|
This DAG orchestrates the ingestion and transformation of prodcuts from
|
||||||
|
Target's website to compute their sustainability score.
|
||||||
|
|
||||||
|
## Steps
|
||||||
|
|
||||||
|
* create_products_table: create the prodcuts table with it's schema
|
||||||
|
* etl_pipeline: run the apache beam etl process
|
||||||
|
* dbt_run: run `dbt run` to apply transformations
|
||||||
|
* dbt_test: run `dbt test` to test the data quality
|
||||||
|
|
||||||
|
## Config
|
||||||
|
|
||||||
|
The following parameters are available:
|
||||||
|
|
||||||
|
* `input`: location of the CSV input file
|
||||||
|
* `beam_etl_path`: location of the apache beam pipeline
|
||||||
|
* `dbt_path`: location of the dbt project
|
||||||
|
* `products_table`: products_table table name
|
||||||
|
|
||||||
|
I decided not to configure the rest of the table locations because that makes
|
||||||
|
more sense to be defined in DBT.
|
||||||
|
|
Loading…
Reference in New Issue