GitHub - nischaybikramthapa/dbt-athena-tpch: This project demonstrates how you can build downstream data pipeline using dbt in athena

Transformation pipeline with dbt

This example demonstrates how you can build a downstream data pipeline using dbt on Amazon Athena. Outputs of this pipeline are created as views and tables in the glue catalog.

Prerequisites

AWS CLI
Install dbt athena adapter pip install dbt-athena-community

Connecting to Amazon Athena

tpch:
  target: dev
  outputs:
    dev:
      type: athena
      s3_staging_dir: [S3 URI with prefix]
      region_name: [AWS REGION]
      schema: [Schema name]
      database: [database name]
      aws_profile_name: [your AWS Profile]

Instructions

Download the data from here and upload to your S3 bucket.
Each table should be saved under their prefix. For instance, to save your customer table data it should be under s3://your_bucket/customers/data.csv.
Once uploaded, create a glue crawler and run it. This should crawl all the folders and create tables in the glue catalog. The database name for this project is tpch_raw
Now from your root directory, install all dbt dependencies dbt deps
Finally, dbt run --project-dir . --profiles-dir .

This should create a new schema in your glue catalog based on the schema name you provided in your dbt profile.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
analyses		analyses
dbt_packages/dbt_utils		dbt_packages/dbt_utils
logs		logs
macros		macros
models		models
screenshots		screenshots
snapshots		snapshots
tests		tests
.gitignore		.gitignore
README.md		README.md
dbt_project.yml		dbt_project.yml
packages.yml		packages.yml
profiles.yml		profiles.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformation pipeline with dbt

Prerequisites

Connecting to Amazon Athena

Instructions

Original Database Schema

Architecture

Data Lineage

About

Releases

Packages

Languages

nischaybikramthapa/dbt-athena-tpch

Folders and files

Latest commit

History

Repository files navigation

Transformation pipeline with dbt

Prerequisites

Connecting to Amazon Athena

Instructions

Original Database Schema

Architecture

Data Lineage

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages