A demo project on batch data-parallel processing using Apache Beam and Python
python
count
pipeline
transformations
kaggle
batch
apache-beam
google-colab
colab-notebook
pcollections
beam-python
beam-sdk
groupby-transformation
-
Updated
Mar 26, 2021 - Jupyter Notebook