This repo contains the code for an end-to-end distributed deep learning pipeline.
The process happens in 7 steps (minimal, illustrative sketches of the steps follow the list):
- Real-time streaming or batch data is captured with Debezium via change data capture (connector registration sketched below).
- The captured change stream or batch data is pushed to Apache Kafka topics through Kafka Connect.
- Apache Flink performs the ETL operations on the Kafka streams (PyFlink sketch below).
- Predictions for the streaming/batch data come from models deployed with TensorFlow Serving on Docker (REST client sketch below).
- Frequently accessed data is cached with RocksDB (caching sketch below).
- Once the required predictions are made, all the data is pushed into Apache Druid, where further processing takes place (query sketch below).
- The enriched data in Druid can then power personalized predictions, cancellation-probability estimates, time-series forecasting, and similar analytics.
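The first two steps are typically wired together by registering a Debezium source connector with Kafka Connect, which then publishes change events to Kafka topics. Below is a minimal sketch, assuming a MySQL source database and a Kafka Connect worker on `localhost:8083`; the hostnames, credentials, and table names are placeholders, not values from this repo, and exact property names depend on the Debezium version (older releases use `database.server.name` instead of `topic.prefix`).

```python
import json
import requests  # pip install requests

# Hypothetical Debezium MySQL source connector config (Debezium 2.x naming).
# Hostnames, credentials, and table names are placeholders for illustration.
connector = {
    "name": "bookings-source",
    "config": {
        "connector.class": "io.debezium.connector.mysql.MySqlConnector",
        "database.hostname": "mysql",
        "database.port": "3306",
        "database.user": "debezium",
        "database.password": "dbz",
        "database.server.id": "184054",
        "topic.prefix": "dbserver1",            # topics become dbserver1.<db>.<table>
        "table.include.list": "app.bookings",
        "schema.history.internal.kafka.bootstrap.servers": "kafka:9092",
        "schema.history.internal.kafka.topic": "schema-changes.app",
    },
}

# Register the connector via the Kafka Connect REST API.
resp = requests.post(
    "http://localhost:8083/connectors",
    headers={"Content-Type": "application/json"},
    data=json.dumps(connector),
)
resp.raise_for_status()
print(resp.json())
```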
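The Flink ETL step is not spelled out here; one plausible shape is a PyFlink Table API job that reads the Debezium topic from Kafka, cleans and aggregates it, and writes the result back to Kafka for downstream scoring and Druid ingestion. The table schemas, topic names, and broker address below are assumptions for illustration (the repo may equally use the DataStream API or Java).

```python
from pyflink.table import EnvironmentSettings, TableEnvironment

# Streaming Table API environment.
# Note: the Flink Kafka SQL connector JAR must be on the job's classpath.
t_env = TableEnvironment.create(EnvironmentSettings.in_streaming_mode())

# Source: change events produced by Debezium (schema/topic/broker are placeholders).
t_env.execute_sql("""
    CREATE TABLE bookings (
        booking_id STRING,
        user_id    STRING,
        amount     DOUBLE,
        event_time TIMESTAMP(3)
    ) WITH (
        'connector' = 'kafka',
        'topic' = 'dbserver1.app.bookings',
        'properties.bootstrap.servers' = 'kafka:9092',
        'format' = 'debezium-json',
        'scan.startup.mode' = 'earliest-offset'
    )
""")

# Sink: per-user aggregates as an upsert stream (accepts updates from GROUP BY).
t_env.execute_sql("""
    CREATE TABLE bookings_per_user (
        user_id      STRING,
        total_amount DOUBLE,
        PRIMARY KEY (user_id) NOT ENFORCED
    ) WITH (
        'connector' = 'upsert-kafka',
        'topic' = 'bookings_per_user',
        'properties.bootstrap.servers' = 'kafka:9092',
        'key.format' = 'json',
        'value.format' = 'json'
    )
""")

# A simple ETL step: drop bad rows and aggregate per user.
t_env.execute_sql("""
    INSERT INTO bookings_per_user
    SELECT user_id, SUM(amount) AS total_amount
    FROM bookings
    WHERE amount > 0
    GROUP BY user_id
""")
```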
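Getting predictions from TensorFlow Serving usually means calling its REST predict endpoint from the streaming job or a small client. A minimal sketch, assuming a model exported and served by the `tensorflow/serving` Docker image on the default REST port 8501; the model name `booking_model` and the feature vector are placeholders.

```python
import json
import requests  # pip install requests

# TensorFlow Serving REST API: POST /v1/models/<name>:predict
# The model name and feature layout are placeholders for illustration.
SERVING_URL = "http://localhost:8501/v1/models/booking_model:predict"

def predict(feature_rows):
    """Send one or more feature rows to TF Serving and return its predictions."""
    payload = {"instances": feature_rows}
    resp = requests.post(SERVING_URL, data=json.dumps(payload), timeout=5)
    resp.raise_for_status()
    return resp.json()["predictions"]

if __name__ == "__main__":
    print(predict([[0.3, 1.0, 42.0]]))  # dummy feature vector
```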
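The RocksDB caching step is also left open here: it could refer to Flink's embedded RocksDB state backend, or to a standalone local key-value cache for features and predictions. A sketch of the latter using the `python-rocksdb` bindings; the database path, keys, and value layout are invented for illustration.

```python
import json
import rocksdb  # pip install python-rocksdb

# Open (or create) a local RocksDB instance used as an embedded cache.
db = rocksdb.DB("prediction_cache.db", rocksdb.Options(create_if_missing=True))

def cache_prediction(entity_id: str, prediction: dict) -> None:
    """Store a prediction keyed by entity id (key/value layout is illustrative)."""
    db.put(entity_id.encode(), json.dumps(prediction).encode())

def get_cached_prediction(entity_id: str):
    """Return the cached prediction, or None on a cache miss."""
    raw = db.get(entity_id.encode())
    return json.loads(raw) if raw is not None else None

cache_prediction("user-42", {"cancellation_probability": 0.17})
print(get_cached_prediction("user-42"))
```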
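For the last two steps, data typically lands in Druid through a Kafka ingestion supervisor and is then queried with Druid SQL for analytics such as cancellation probabilities or forecasting inputs. A minimal query sketch against Druid's SQL endpoint (router on the default port 8888); the datasource and column names are placeholders.

```python
import requests  # pip install requests

# Druid SQL over HTTP: POST /druid/v2/sql on the router (default port 8888).
DRUID_SQL_URL = "http://localhost:8888/druid/v2/sql"

# Datasource and column names are placeholders for illustration.
query = """
    SELECT user_id, AVG(cancellation_probability) AS avg_cancel_prob
    FROM bookings_predictions
    WHERE __time >= CURRENT_TIMESTAMP - INTERVAL '7' DAY
    GROUP BY user_id
    ORDER BY avg_cancel_prob DESC
    LIMIT 10
"""

resp = requests.post(DRUID_SQL_URL, json={"query": query}, timeout=30)
resp.raise_for_status()
for row in resp.json():
    print(row)
```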
Made with ❤️ by Praneet Pabolu