Brief Description of MaTEx TensorFlow
abhinavvishnu edited this page May 2, 2017 · 9 revisions
MaTEx-TensorFlow provides a user-transparent distributed memory implementation of TensorFlow using MPI.
For more details, please read our arXiv paper, "User-transparent Distributed TensorFlow" (https://arxiv.org/abs/1704.04560).
Features:
- Automatic parallelization of TensorFlow using MPI for GPU and CPU clusters.
- Support for data parallelism, which is sufficient for the majority of DL implementations.
- Support for parallel data readers, including formats such as MNIST, CIFAR-10, and Parallel NetCDF (PNETCDF).
This Git repository contains everything required to install and run MPI-enabled TensorFlow. MaTEx-TensorFlow is based on the 1.0.0 release of TensorFlow.
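To make the data parallelism listed among the features more concrete, here is a minimal single-process sketch of the general idea: each worker computes a gradient on its own shard of the data, the gradients are averaged across workers, and every worker applies the same update. This is not MaTEx-TensorFlow code and does not use MPI or TensorFlow; the function names (`local_gradient`, `allreduce_mean`, `data_parallel_step`, `train`) and the toy model are hypothetical, and `allreduce_mean` only stands in for what a real MPI allreduce would do across processes.

```python
# Single-process simulation of data-parallel SGD.
# Illustrative only: not MaTEx-TensorFlow code, and no MPI is involved.

def local_gradient(w, shard):
    """Gradient of the mean squared error for the 1-D model y = w * x on one shard."""
    return sum(2.0 * (w * x - y) * x for x, y in shard) / len(shard)

def allreduce_mean(values):
    """Stand-in (assumption) for an MPI allreduce sum divided by the worker count."""
    return sum(values) / len(values)

def data_parallel_step(w, shards, lr):
    """Each simulated worker computes a gradient on its shard; all apply the average."""
    grads = [local_gradient(w, shard) for shard in shards]
    return w - lr * allreduce_mean(grads)

def train(shards, steps=200, lr=0.01, w=0.0):
    """Run a fixed number of synchronous data-parallel steps."""
    for _ in range(steps):
        w = data_parallel_step(w, shards, lr)
    return w

# Toy data generated from y = 3x, split evenly across two simulated workers.
data = [(float(x), 3.0 * x) for x in range(1, 9)]
shards = [data[:4], data[4:]]
w_final = train(shards)
```

With equally sized shards, the averaged gradient matches the gradient computed on the full data set, so in this sketch the simulated two-worker run follows the same trajectory as a single worker that sees all the data.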
Getting Started on MaTEx-TensorFlow
- Required Software
- Installing MaTEx-TensorFlow on CPU Clusters
- Installing MaTEx-TensorFlow on GPU Clusters
- MaTEx-TensorFlow on Older glibc (v < 2.19)
- DataSet Reader
- Testing Scripts
- Performance
- Running on PNNL Systems
- Running on NERSC Systems
- Restarting the MaTEx TensorFlow environment