When Google Cloud customers start using BigQuery, they usually begin with the on-demand billing model. Once their usage stabilizes, they start evaluating the Reservations model. They typically ask three main questions:
- Will moving to reservations save me money?
- What max_slots value will give me the best cost-performance ratio?
- Should I set a baseline value? If so, what value?
This notebook contains a series of queries that start from the INFORMATION_SCHEMA.JOBS and INFORMATION_SCHEMA.JOBS_TIMELINE views and try to calculate values that answer those questions. You can find detailed instructions and explanations in the notebook itself.
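As a rough illustration of the kind of comparison involved, the sketch below estimates an on-demand bill from bytes billed and a reservation bill from hourly slot usage. It is not the notebook's actual logic. The function names, the prices, and the sample figures are all hypothetical, and the model ignores how the autoscaler rounds and bills slots.

```python
from typing import Sequence

TIB = 1024 ** 4            # bytes in one tebibyte
MS_PER_HOUR = 3_600_000    # milliseconds in one hour


def on_demand_cost(total_bytes_billed: int, price_per_tib: float) -> float:
    """Estimated on-demand cost: bytes billed, priced per TiB."""
    return total_bytes_billed / TIB * price_per_tib


def reservation_cost(
    slot_ms_per_hour: Sequence[int],
    baseline: int,
    max_slots: int,
    price_per_slot_hour: float,
) -> float:
    """Estimated reservation cost for a series of hourly slot-ms totals.

    Simplifying assumptions (hypothetical model):
    - the baseline is billed every hour, even when idle;
    - usage above the baseline is billed up to max_slots;
    - demand above max_slots is not billed (those queries just run slower).
    """
    billed_slot_hours = 0.0
    for slot_ms in slot_ms_per_hour:
        avg_slots = slot_ms / MS_PER_HOUR          # average slots used in that hour
        billed = max(baseline, min(avg_slots, max_slots))
        billed_slot_hours += billed
    return billed_slot_hours * price_per_slot_hour


# Illustrative inputs only: substitute your own prices and measured usage.
example_on_demand = on_demand_cost(total_bytes_billed=2 * TIB, price_per_tib=5.0)
example_reserved = reservation_cost(
    slot_ms_per_hour=[100 * MS_PER_HOUR, 10 * MS_PER_HOUR],
    baseline=20,
    max_slots=50,
    price_per_slot_hour=0.5,
)
```

Running the same calculation for several candidate baseline and max_slots values gives a first feel for the trade-off; the notebook works from your real job history rather than made-up samples.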
- Clone the repository.
- Upload the Slot_machine.ipynb file to a BigQuery notebook.
- Follow the instructions in the notebook itself.
Workloads change over time, even from day to day. Although the evaluation tries to be as precise as possible, there is no way to know in advance exactly what the cost or the slot consumption will be. Treat the result as an estimate: continuous monitoring and tuning will still be needed.