docker exec -it master bash
root@588acf96a879:/app# pyspark
file = spark.read.text("/dataset/yahoo-symbols-201709.csv")
file.count()
for line in file.take(10):
print(line)
Acceda al Jupyter Notebook aqui, los notebook disponibles en ese workshop están en Github
- Apache Spark in Python: Beginner's Guide
- Introduction to PySpark
- pySpark: Evaluating the machine learning model
- Python Data Visualization with Matplotlib
- Top 50 matplotlib Visualizations
- Seaborn Library for Data Visualization in Python: Part 1
Gustavo Arjones © 2017-2020