OpenClassrooms Data Scientist training program projects repository
Training planning and strategy
Educational systems data cleaning and analysis
Public health application conception
Data cleaning and analysis
Supervised learning:
Seattle buildings power consumption prediction
Unsupervised learning:
Webshop clients clustering
Marketplace products clustering and classification feasibility study
Deep learning, NLP, CNN, Transfer learning, dimension reduction
Loan attribution scoring (loan payment failure prediction)
Dashboard implementation and deployment
Interpretability (Shap)
(Refer to OC-P7 repository)
Use Spark and AWS for big data dimension reduction