For this project, we generate a dataset for three classes each with 500 samples from three Gaussian distribution described below:
Use generated data and estimate the density without pre-assuming a model for the distribution which is done by a non-parametric estimation. Implement the Gaussian kernel PDF estimation methods using Standard Deviations of 0.2, 0.6, 0.9. Estimate P(X) and Plot the estimated PDF.
Also we find the best value for h in the Gaussian kernel model with the standard deviation of 0.6 using 5-Fold cross-validation and for this goal the squared error between the actual function and the estimated Gaussian kernel function should be minimized.
Then employ the estimated Gaussian kernel for each class and do the followings with standard
deviation 0.6:
a) Divide the samples into a 90% train and 10% test data randomly.
b) Use Bayesian estimation and predict the class labels while reporting train, test and
total accuracies.