- Python 3.9.0,
- Anaconda Navigator 1.9.12,
- Jupyter notebook 6.0.3,
- Pandas, NumPy
- Data Source: clean_students_complete.csv
in the first part of this analysis, we are going to explore data from different schools in the district, in order to see how they compare to each other based on different metrics; for that we are goign to produce
- A high-level snapshot of the district's key metrics, presented in a table format
- An overview of the key metrics for each school, presented in a table format
- Tables presenting each of the following metrics:
- Top 5 and bottom 5 performing schools, based on the overall passing rate
- The average math score received by students in each grade level at each school
- The average reading score received by students in each grade level at each school
- School performance based on the budget per student
- School performance based on the school size
- School performance based on the type of school
After rumors of academic dishonesty at Thomas High School relating to the 9th grad math and reading scores, we were adviced to not take the data that relates to the incident into account, and to reproduce the same analysis with the new altered data, in order to see if that would affect the previously displayed results.
for that we will :
nullify all 9th graders math and reading scores:
student_data_df.loc[(student_data_df["grade"] == "9th") & (student_data_df["school_name"] == "Thomas High School"), ["math_score","reading_score"]] = np.nan
we can see that Thomas High School's average scores went down by about the third.
In order to keep integrity and fairness on our part, we are going to replace the average scores for Thomas High School with the new averages that discount the 9th grad scores.
we can see that Thomas High School is still amongst the top 5 schools even whithout taking the contested data into account.
we can see no effect on the bottom shools.
we see a Nan for 9th grad for Thomas High School
-
the changes made affected little to no change on the results, because they were limited to one grade in one high school.
-
Thomas High School finished second in both analysis.
-
the changes made little difference on Thomas High School's result itself, which opens up 2 questions:
-
since the other grades follow the same trends 9th grad scores do is there :
1 - manipulation of the scores beyond the 9th graders?
2 - no manipulation on the 9th graders scores?