-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.Rmd
61 lines (45 loc) · 5.92 KB
/
index.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
title: "Univeristy of Washington GPA Report"
author: "Yuheng Zhong, Keyan Ding, Chloe Chen, Nicholas Xu"
date: "11/08/2019"
output: html_document
---
```{r setup, include=FALSE}
library(knitr)
knitr::opts_chunk$set(echo = F)
source("scripts/boxplot_chart.R")
source("scripts/scatterplot.R")
source("scripts/wf_analysis.R")
source("scripts/summary.R")
source("scripts/summary_table.R")
```
## Introduction
This study is an exciting topic regarding UW courses and could potentially reach a lot of student’s demographics. A lot of students go to sites like ratemyprofessor.com or UW’s course evaluation to learn more about the courses before they commit. Therefore, the scarcity of the information circulating leads to considerable demand for such insights. By visualizing the avg GPA of UW courses, students can better assess the difficulty of each class they are taking. It also helps to foster transparency across college courses and would help us make informed decisions to choose the right schedule.
## Summary
Influences on a student’s decision to choose a particular course to study have been popular topics of research for many educators. For this report, we gather some general information that a student will need when deciding his/her enrollment. We are only concerned with class within Info department in this report. As indicated by the dataset, the biggest class in student enrollment is `r summa["largest_class"]`. `r summa["ax_withdrawlfail"]`is the class that has the greatest number of Drop/Fail. `r summa["min_gpa"]` is the info class that has the lowest average class gpa, while the class with most 4.0 in percentage is `r summa["best_performing_class"]`. Additionally, professor rating is also an important factor in deciding which course to take. Lastly, we find out that the professor with the highest rating is: `r summa["best_prof"]`.
## Table
For the summary table, we choose to summarize the dataset by the popularity of classes. The popularity of class will be based on the student count of individual class. The most popular class will be the class with the largest student count number. We want to use this table to provide a insight of enrollment statistics of each informatics classes and student's preferences among these classes. As indicated in the table, the most popular class is INFO 200 INTELL FOUNDATIONS which has a total student enrollment of 6678. The least popular class is INFO 245: DB MGMT FNDMNTLS II which only has a total student enrollment of 18.
```{r echo=FALSE}
table
```
## Charts
### Chart 1 - Average Drop / Fail rate for INFO courses
Firstly we want to find out the hardest courses in Informatics by examining the number of students who withdraw or fail in the class. In this analysis, we define all the students who either withdraw or fail on the course to be unable to finish the course normally. Thus, by comparing the total attended students and the number of withdrawing/drop, we can calculate and compare the difficulty of each course.
For this chart, we pulled ten examples with the highest drop and failed rates among all the Info courses. According to the result, INFO 441 is the most challenging course that approximately 18% of the students will either drop or fail in the class.
```{r}
chart1
```
### Chart 2 - Average GPAs for INFO courses
Next we want to include a high-level overview of students' performance on all the 100 - 400 level INFO courses from 2010 autumn to 2018 autumn. We first define "students' performance" to be the average GPA score, since the mean value is a good indicator of performance here and we have access to the average GPA data for each INFO course at each its operating quarter and year, so we can easily compute the final average GPA score of each course for all its operating quarters. We then chose the scatterplot as our visualization tool for the aforementioned aim, since each data point we want to plot, which is the average GPA for the according course, can be easily represented by a circle/marker. And after placing all the markers on a scatterplot, we are able to see the change of performance as the course level proceeds. Another important motive is we want to give the readers an insight of the average size of each course, and this can be easily accomplished by just decr/increasing the radius of each marker!
```{r echo=FALSE, fig.width=11, fig.height=8}
chart2
```
From the chart, we observed that intro courses generally have a bigger class size. In specific, INFO 101, 102, 198, 200, 201, 270, 300 are significant in their class size. For the courses in the middle i.e. intermediate level courses, class sizes are getting much smaller to around 30 except for INFO 380, with two different titles in the past, which had an average class size of approximately 120. Lastly for the upper division courses, their size are kept small to around 30 as well but with a few more exceptions: INFO 450, 470, 490, 491, which have an average class size ranging from 68 to all the way 139. The average GPAs for INFO courses are between 3.3 to 3.9, where the least value is held by INFO 180, 344, 447, 461 and the most is held by INFO 365 and 491.
### Chart 3 - GPA distribution across info200 class
**Why to use a stacked bar chart?**
The reason to include this graph is that it explains the grading disparity across sections. With fixed length across different sections, we can easily visualize which professor's class has more A than the other in terms of percentage. Using stacked bar chart we can clearly see some professor gives eaiser grades on the class. Other charting options could not communicate grade proportioan effectively and clearly.
```{r}
final_plot
```
**What information does it reveal?**
This graph shows each letter grades given from all professors of INFO 200. For example, the majority of the Chapin's class earned A, while very few students from Freeman's class earned an A. From this graph, students can quickly compare which professor's class is easier in general, and thus making an informed decision.