Student project - analyze school district data using pandas & Jupyter Notebook
pandas: https://pandas.pydata.org/
Jupyter Notebook: https://jupyter.org/
Resources provided by:
© 2021 Trilogy Education Services, LLC, a 2U, Inc. brand. Confidential and Proprietary. All Rights Reserved.
Well done! Having spent years analyzing financial records for big banks, you've finally scratched your idealistic itch and joined the education sector. In your latest role, you've become the Chief Data Scientist for your city's school district. In this capacity, you'll be helping the school board and mayor make strategic decisions regarding future school budgets and priorities.
As a first task, you've been asked to analyze the district-wide standardized test results. You'll be given access to every student's math and reading scores, as well as various information on the schools they attend. Your responsibility is to aggregate the data to and showcase obvious trends in school performance.
Your final report should include each of the following:
- Create a high level snapshot (in table form) of the district's key metrics, including:
- Total Schools
- Total Students
- Total Budget
- Average Math Score
- Average Reading Score
- % Passing Math (The percentage of students that passed math.)
- % Passing Reading (The percentage of students that passed reading.)
- % Overall Passing (The percentage of students that passed math and reading.)
- Create an overview table that summarizes key metrics about each school, including:
- School Name
- School Type
- Total Students
- Total School Budget
- Per Student Budget
- Average Math Score
- Average Reading Score
- % Passing Math (The percentage of students that passed math.)
- % Passing Reading (The percentage of students that passed reading.)
- % Overall Passing (The percentage of students that passed math and reading.)
- Create a table that highlights the top 5 performing schools based on % Overall Passing. Include:
- School Name
- School Type
- Total Students
- Total School Budget
- Per Student Budget
- Average Math Score
- Average Reading Score
- % Passing Math (The percentage of students that passed math.)
- % Passing Reading (The percentage of students that passed reading.)
- % Overall Passing (The percentage of students that passed math and reading.)
- Create a table that highlights the bottom 5 performing schools based on % Overall Passing. Include all of the same metrics as above.
- Create a table that lists the average Math Score for students of each grade level (9th, 10th, 11th, 12th) at each school.
- Create a table that lists the average Reading Score for students of each grade level (9th, 10th, 11th, 12th) at each school.
- Create a table that breaks down school performances based on average Spending Ranges (Per Student). Use 4 reasonable bins to group school spending. Include in the table each of the following:
- Average Math Score
- Average Reading Score
- % Passing Math (The percentage of students that passed math.)
- % Passing Reading (The percentage of students that passed reading.)
- % Overall Passing (The percentage of students that passed math and reading.)
- Repeat the above breakdown, but this time group schools based on a reasonable approximation of school size (Small, Medium, Large).
- Repeat the above breakdown, but this time group schools based on school type (Charter vs. District)
-
Background and datasets provided as part of Georgia Tech Data Analytics Boot Camp:
© 2021 Trilogy Education Services, LLC, a 2U, Inc. brand. Confidential and Proprietary. All Rights Reserved.
-
Project Author: Valerie Pippenger - https://github.com/Pip85
-
This project required merging two data files of metrics on a school district, cleaning and organizing data into tables for data analysis on school performance.
-
Metrics include school size, spending, type of school (district or charter) and performance in math, reading & both.
-
The first table provides a summary shot of performance in reading, math and overall in the district as a whole.
-
The second table provides a more detailed look at performance in those same categories at the individual school level.
-
The following two tables look at the top and bottom performers indicating the better performance in charter type schools.
-
Tables 5 & 6 examine performance by school across each grade (9th through 12th).
-
The final three tables examine performance based on school spending per student, school size and type.
-
Jupyter Notebook link:
https://github.com/Pip85/pandas-school_district_analysis/blob/main/PyCitySchools/PyCitySchools.ipynb