The resulting dataset has a lot of columns, and almost all of them are in Dutch :) so I think it would be clearer to explain what each column shows. First, let’s read the dataset and take a look at it and at its columns:
import pandas as pd
data_score = pd.read_csv("output/Score.csv", on_bad_lines="skip", sep=",", encoding="ISO-8859-1")  # on_bad_lines="skip" replaces error_bad_lines=False, which was removed in pandas 2.0
data_score.head()
| | SCHOOL_ID | DATUM | INSTELLINGSNAAM_VESTIGING | POSTCODE_VESTIGING | PLAATSNAAM | GEMEENTENAAM | PROVINCIE | SOORT_PO | DENOMINATIE_VESTIGING | EXAMEN | ... | PRO | VMBO | VMBO_HAVO | HAVO | HAVO_VWO | VWO | ADVIES_NIET_MOGELIJK | TOTAAL_ADVIES | LJ8 | ZIT |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 000AP_2015 | 2015 | De Schanskorf | 2715BT | ZOETERMEER | Zoetermeer | Zuid-Holland | Bo | Gereformeerd vrijgemaakt | CET | ... | 0 | 1 | 0 | 5 | 1 | 1 | 0 | 11 | 13 | 0 |
1 | 000AR_2015 | 2015 | BS "De Maasparel" | 6109AM | OHE EN LAAK | Maasgouw | Limburg | Bo | Rooms-Katholiek | CET | ... | 0 | 1 | 0 | 7 | 0 | 6 | 0 | 18 | 17 | 0 |
2 | 000AZ_2015 | 2015 | De Kiezel en de Kei | 2971AR | BLESKENSGRAAF CA | Molenwaard | Zuid-Holland | Bo | Openbaar | CET | ... | 1 | 3 | 5 | 3 | 2 | 4 | 0 | 22 | 19 | 3 |
3 | 000BA_2015 | 2015 | OBS De Klimboom | 6666EB | HETEREN | Overbetuwe | Gelderland | Bo | Openbaar | CET | ... | 1 | 1 | 0 | 1 | 4 | 0 | 0 | 12 | 24 | 5 |
4 | 000BB_2015 | 2015 | Obs Letterwies | 9944AR | NIEUWOLDA | Oldambt | Groningen | Bo | Openbaar | CET | ... | 0 | 7 | 0 | 5 | 0 | 2 | 0 | 16 | 15 | 2 |
5 rows × 33 columns
data_score.columns
Index(['SCHOOL_ID', 'DATUM', 'INSTELLINGSNAAM_VESTIGING', 'POSTCODE_VESTIGING',
'PLAATSNAAM', 'GEMEENTENAAM', 'PROVINCIE', 'SOORT_PO',
'DENOMINATIE_VESTIGING', 'EXAMEN', 'EXAMEN_AANTAL', 'EXAMEN_GEM',
'REKENEN_LAGER1F', 'REKENEN_1F', 'REKENEN_1S', 'REKENEN_2F',
'LV_LAGER1F', 'LV_1F', 'LV_2F', 'TV_LAGER1F', 'TV_1F', 'TV_2F', 'VSO',
'PRO', 'VMBO', 'VMBO_HAVO', 'HAVO', 'HAVO_VWO', 'VWO',
'ADVIES_NIET_MOGELIJK', 'TOTAAL_ADVIES', 'LJ8', 'ZIT'],
dtype='object')
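
The width of the frame can be double-checked without counting names by eye. The snippet below is a minimal sketch: `frame` is an assumed stand-in for `data_score` (an empty frame that only reuses the column names printed above), so it runs without `output/Score.csv`.

```python
import pandas as pd

# Column names copied from the Index output above.
columns = [
    "SCHOOL_ID", "DATUM", "INSTELLINGSNAAM_VESTIGING", "POSTCODE_VESTIGING",
    "PLAATSNAAM", "GEMEENTENAAM", "PROVINCIE", "SOORT_PO",
    "DENOMINATIE_VESTIGING", "EXAMEN", "EXAMEN_AANTAL", "EXAMEN_GEM",
    "REKENEN_LAGER1F", "REKENEN_1F", "REKENEN_1S", "REKENEN_2F",
    "LV_LAGER1F", "LV_1F", "LV_2F", "TV_LAGER1F", "TV_1F", "TV_2F", "VSO",
    "PRO", "VMBO", "VMBO_HAVO", "HAVO", "HAVO_VWO", "VWO",
    "ADVIES_NIET_MOGELIJK", "TOTAAL_ADVIES", "LJ8", "ZIT",
]

# Assumption: an empty stand-in for data_score with the same columns.
frame = pd.DataFrame(columns=columns)

# shape is (number of rows, number of columns).
n_columns = frame.shape[1]
print(n_columns)
```

On the real data, `data_score.shape[1]` should give the same count as the `head()` summary line.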
We can see 33 columns with different values, so let’s explain what each one means:
Columns | Meaning |
---|---|
SCHOOL_ID | School ID, built from 3 columns; it combines the school registration number and the year for which data is shown |
DATUM | Year for which data is shown |
INSTELLINGSNAAM_VESTIGING | Name of the school |
POSTCODE_VESTIGING | School postcode |
PLAATSNAAM | Place where the school is located (usually town or village) |
GEMEENTENAAM | Name of the municipality where the school is located |
PROVINCIE | Province where the school is located |
SOORT_PO | The type of primary education: bo (primary education) or sbo (special primary education) |
DENOMINATIE_VESTIGING | The philosophy or religion propagated by the school establishment |
EXAMEN | Type of final exam that the school chose for most of its students |
EXAMEN_AANTAL | Number of students who took the exam EXAMEN |
EXAMEN_GEM | Average score on the exam EXAMEN for the EXAMEN_AANTAL students who took it at the school in the year DATUM |
REKENEN_LAGER1F | The number of students with a level for arithmetic lower than the basic level 1F |
REKENEN_1F | The number of students with the basic level 1F for arithmetic |
REKENEN_1S | The number of students with the target level 1S for arithmetic |
REKENEN_2F | The number of students with the basic level 2F for arithmetic |
LV_LAGER1F | The number of students with a level below basic level 1F for reading skills |
LV_1F | The number of students with the basic level 1F for reading skills |
LV_2F | The number of students with the basic level 2F for reading skills |
TV_LAGER1F | The number of students with a level below the basic level 1F for language proficiency skills |
TV_1F | The number of students with the basic level 1F for language proficiency skills |
TV_2F | The number of students with the basic level 2F for language proficiency skills |
PRO | The number of students who got school advice for practical education |
VSO | The number of students who got school advice for secondary special education |
VMBO | The number of students who got school advice for pre-vocational secondary education (VMBO: practical and some more theoretical tracks) |
VMBO_HAVO | The number of students who got school advice for the mixed or theoretical VMBO learning path or HAVO |
HAVO | The number of students who got school advice for senior general secondary education (HAVO), which prepares for higher professional education |
HAVO_VWO | The number of students who got school advice for HAVO or pre-university education (VWO) |
VWO | The number of students who got school advice for pre-university education |
ADVIES_NIET_MOGELIJK | The number of students for whom the school has indicated that a specific advice is not possible (for example due to transition classes) |
TOTAAL_ADVIES | The total number of students with a school advice plus the number of students for whom the school has indicated that a specific advice is not possible |
LJ8 | Number of students in grade 8 |
ZIT | Number of students aged 12 or older, in grade 8, who have spent more than 8 years in education |
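
With the meanings in hand, the Dutch names can be mapped to English ones and the counts turned into shares. The snippet below is a minimal sketch, not part of the original notebook: the values come from the first three rows of the `head()` output above, while the English names in `english_names` and the `share_vwo` column are hypothetical choices made only for illustration.

```python
import pandas as pd

# First three rows of data_score.head(), limited to a few visible columns.
sample = pd.DataFrame({
    "SCHOOL_ID": ["000AP_2015", "000AR_2015", "000AZ_2015"],
    "HAVO": [5, 7, 3],
    "VWO": [1, 6, 4],
    "TOTAAL_ADVIES": [11, 18, 22],
})

# Hypothetical English names; pick whatever reads best for your analysis.
english_names = {
    "HAVO": "advice_havo",
    "VWO": "advice_vwo",
    "TOTAAL_ADVIES": "advice_total",
}
renamed = sample.rename(columns=english_names)

# Share of students per school who received VWO advice.
renamed["share_vwo"] = renamed["advice_vwo"] / renamed["advice_total"]
print(renamed)
```

The same `rename` call works on the full `data_score` frame once the mapping covers every column you want to translate; columns missing from the mapping keep their Dutch names.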