Slide 1

Slide 1 text

Towards a Higher Education Profiler Alex Singleton, Paul Longley, Alan Wilson

Slide 2

Slide 2 text

Introduction • Views on the transition from PhD to Post Doctorate – ESRC First Grant Scheme • Some new data sources • Some new insights • Beta Educational Profiler • From description to prediction

Slide 3

Slide 3 text

My transition from an unconventional PhD an unconventional Post Doc • A Spatio-Temporal Analysis of Access to Higher Education (aprox. 1 year ago) – Three Themes • Momentum • Unfinished Business • Future Directions – Unconventional • PhD – Spent 2 years in Cheltenham at UCAS – KTP • Post-Doc – It isn’t really one

Slide 4

Slide 4 text

Momentum • Keep it up – you are used to writing lots – Use this to write papers, grant / book proposals • Disillusion with PhD topic – Good to put it down for a while and do something else • 2 Papers on E-Society – Online validation – Digital deprivation V material deprivation • 2 Papers on Neo-Geography

Slide 5

Slide 5 text

Unfinished Business • There will be things in your thesis which you wanted to cover but didn’t have time / room – Methodological • Alternate algorithms to k-means in creation geodemographics • Geographic representations of cluster instability – related to initial seed locations • Alternate optimisation procedures –measures of spatial rather than social similarity – Domain Specific • Course Clusters – Overarching Themes • Future of area classification – Real Time Geodemographics

Slide 6

Slide 6 text

Future Directions • Like it or not, your PhD is what you are known for! – Chart hits matter: • Brunsdon = GWR • Dorling = Catograms – Unless you start again, you will always
 return to you PhD themes • It is what you know most about! – Research is driven by funding • Funding for PhD students • Funding for research grants • Building a research team enables – You to do more – Efforts shift from “doing” to “guiding” / “organising” » Don’t drown in this – you still need to keep “doing” Doctoral Post - Doctoral Academics RA

Slide 7

Slide 7 text

ESRC First Grants • Scheme: Enables early career researchers to apply for a small grant – essentially 1 researcher @ FEC funding + expenses • Very competitive – Over 200 applications last year – ~13% success • Spatial interaction modelling, geodemographics and widening participation in the Higher Education sector?

Slide 8

Slide 8 text

Start Oct. 2008 End Sept. 2009 Feb2009 Time Plan Approximately ¼ Way Through Stage 1: Data Acquisition and Insight Stage 2: Higher Education Profiler Stage 3: Higher Education Modeller Pet Project....(Facebook)

Slide 9

Slide 9 text

The HE Problem: Acceptances 1962 - 2003


Slide 10

Slide 10 text

Occupational Group: 1968-1978

Slide 11

Slide 11 text

Social Class: 1980 - 2001

Slide 12

Slide 12 text

Socio-Economic Group: 2002 - 2007

Slide 13

Slide 13 text

End point of my thesis Investigated a variety of different aspects of HE participation, from a geodemographics / geographers perspective

Slide 14

Slide 14 text

Distance travelled to HE by Mosaic

Slide 15

Slide 15 text

Mosaic Profile– UCAS Acceptances – 2004 (Base All Adult HE Acceptances

Slide 16

Slide 16 text

2006 PLASC Data - DCSF School Profiles – Selective Schools

Slide 17

Slide 17 text

2006 PLASC Data - DCSF GCSE Grades

Slide 18

Slide 18 text

School Catchment Areas

Slide 19

Slide 19 text

Stage 1: Data Acquisition and Insight • Data Sources – University and Colleges Admissions Service (UCAS) – Higher Education Statistics Agency (HESA) – Department for Children Schools and Families (DCSF) • A-Level & Equiv (Key Stage 5) • GCSE & Equiv (Key Stage 4) – DCSF & HESA now link at individual level • Map a student through time! – Previously – had to consider each key stage separately Caveat – These data only arrived last week!

Slide 20

Slide 20 text

Insight 1: Entry Rates (DCFS & HESA) DCSF Key Stage 5 HESA (0) HESA (+1) HESA (+2) 2004 ~50% ~20% ~5% Direct Entry Gap Year Gap Years National Targets = 18-30 Age Range

Slide 21

Slide 21 text

Insight 2: Course Choice Behaviour (UCAS) Applicant C1 C2 C3 C4 C5 C6

Slide 22

Slide 22 text

No content

Slide 23

Slide 23 text

No content

Slide 24

Slide 24 text

Percentages – Row ∑100% Thus, for applicants with at least one choice in “A1 - Pre-Clinical Medicine”, 76.9% of applications from other applicants are within the same JACS Line. ! A1 is quite homogeneous! Extract of the full table

Slide 25

Slide 25 text

No content

Slide 26

Slide 26 text

No content

Slide 27

Slide 27 text

No content

Slide 28

Slide 28 text

Insight 3: Model of Private Characteristics State / KS5 FE Colleges Private Demographics – inc.! spatial reference x Higher Education (HESA)

Slide 29

Slide 29 text

Insight 4: Participation Flows (based on HESA data)

Slide 30

Slide 30 text

Stage 2: Higher Education Profiler • Integrate insights from my thesis • UK HE Atlas • Platform for decision support for a range of stakeholders in HE

Slide 31

Slide 31 text

Map Generation – Dependent Solution Issues – Dependent on Google API, Limited to Google Cartography, potential issues with data ownership

Slide 32

Slide 32 text

Original Site (Google)

Slide 33

Slide 33 text

Map Generation – Independent Solution Shuttle Radar Topography Mission (SRTM) ~100m resolution

Slide 34

Slide 34 text

No content

Slide 35

Slide 35 text

OpenStreetMap (via Cloudmade) great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB ! PublicProfiler Schools Atlas DCSF.gov.uk EduBase and IDACI OpenLayers (.js) HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading AJAX Requests OSM Tiles PVCs (.kml) OAC Hawths Tools ArcGIS Google Chart API Tiles Chart Cache Architecture Diagram

Slide 36

Slide 36 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql Mapnik OSM Tiling Script (.py) Stylesheets(. xml) OpenStreetMap (via Cloudmade) Shapefiles Tiles ! PublicProfiler Schools Atlas Network Layer

Slide 37

Slide 37 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools Mapnik OSM Tiling Script (.py) Stylesheets(. xml) PerryGeo Hillshading Shapefiles Hillshading Layer ! PublicProfiler Schools Atlas Tiles OpenStreetMap (via Cloudmade)

Slide 38

Slide 38 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading ! PublicProfiler Schools Atlas Choropleth Layers Tiles OpenStreetMap (via Cloudmade)

Slide 39

Slide 39 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading OAC School Statistics ! PublicProfiler Schools Atlas Tiles OpenStreetMap (via Cloudmade)

Slide 40

Slide 40 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading PVCs (.kml) OAC Hawths Tools ArcGIS School Catchment ! PublicProfiler Schools Atlas Tiles OpenStreetMap (via Cloudmade)

Slide 41

Slide 41 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading PVCs (.kml) OAC Hawths Tools ArcGIS Serving Tiles ! PublicProfiler Schools Atlas Tiles OpenLayers (.js) OpenStreetMap (via Cloudmade)

Slide 42

Slide 42 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading PVCs (.kml) OAC Hawths Tools ArcGIS Other Tile Sources ! PublicProfiler Schools Atlas Tiles OSM Tiles OpenLayers (.js) OpenStreetMap (via Cloudmade)

Slide 43

Slide 43 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading AJAX Requests PVCs (.kml) OAC Hawths Tools ArcGIS Google Chart API Chart Cache ! PublicProfiler Schools Atlas Tiles OSM Tiles OpenLayers (.js) Showing the Statistics OpenStreetMap (via Cloudmade)

Slide 44

Slide 44 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading AJAX Requests PVCs (.kml) OAC Hawths Tools ArcGIS Google Chart API Chart Cache OSM Tiles Tiles OpenLayers (.js) ! PublicProfiler Schools Atlas Showing Catchments OpenStreetMap (via Cloudmade)

Slide 45

Slide 45 text

great_britain.osm (.xml) PostgreSQL DB with PostGIS osm2pgsql NASA’s SRTM DEMs GDAL Tools UKBorders English MSOAs and Postcodes DCSF. gov.uk 
 National Pupil Database mySQL DB DCSF.gov.uk EduBase and IDACI HEFCE.ac.uk POLAR OAC Mapnik Shapefiles OSM Tiling Script (.py) Stylesheets(. xml) ArcGIS Color Brewer PerryGeo Hillshading AJAX Requests PVCs (.kml) OAC Hawths Tools ArcGIS Google Chart API Chart Cache ! PublicProfiler Schools Atlas OSM Tiles OpenLayers (.js) Tiles Production Systems OpenStreetMap (via Cloudmade)

Slide 46

Slide 46 text

No content

Slide 47

Slide 47 text

No content

Slide 48

Slide 48 text

Stage 3: Higher Education Modeller • m: geodemographic group • i: area of residence • j: university (or university location) • a: attainment level • n: school type • x: subject group • h: university type Potential student groupings

Slide 49

Slide 49 text

The flow array • We write the array of interest as ! S ij (m, a, n, x, h) ! • on the basis that we are always going to want to model S ij with some subset of (m, a, n, x, h). • The challenge arises from the number of cells in this 7- dimensional array.

Slide 50

Slide 50 text

Numbers in each category • m: 7 (Output Area Classification) • i: 30 (NUTS2 areas) • j: 171 HEIs; may be reduced to 30 locations • a: 3 attainment levels; or a continuum of UCAS points • n: ideally 5 school/college types: independent, state selective, state non-selective, Sixth Form College, FE College; reduce to 3? • x: 8, or 4? • h: 5 – Oxbridge/UCL/Imperial, major civic, other research, large other, small other; or reduce to 3?

Slide 51

Slide 51 text

Flow array cells • If we take the largest suggested numbers, the number of cells in the array would be: ! 7x30x171x3x5x8x5 = 21,546,000 ! • which is a ludicrously large number given that we are handling roughly 500,000 students in a year. Most of the cells would have zero entries.

Slide 52

Slide 52 text

Revised flow array cells • If we take the lower category numbers, we get ! 7x30x30x3x3x4x3 = 680,400 ! • which is still too large. It is useful to look at this as 30x30 = 900 geographic dimensions and 7x3x3x4x3 = 756 other dimensions. (900x756 = 680,400)

Slide 53

Slide 53 text

Visualising the real data • The next step is to visualise the data to guide us towards further aggregation. School type Output Area Classification Attainment University NUTS2 Area Flow of student(s) Height = flow size

Slide 54

Slide 54 text

Comparing London Universities UCL (Flow size > 2 students) London Metropolitan University (Flow size > 2 students)

Slide 55

Slide 55 text

Comparing Leeds Universities University of Leeds (Flow size > 4 students) Leeds Metropolitan University (Flow size > 4 students)

Slide 56

Slide 56 text

OAC: Constrained By Circumstances 
 (Flow size > 5 students )

Slide 57

Slide 57 text

OAC: City Living 
 (Flow size > 5 students) London to University of Oxford

Slide 58

Slide 58 text

All flows out of Cornwall 
 (Flow size > 5 students)

Slide 59

Slide 59 text

Model Equation • The model would take the form: ! Sij (m, a, n, x, h) ! = Ai (m, a, n, x, h)Bj (a, x, h)ei (m, a, n, x, h)Oi (m)Dj (a, n, x, h)exp[- β(m)cij (m)] ! • This conceptual model would suffer from having too many cells, but we will use the experience of examining the data to find ways of aggregating. • Initially, the model would be run on a doubly constrained basis for calibration purposes. • It would then be possible to replace Dj (a, n, x, h) by a set of attractiveness factors, Wj (a, n, x, h). This would provide a ‘What if?’ capability. The model could then be used to test various future policy options.