A.3 King County Assessor Data
Tax parcels are the pieces of land associated with property value. King County appraises parcels in order to determine an estimate of their fair market value, which is then used in the process of determening taxation rates. For the purpose of this assessment, parcels are the geographic unit of the indicator of housing market conditions: median residential property value.
More information about the parcel dataset is available at the King County GIS Center:
The Dataset
The King County parcel dataset is a large spatiotemporal dataset comprised of approximately 600,000 annually-updated records. There are several factors worth mentioning about how these data were obtained, what they represent, and how the will be used in this analysis:
Sources
The parcel data is sepatated into a spatial dataset (polygons with minimal metadata) and several sets of tabular metadata that can be joined to each other or the spatial data using the unique Parcel Identification Number (PIN
). The most current version of these data is made publicly available by King County GIS and the KC Assessor’s office (see the links provided above).
This project requires historical property values which are only available by special request to the KC Assessor’s office. The complete parcel datasets were obtained for the 2005, 2010, and 2016 tax years, which correspond to the property values in 2004, 2009, and 2015 (respectively). Like the most current dataset, these data are split into one geospatial and several accompanying tabular metadata sets for each year.
Temporal Change
In order to assess property value patterns it is important to be aware of the fact that parcels come and go. More specifically, the county’s stock of parcels changes over time as property owners either subdivide or merge their properties. These changes create new PIN
codes and eliminate old ones, resulting in historically discontinuous records for these parcels.
Fortuntately, the geographic unit of analysis for this project is the census tract rather than the individual parcels themselves. In order to calculate the median residential property value for each tract, it is only necessary to determine which tract each parcel is within for each observations period; it is not necessary to know the changes of each individual parcel’s property value over time.
Method
The large size of these datasets present challenges, particularly for computationally expensive GIS operations. The following process makes an effort to reduce the volume of data involved in the geospatial overlay operations while producing the desired summary statistic: median residential property value by tract by year.
Process
- 2014 Observations
- Reduce geospatial data from polygons to points (
rgeos::gCentroid()
) - Subset data to include only those parcel within the Seattle CCD of King County
- Pass census tract unique identifier codes (
GEOID
) to parcel points using a spatial overlay (sp::over()
)
- Reduce geospatial data from polygons to points (
- 2004 & 2009 Observations
- Subset the geospatial data to include only those parcels that are not present in the 2015 data
- Repeat the same same spatial overlay process used in the 2014 observations to transfer GEOIDs to parcels
- Combined Data
- Add a new variable to record the year of the observation to each dataset (
YEAR
) - Merge the datasets (by
PIN
), using an operation that retains all records (dplyr::full_join()
)
- Add a new variable to record the year of the observation to each dataset (
Calendar Year 2015 / Tax Year 2016
FALSE Simple feature collection with 612313 features and 5 fields
FALSE geometry type: POINT
FALSE dimension: XY
FALSE bbox: xmin: -122.5314 ymin: 47.08658 xmax: -121.0889 ymax: 47.77919
FALSE epsg (SRID): 4326
FALSE proj4string: +proj=longlat +datum=WGS84 +no_defs
FALSE First 10 features:
Calendar Year 2004 / Tax Year 2005
FALSE Simple feature collection with 590074 features and 9 fields
FALSE geometry type: POINT
FALSE dimension: XY
FALSE bbox: xmin: -122.5315 ymin: 47.08616 xmax: -121.0883 ymax: 47.77937
FALSE epsg (SRID): 4326
FALSE proj4string: +proj=longlat +datum=WGS84 +no_defs
FALSE First 10 features:
Calendar Year 1999 / Tax Year 2000
FALSE Simple feature collection with 564733 features and 9 fields
FALSE geometry type: POINT
FALSE dimension: XY
FALSE bbox: xmin: -122.5315 ymin: 47.08616 xmax: -121.0883 ymax: 47.77937
FALSE epsg (SRID): 4326
FALSE proj4string: +proj=longlat +datum=WGS84 +no_defs
FALSE First 10 features:
X