{
"cells": [
{
"cell_type": "markdown",
"id": "e352599a",
"metadata": {},
"source": [
"# US Census Bureau Median Housing Data\n",
"\n",
"The US Census Bureau publicly provides an extensive amount of information that it obtains. Sifting through and finding the right data you're looking for is difficult enough. Getting that data into a usable format that can then be uploaded to an RDBMS is something else entirely. This guide will walk us through how to retrieve the data, how to clean it and modify it within Python, and then how to export it to an Excel or CSV file that can be uploaded to an RDBMS. This is the same process used to create the MEDIAN_HOUSE_VALUE table on the USZIPCODE database for the University.\n",
"## ZCTA Vs Zip Code\n",
"\n",
"For this example we will be using The Census Bureau derived ZCTA and not actual Zip Codes. You can read up on ZCTA more in the link below. We will also create a table that will allow us to convert and group Zip Codes to ZCTA so we can compare data in either format.\n",
"
\n",
"[Census Bureau ZCTAs](https://www.census.gov/programs-surveys/geography/guidance/geo-areas/zctas.html)\n",
"
\n",
"```{admonition} TL:DR\n",
"ZCTA combine Zip Codes with very few addresses and assign the most frequent Zip within that grouping as the ZCTA.\n",
"```\n",
"\n",
"## Obtaining The Data\n",
"\n",
"1.) Navigate to https://data.census.gov/cedsci/table. Here you can search for all of the different information that the US Census Bureau publicy provides. For this example, search for \"B25109\" or \"Median Value by Year Householder Moved Into Unit\" to use the same dataset, or another table of interest that allows you to break out that data by ZCTA5 code in the next step.\n",
"
\n",
"
\n",
"2.) Click the \"Geos\" button in the table header and navigate to \"Zip Code Tabulation Area\" (we will refer to this as ZCTA from here on.) If you want to work with a smaller subset of data first, select a single state and then All ZCTA for that state. Or select the top button for all ZCTA in the US. This will likely take some time since the table is roughly 100,000 rows. Hit the \"Transpose\" button to format the data so that ZCTA, Estimate, and Margin of Error are the rows and the year ranges are the columns.\n",
"
\n",
"
\n",
"```{note} As stated in the first step, not all tables will allow you to break out the data into ZCTA. While some of the steps can be used for data outside of ZCTA level data, this guide is explicitly for cleaning ZCTA data.\n",
"```\n",
"
\n",
"
\n",
"3.) The first column is now giving us a median of the medians and should be removed since we can do this calculation easily later if we wanted to. I also removed the final two columns of data before 2000, but these can be kept if you are interested in historical data going that far back. To remove these columns, click \"Columns\" on the far right side of the website and de-select the top two buttons with no title next to them, and the final two buttons for data before 2000.\n",
"
\n",
"
\n",
"4.) Press the CSV button on the top header to export your dataset to a CSV and file and wait for it to download. This will likely take some time since it is almost 100,000 rows.\n",
"\n",
"## Importing The Data\n",
"\n",
"We're finally ready to start preparing our data! It is a good idea to move the downloaded file to the same folder you want the Jupyter Notebook or Python program to be saved in. I also renamed the file to easily identify what it was and easily read it into Jupyter.\n",
"
\n",
"
\n",
"Execute the following cell looking at both the head and the tail."
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "735412e1",
"metadata": {},
"outputs": [
{
"data": {
"text/html": [
"
\n", " | Label (Grouping) | \n", "Median value --!!Total:!!Moved in 2019 or later | \n", "Median value --!!Total:!!Moved in 2015 to 2018 | \n", "Median value --!!Total:!!Moved in 2010 to 2014 | \n", "Median value --!!Total:!!Moved in 2000 to 2009 | \n", "
---|---|---|---|---|---|
0 | \n", "ZCTA5 00601 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
1 | \n", "Estimate | \n", "- | \n", "82,300 | \n", "81,700 | \n", "95,100 | \n", "
2 | \n", "Margin of Error | \n", "** | \n", "±10,863 | \n", "±29,876 | \n", "±20,917 | \n", "
3 | \n", "ZCTA5 00602 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
4 | \n", "Estimate | \n", "- | \n", "144,600 | \n", "116,100 | \n", "112,200 | \n", "
5 | \n", "Margin of Error | \n", "** | \n", "±45,088 | \n", "±15,512 | \n", "±20,066 | \n", "
6 | \n", "ZCTA5 00603 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
7 | \n", "Estimate | \n", "260,300 | \n", "134,400 | \n", "130,400 | \n", "124,100 | \n", "
8 | \n", "Margin of Error | \n", "±31,083 | \n", "±57,761 | \n", "±13,058 | \n", "±12,732 | \n", "
9 | \n", "ZCTA5 00606 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
\n", " | ZCTA5 | \n", "2019_Or_Later | \n", "2015_To_2018 | \n", "2010_To_2014 | \n", "2000_To_2009 | \n", "
---|---|---|---|---|---|
0 | \n", "ZCTA5 00601 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
1 | \n", "Estimate | \n", "- | \n", "82,300 | \n", "81,700 | \n", "95,100 | \n", "
2 | \n", "Margin of Error | \n", "** | \n", "10,863 | \n", "29,876 | \n", "20,917 | \n", "
3 | \n", "ZCTA5 00602 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
4 | \n", "Estimate | \n", "- | \n", "144,600 | \n", "116,100 | \n", "112,200 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
99355 | \n", "Estimate | \n", "- | \n", "- | \n", "- | \n", "- | \n", "
99356 | \n", "Margin of Error | \n", "** | \n", "** | \n", "** | \n", "** | \n", "
99357 | \n", "ZCTA5 99929 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
99358 | \n", "Estimate | \n", "- | \n", "196,900 | \n", "178,800 | \n", "231,900 | \n", "
99359 | \n", "Margin of Error | \n", "** | \n", "52,061 | \n", "30,051 | \n", "93,901 | \n", "
99360 rows × 5 columns
\n", "\n", " | 2019_Or_Later_EST | \n", "2015_To_2018_EST | \n", "2010_To_2014_EST | \n", "2000_To_2009_EST | \n", "
---|---|---|---|---|
0 | \n", "- | \n", "82,300 | \n", "81,700 | \n", "95,100 | \n", "
1 | \n", "- | \n", "144,600 | \n", "116,100 | \n", "112,200 | \n", "
2 | \n", "260,300 | \n", "134,400 | \n", "130,400 | \n", "124,100 | \n", "
3 | \n", "- | \n", "58,200 | \n", "97,000 | \n", "92,900 | \n", "
4 | \n", "- | \n", "115,900 | \n", "110,500 | \n", "112,000 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "- | \n", "- | \n", "- | \n", "- | \n", "
33116 | \n", "- | \n", "263,900 | \n", "339,200 | \n", "181,300 | \n", "
33117 | \n", "- | \n", "170,300 | \n", "243,800 | \n", "154,400 | \n", "
33118 | \n", "- | \n", "- | \n", "- | \n", "- | \n", "
33119 | \n", "- | \n", "196,900 | \n", "178,800 | \n", "231,900 | \n", "
33120 rows × 4 columns
\n", "\n", " | 2019_Or_Later_MOE | \n", "2015_To_2018_MOE | \n", "2010_To_2014_MOE | \n", "2000_To_2009_MOE | \n", "
---|---|---|---|---|
0 | \n", "** | \n", "10,863 | \n", "29,876 | \n", "20,917 | \n", "
1 | \n", "** | \n", "45,088 | \n", "15,512 | \n", "20,066 | \n", "
2 | \n", "31,083 | \n", "57,761 | \n", "13,058 | \n", "12,732 | \n", "
3 | \n", "** | \n", "25,881 | \n", "37,195 | \n", "14,438 | \n", "
4 | \n", "** | \n", "26,647 | \n", "15,103 | \n", "14,945 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "** | \n", "** | \n", "** | \n", "** | \n", "
33116 | \n", "** | \n", "31,175 | \n", "33,706 | \n", "19,820 | \n", "
33117 | \n", "** | \n", "20,982 | \n", "162,405 | \n", "31,403 | \n", "
33118 | \n", "** | \n", "** | \n", "** | \n", "** | \n", "
33119 | \n", "** | \n", "52,061 | \n", "30,051 | \n", "93,901 | \n", "
33120 rows × 4 columns
\n", "\n", " | 2019_Or_Later_EST | \n", "2015_To_2018_EST | \n", "2010_To_2014_EST | \n", "2000_To_2009_EST | \n", "2019_Or_Later_MOE | \n", "2015_To_2018_MOE | \n", "2010_To_2014_MOE | \n", "2000_To_2009_MOE | \n", "
---|---|---|---|---|---|---|---|---|
0 | \n", "NaN | \n", "82300.0 | \n", "81700.0 | \n", "95100.0 | \n", "NaN | \n", "10863.0 | \n", "29876.0 | \n", "20917.0 | \n", "
1 | \n", "NaN | \n", "144600.0 | \n", "116100.0 | \n", "112200.0 | \n", "NaN | \n", "45088.0 | \n", "15512.0 | \n", "20066.0 | \n", "
2 | \n", "260300.0 | \n", "134400.0 | \n", "130400.0 | \n", "124100.0 | \n", "31083.0 | \n", "57761.0 | \n", "13058.0 | \n", "12732.0 | \n", "
3 | \n", "NaN | \n", "58200.0 | \n", "97000.0 | \n", "92900.0 | \n", "NaN | \n", "25881.0 | \n", "37195.0 | \n", "14438.0 | \n", "
4 | \n", "NaN | \n", "115900.0 | \n", "110500.0 | \n", "112000.0 | \n", "NaN | \n", "26647.0 | \n", "15103.0 | \n", "14945.0 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
33116 | \n", "NaN | \n", "263900.0 | \n", "339200.0 | \n", "181300.0 | \n", "NaN | \n", "31175.0 | \n", "33706.0 | \n", "19820.0 | \n", "
33117 | \n", "NaN | \n", "170300.0 | \n", "243800.0 | \n", "154400.0 | \n", "NaN | \n", "20982.0 | \n", "162405.0 | \n", "31403.0 | \n", "
33118 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
33119 | \n", "NaN | \n", "196900.0 | \n", "178800.0 | \n", "231900.0 | \n", "NaN | \n", "52061.0 | \n", "30051.0 | \n", "93901.0 | \n", "
33120 rows × 8 columns
\n", "\n", " | ZCTA5 | \n", "2019_Or_Later_EST | \n", "2015_To_2018_EST | \n", "2010_To_2014_EST | \n", "2000_To_2009_EST | \n", "2019_Or_Later_MOE | \n", "2015_To_2018_MOE | \n", "2010_To_2014_MOE | \n", "2000_To_2009_MOE | \n", "
---|---|---|---|---|---|---|---|---|---|
0 | \n", "00601 | \n", "NaN | \n", "82300.0 | \n", "81700.0 | \n", "95100.0 | \n", "NaN | \n", "10863.0 | \n", "29876.0 | \n", "20917.0 | \n", "
1 | \n", "00602 | \n", "NaN | \n", "144600.0 | \n", "116100.0 | \n", "112200.0 | \n", "NaN | \n", "45088.0 | \n", "15512.0 | \n", "20066.0 | \n", "
2 | \n", "00603 | \n", "260300.0 | \n", "134400.0 | \n", "130400.0 | \n", "124100.0 | \n", "31083.0 | \n", "57761.0 | \n", "13058.0 | \n", "12732.0 | \n", "
3 | \n", "00606 | \n", "NaN | \n", "58200.0 | \n", "97000.0 | \n", "92900.0 | \n", "NaN | \n", "25881.0 | \n", "37195.0 | \n", "14438.0 | \n", "
4 | \n", "00610 | \n", "NaN | \n", "115900.0 | \n", "110500.0 | \n", "112000.0 | \n", "NaN | \n", "26647.0 | \n", "15103.0 | \n", "14945.0 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "99923 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
33116 | \n", "99925 | \n", "NaN | \n", "263900.0 | \n", "339200.0 | \n", "181300.0 | \n", "NaN | \n", "31175.0 | \n", "33706.0 | \n", "19820.0 | \n", "
33117 | \n", "99926 | \n", "NaN | \n", "170300.0 | \n", "243800.0 | \n", "154400.0 | \n", "NaN | \n", "20982.0 | \n", "162405.0 | \n", "31403.0 | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
33119 | \n", "99929 | \n", "NaN | \n", "196900.0 | \n", "178800.0 | \n", "231900.0 | \n", "NaN | \n", "52061.0 | \n", "30051.0 | \n", "93901.0 | \n", "
33120 rows × 9 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Start_Year | \n", "End_Year | \n", "
---|---|---|---|---|---|
0 | \n", "00601 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
1 | \n", "00602 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
2 | \n", "00603 | \n", "260300.0 | \n", "31083.0 | \n", "2019 | \n", "2020 | \n", "
3 | \n", "00606 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
4 | \n", "00610 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "99923 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
33116 | \n", "99925 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
33117 | \n", "99926 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
33119 | \n", "99929 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
33120 rows × 5 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Start_Year | \n", "End_Year | \n", "
---|---|---|---|---|---|
0 | \n", "00601 | \n", "82300.0 | \n", "10863.0 | \n", "2015 | \n", "2018 | \n", "
1 | \n", "00602 | \n", "144600.0 | \n", "45088.0 | \n", "2015 | \n", "2018 | \n", "
2 | \n", "00603 | \n", "134400.0 | \n", "57761.0 | \n", "2015 | \n", "2018 | \n", "
3 | \n", "00606 | \n", "58200.0 | \n", "25881.0 | \n", "2015 | \n", "2018 | \n", "
4 | \n", "00610 | \n", "115900.0 | \n", "26647.0 | \n", "2015 | \n", "2018 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "99923 | \n", "NaN | \n", "NaN | \n", "2015 | \n", "2018 | \n", "
33116 | \n", "99925 | \n", "263900.0 | \n", "31175.0 | \n", "2015 | \n", "2018 | \n", "
33117 | \n", "99926 | \n", "170300.0 | \n", "20982.0 | \n", "2015 | \n", "2018 | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "2015 | \n", "2018 | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2015 | \n", "2018 | \n", "
33120 rows × 5 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Start_Year | \n", "End_Year | \n", "
---|---|---|---|---|---|
0 | \n", "00601 | \n", "81700.0 | \n", "29876.0 | \n", "2010 | \n", "2014 | \n", "
1 | \n", "00602 | \n", "116100.0 | \n", "15512.0 | \n", "2010 | \n", "2014 | \n", "
2 | \n", "00603 | \n", "130400.0 | \n", "13058.0 | \n", "2010 | \n", "2014 | \n", "
3 | \n", "00606 | \n", "97000.0 | \n", "37195.0 | \n", "2010 | \n", "2014 | \n", "
4 | \n", "00610 | \n", "110500.0 | \n", "15103.0 | \n", "2010 | \n", "2014 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "99923 | \n", "NaN | \n", "NaN | \n", "2010 | \n", "2014 | \n", "
33116 | \n", "99925 | \n", "339200.0 | \n", "33706.0 | \n", "2010 | \n", "2014 | \n", "
33117 | \n", "99926 | \n", "243800.0 | \n", "162405.0 | \n", "2010 | \n", "2014 | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "2010 | \n", "2014 | \n", "
33119 | \n", "99929 | \n", "178800.0 | \n", "30051.0 | \n", "2010 | \n", "2014 | \n", "
33120 rows × 5 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Start_Year | \n", "End_Year | \n", "
---|---|---|---|---|---|
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2000 | \n", "2009 | \n", "
1 | \n", "00602 | \n", "112200.0 | \n", "20066.0 | \n", "2000 | \n", "2009 | \n", "
2 | \n", "00603 | \n", "124100.0 | \n", "12732.0 | \n", "2000 | \n", "2009 | \n", "
3 | \n", "00606 | \n", "92900.0 | \n", "14438.0 | \n", "2000 | \n", "2009 | \n", "
4 | \n", "00610 | \n", "112000.0 | \n", "14945.0 | \n", "2000 | \n", "2009 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33115 | \n", "99923 | \n", "NaN | \n", "NaN | \n", "2000 | \n", "2009 | \n", "
33116 | \n", "99925 | \n", "181300.0 | \n", "19820.0 | \n", "2000 | \n", "2009 | \n", "
33117 | \n", "99926 | \n", "154400.0 | \n", "31403.0 | \n", "2000 | \n", "2009 | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "2000 | \n", "2009 | \n", "
33119 | \n", "99929 | \n", "231900.0 | \n", "93901.0 | \n", "2000 | \n", "2009 | \n", "
33120 rows × 5 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Start_Year | \n", "End_Year | \n", "
---|---|---|---|---|---|
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2000 | \n", "2009 | \n", "
0 | \n", "00601 | \n", "81700.0 | \n", "29876.0 | \n", "2010 | \n", "2014 | \n", "
0 | \n", "00601 | \n", "82300.0 | \n", "10863.0 | \n", "2015 | \n", "2018 | \n", "
0 | \n", "00601 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
1 | \n", "00602 | \n", "112200.0 | \n", "20066.0 | \n", "2000 | \n", "2009 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
33119 | \n", "99929 | \n", "231900.0 | \n", "93901.0 | \n", "2000 | \n", "2009 | \n", "
33119 | \n", "99929 | \n", "178800.0 | \n", "30051.0 | \n", "2010 | \n", "2014 | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2015 | \n", "2018 | \n", "
33119 | \n", "99929 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "
132480 rows × 5 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Start_Year | \n", "End_Year | \n", "Year | \n", "
---|---|---|---|---|---|---|
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2000 | \n", "2009 | \n", "[2000, 2001, 2002, 2003, 2004, 2005, 2006, 200... | \n", "
0 | \n", "00601 | \n", "81700.0 | \n", "29876.0 | \n", "2010 | \n", "2014 | \n", "[2010, 2011, 2012, 2013, 2014] | \n", "
0 | \n", "00601 | \n", "82300.0 | \n", "10863.0 | \n", "2015 | \n", "2018 | \n", "[2015, 2016, 2017, 2018] | \n", "
0 | \n", "00601 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "[2019, 2020] | \n", "
1 | \n", "00602 | \n", "112200.0 | \n", "20066.0 | \n", "2000 | \n", "2009 | \n", "[2000, 2001, 2002, 2003, 2004, 2005, 2006, 200... | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33118 | \n", "99927 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "[2019, 2020] | \n", "
33119 | \n", "99929 | \n", "231900.0 | \n", "93901.0 | \n", "2000 | \n", "2009 | \n", "[2000, 2001, 2002, 2003, 2004, 2005, 2006, 200... | \n", "
33119 | \n", "99929 | \n", "178800.0 | \n", "30051.0 | \n", "2010 | \n", "2014 | \n", "[2010, 2011, 2012, 2013, 2014] | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2015 | \n", "2018 | \n", "[2015, 2016, 2017, 2018] | \n", "
33119 | \n", "99929 | \n", "NaN | \n", "NaN | \n", "2019 | \n", "2020 | \n", "[2019, 2020] | \n", "
132480 rows × 6 columns
\n", "\n", " | ZCTA5 | \n", "ESTIMATE | \n", "MARGIN_OF_ERROR | \n", "Year | \n", "
---|---|---|---|---|
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2000 | \n", "
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2001 | \n", "
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2002 | \n", "
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2003 | \n", "
0 | \n", "00601 | \n", "95100.0 | \n", "20917.0 | \n", "2004 | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
33119 | \n", "99929 | \n", "178800.0 | \n", "30051.0 | \n", "2014 | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2015 | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2016 | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2017 | \n", "
33119 | \n", "99929 | \n", "196900.0 | \n", "52061.0 | \n", "2018 | \n", "
501841 rows × 4 columns
\n", "\n", " | ZIP_CODE | \n", "PO_NAME | \n", "STATE | \n", "ZIP_TYPE | \n", "ZCTA | \n", "zip_join_type | \n", "
---|---|---|---|---|---|---|
0 | \n", "00501 | \n", "Holtsville | \n", "NY | \n", "Post Office or large volume customer | \n", "11742 | \n", "Spatial join to ZCTA | \n", "
1 | \n", "00544 | \n", "Holtsville | \n", "NY | \n", "Post Office or large volume customer | \n", "11742 | \n", "Spatial join to ZCTA | \n", "
2 | \n", "00601 | \n", "Adjuntas | \n", "PR | \n", "Zip Code Area | \n", "00601 | \n", "Zip matches ZCTA | \n", "
3 | \n", "00602 | \n", "Aguada | \n", "PR | \n", "Zip Code Area | \n", "00602 | \n", "Zip matches ZCTA | \n", "
4 | \n", "00603 | \n", "Aguadilla | \n", "PR | \n", "Zip Code Area | \n", "00603 | \n", "Zip matches ZCTA | \n", "
... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
41086 | \n", "99926 | \n", "Metlakatla | \n", "AK | \n", "Zip Code Area | \n", "99926 | \n", "Zip matches ZCTA | \n", "
41087 | \n", "99927 | \n", "Point Baker | \n", "AK | \n", "Zip Code Area | \n", "99927 | \n", "Zip matches ZCTA | \n", "
41088 | \n", "99928 | \n", "Ward Cove | \n", "AK | \n", "Post Office or large volume customer | \n", "99901 | \n", "Spatial join to ZCTA | \n", "
41089 | \n", "99929 | \n", "Wrangell | \n", "AK | \n", "Zip Code Area | \n", "99929 | \n", "Zip matches ZCTA | \n", "
41090 | \n", "99950 | \n", "Ketchikan | \n", "AK | \n", "Zip Code Area | \n", "99921 | \n", "Spatial join to ZCTA | \n", "
41091 rows × 6 columns
\n", "