This repository contains the Open Water Foundation (OWF) dataset for Colorado municipalities. This is a foundational dataset that provides unique identifiers and other data for municipalities. The identifiers can be used to link other datasets, including population datasets and water providers that serve municipalities. OWF has created and is maintaining this dataset to facilitate work on various data analysis and visualization projects in Colorado.
Other repositories provide access to related data, including:
The following sections provide a summary of the project repository:
- Repository Contents
Colorado-Municipalities.xlsx
Contents- Data Files
- How to Use the Data
- Attribution
- Dataset Workflow
- Disclaimer
- License
- Maintainers and Contributing
The repository contains the following:
.gitattributes Git configuration file indicate repository configuration,
in particular handling of line-ending and binary files.
.gitignore Git configuration file to ignore files that should not be committed to the repository.
README.md Explanation of repository contents, data files and sources and
TSTool command files used to process the core data into other products.
data/ Folder containing data files.
co-municipalities.xlsx Simple Excel file containing core data.
co-municipalities.csv The Excel file contents from the Municipality worksheet
converted to a csv file, useful for automated processing.
co-municipalities.geojson The Excel file contents from the Municipality worksheet
converted to a geojson file, useful for mapping applications.
co-municipalities-basin-relate.csv The Excel file contents from the Municipality_Basin_Relate worksheet
converted to a csv file, useful for automated processing.
co-municipalities-county-relate.csv The Excel file contents from the Municipality_County_Relate worksheet
converted to a csv file, useful for automated processing.
co-municipalities-document-relate.csv The Excel file contents from the Municipality_Document_Relate worksheet
converted to a csv file, useful for automated processing.
data-orig/ Folder containing original data files downloaded from agency websites.
Colorado-DOLA-LocalGovernment-Municipalities.tsv
Copy and pasted from: https://dola.colorado.gov/lgis/municipalities.jsf
This is a newer download than the originally downloaded file:
Colorado-DOLA-LocalGovt-IDs-Municipality.csv
Colorado-DOLA-LocalGovernment-Municipalities.csv
The above file saved as csv.
Colorado-DOLA-LocalGovt-IDs-Municipality.csv The data file that is a copy of the Department of Local Affairs' (DOLA)
Local Government Information System website that contains local government IDs
(DOLA_LG_ID). See DOLA attribution below.
THIS FILE MAY BE REMOVED IN THE FUTURE AS THE ABOVE VERSION IS PHASED IN.
Colorado-FIPS-Places.csv The data file containing original data download from the
U.S. Census Bureau containing FIPS IDs.
Colorado-GNIS-Civil.csv The data file containing original data download from the
Geographic Names Information System containing GNIS IDs.
Colorado-Municipality-PointLocation.csv Saved attribute table of Municipal Boundaries in Colorado geojson file
downloaded from the Colorado Information Marketplace that contains
coordinates of the centroid of each municipality's boundaries.
Colorado-PWS-IDs.csv The data file containing original data download from the EPA's
Safe Drinking Water Information System containing PWS IDs.
README.md Explanation of folder contents, description of data files,
and the methodology used to obtain the data and mapping to the joined dataset
data-orig-workflow/ Folder containing files, such as TSTool command files,
for processing original data into usable formats.
downloads/ Dynamically-created folder for downloads, ignored from Git repository.
process-original-data-to-csv.tstool TSTool command file that processes files that are automatically downloaded
from websites and data files manually downloaded, which are incorporated
into the main dataset.
doc/ Additional documentation for the dataset.
workflow/ TSTool software command files to process data into useful forms.
01-convert-formats.tstool TSTool command file to convert the dataset from .xlsx to .csv and .geojson.
x-visualizations/ Files created for specific visualizations, being evaluated.
NO LONGER USED.
These files may be moved to other repositories that contain visualizations.
z-local-notes/ Used for local notes, files are not committed to the repository.
The core Excel workbook that serves as the master data contains the following data columns within the Municipality worksheet. Cross-referencing identifiers were added when creating of the workbook.
Column | Description |
---|---|
MunicipalityName | Name of the municipality. |
FIPS_ID | 5-digit Federal Information Processing Standard code, to link federal datasets. |
FIPS_ID_Flag | Data status of FIPS_ID values; see more detail below. |
GNIS_ID | Geographic Names Information System identifier, to link federal datasets. |
GNIS_ID_Flag | Data status of GNIS_ID values; see more detail below. |
DOLA_LG_ID | 5-digit identifier used by Colorado's Department of Local Affairs (DOLA), to link DOLA datasets. |
DOLA_LG_ID_Flag | Data status of DOLA_LG_ID values; see more detail below. |
OWF_ID | Unique text identifier created by OWF to ensure that every municipality has a unique identifier. The OWF_ID is a concatenation of muninicpality words, using abbreviations. |
OWF_ID_Full | Unique text identifier created by OWF to ensure that every municipality has a unique identifier. The OWF_ID is a concatenation of muninicpality words, using full words. |
OWF_ID_Flag | Data status of OWF_ID values; see more detail below. |
PWS_ID | Public Water System identifier, to link Environmental Protection Agency and Colorado Department of Public Health and Environment datasets. |
PWS_ID_Flag | Data status of PWS_ID values; see more detail below. |
DWR_WaterDistrict_ID | TO BE ADDED |
DWR_WaterDistrict_ID_Flag | TO BE ADDED |
County_CSV | County in which the municipality is contained. For municipalities that are in more than one county consistent with DOLA population data, counties are listed in alphabetical order, separated by commas. Municipalities in multiple counties can also be found in the Municipality_County_Relate worksheet. |
NumCounty | Number of counties within the municipality's boundaries. |
IBCC_Basin_CSV | Interbasin Compact Committee (IBCC) basin in which the municipality is contained. Some municipalities are in more than one basin. In these cases, each basin is listed in alphabetical order, separated by commas. Municipalities in multiple basins can also be found in the Municipality_Basin_Relate worksheet. |
Num_IBCC_Basin | Number of IBCC basins within the municipality's boundaries. This is a quick way to determine if the municipality is in multiple basins. |
Latitude | Latitude of municipality's point location in decimal degrees. |
Longitude | Longitude of municipality's point location in decimal degrees. |
Lat_Long_Flag | Indication of how latitude and longitude were determined. |
Website | Website URL of the municipality. If a website was not found, the corresponding Wikipedia page was used. |
Website_Flag | Data status of Website values; see more detail below. |
Comment | Any other information about the municipality. |
The following conventions are used to create OWF_ID
values from longer names.
See the OWF_ID_Full
identifier for identifier that uses full words rather than abbreviations.
Abbreviation | Long Version |
---|---|
Crk |
Creek |
Ft |
Fort |
Hls |
Hills |
Hts |
Heights |
Mt |
Mount |
Mtn |
Mountain |
Spgs |
Springs |
Vlg |
Village |
Vly |
Valley |
Many data columns have an accompanying second column with the same name and
the _Flag
appended to the original column name.
These columns are an indication of data status as it relates to missing or estimated data values.
The following conventions are used:
Flag Value | Description |
---|---|
G |
Value is a known/good value. |
g |
Value is an estimated (but good) value. The associated cell is also highlighted in yellow. |
N |
Value is not applicable for the municipality and a blank cell is expected. |
M |
Value is known to be missing in original source and therefore a blank cell indicates that a value cannot be provided. |
m |
Value is estimated to be missing. The associated cell is also highlighted in gray. |
z |
Value is unable to be confirmed. A value is possible but cannot be confirmed one way or the other. The associated cell is also highlighted in orange. |
x |
OWF has not made an attempt to populate the cell at this time. The value is missing because OWF has not attempted to find the value. The associated cell is also highlighted in black. |
Colors are only visible in xlsx
files and not csv
files.
Column names are taken from original sources if possible. For clarity and attribution, agency abbreviations may be added to the original column name. Column name length is not restricted, therefore, some data representations such as Esri shapefiles may contain truncated column names. In such cases, alternative formats such as GeoJSON are recommended.
Descriptions of identifiers are also provided in the Notes worksheet within the workbook. This worksheet also details how the original data were downloaded and where to find those files.
Worksheets within the workbook are as follows:
Worksheet | Description |
---|---|
Municipality | Main datase. Municipalities in Colorado, described above. |
Municipality_County_Relate | Municipality/county relationships for municipalities that are contained in more than one county. |
Municipality_Basin_Relate | Municipality/basin relationships for municipalities that are contained in more than one IBCC basin. |
Municipality_Document_Relate | Documents such as water efficiency plans, source water assessment plans, etc. that are associated with a particular municipality. A URL is provided for each document. The documents were found by manually searching for documents using the terms "water conservation efficiency plans" and the municipality's name. This worksheet is organized so that each document is its own record. A municipality may be listed in more than one row if a municipality is associated with multiple documents. This worksheet is incomplete but highlights the potential for providing links to any documents associated with a municipality. |
County | Counties in Colorado, used to fill in county data in other worksheets to ensure data consistency, i.e., no grammatical errors when typing in a county name. |
IBCC_Basin | Interbasin Compact Committee (IBCC) river basins in Colorado, used to fill in basin data in other worksheets to ensure data consistency, i.e., no grammatical errors when typing in a basin name. |
ChangeLog | Indicates changes to the dataset, the date they occurred and who made the changes. |
Metadata_Municipality | Metadata for data columns in the Municipality worksheet. |
Files in the data/
folder are created by exporting data from the Colorado-Municipalities.xlsx
workbook as csv
and geojson
files.
These files can be used directly without having to process the Excel file.
This file is the Municipality worksheet saved in csv format. Warning: if this file is opened directly in Excel, IDs that contain leading zeroes will not show those zeroes. Instead, import the file into a blank Excel file by selecting Data/Get External Data/From Text.
This file is the Municipality worksheet saved in GeoJSON format. This file is viewable as a map in the GitHub repository. It can also be used in GIS and mapping applications.
This file is the Municipality_Basin_Relate worksheet saved in csv format. Warning: if this file is opened directly in Excel, IDs that contain leading zeroes will not show those zeroes. Instead, import the file into a blank Excel file by selecting Data/Get External Data/From Text.
This file is the Municipality_County_Relate worksheet saved in csv format. Warning: if this file is opened directly in Excel, IDs that contain leading zeroes will not show those zeroes. Instead, import the file into a blank Excel file by selecting Data/Get External Data/From Text.
This file is the Municipality_Document_Relate worksheet saved in csv format. Warning: if this file is opened directly in Excel, IDs that contain leading zeroes will not show those zeroes. Instead, import the file into a blank Excel file by selecting Data/Get External Data/From Text.
The Colorado Municipalities dataset provides a complete statewide list of municipalities assembled from multiple sources. There are several unique identifiers for each municipality and the dataset allows cross-referencing the identifiers so that other datasets can be joined. For example, the Colorado Water Providers dataset dataset uses the municipalities' identifiers and can be used to link additional data. In addition, organizations like the Colorado Municipal League may find the dataset useful and connections could be made to their data if any of these identifiers are used with their data.
The Excel and csv files can be used as tabular datasets as is, to create filtered lists or to link to other datasets. Data-processing software such as TSTool can be used to link this dataset to other datasets. Datasets can be used within GIS software to create maps.
The format and contents of the dataset may change over time. It is recommended to save a copy of the dataset.
The data sources for this dataset are listed below.
Data Source | Description |
---|---|
U.S. Census Bureau | Includes municipal Federal Information Processing Standard (FIPS) codes. |
U.S. Geological Survey (USGS)'s Geographic Names Information System (GNIS) | GNIS is the Federal and national standard for geographic nomenclature. The USGS developed the GNIS in support of the U.S. Board on Geographic Names as the official repository of domestic geographic names data. OWF manually cross-referenced the Feature Name column to the MunicipalityName. |
The Colorado Department of Local Affairs (DOLA)'s Local Government Information System local government ID (LG ID) | Data were copied directly from the website and pasted into Excel. To copy the data highlight the table on the DOLA website and save OWF manually cross-referenced the LG ID to the MunicipalityName. OWF is using DOLA_LG_ID instead of LG ID to add more description to the identifier. |
Environmental Protection Agency (EPA)'s Safe Drinking Water Information System (SDWIS) Public Water System IDs (PWS ID) | PWS IDs are used for water quality reports. The Colorado Department of Public Health and Environment (CDPHE)'s Water Quality Control Division also uses the PWS ID. Not all municipalities have a PWS ID. In these instances, the municipality's water and sanitation district may have a PWS ID, or the municipality may be served by a water company that has a PWS ID. OWF manually cross-referenced the PWS Name to the MunicipalityName. |
Open Water Foundation OWF_ID | OWF_ID was created for each municipality by the Open Water Foundation in order to ensure that at least one type of identifier contains values for every municipality. For example, almost every municipality has a DOLA LG ID, with the exception of Carbonate. However, if Carbonate needed to be linked to other datasets via the DOLA_LG_ID, this would not be possible. Therefore, the OWF_ID is needed to potentially link every municipality to other datasets. OWF_ID is used in the "Relate" worksheets and csv files as the identifier for this reason. |
Latitude and Longitude coordinates from Colorado Information Marketplace. | The map titled Municipal Boundaries in Colorado. The map was downloaded as a GeoJSON file and opened in QGIS. The centroid of each municipality's polygon was calculated and used as the point location for the municipality. |
Municipality Website URLs | URLs were found by manually searching for municipality websites. Documents such as water efficiency plans were also manually searched. |
The following is a summary of the workflow to create the dataset. Details for each file are described in the data-orig/README.md file. Steps can be followed if recreating or updating the dataset. The process can be further automated to streamline the overall workflow; however, some steps were initially performned manually.
- Manually create the
Colorado-Municipalities.xlsx
Excel workbook:- Manually populate the empty worksheets (some were added later as data were evaluated).
- Manually download
data-orig
files:Colorado-DOLA-LocalGovernment-Municipalities.tsv
is copy and pasted from https://dola.colorado.gov/lgis/municipalities.jsf to create a tab-delimited file. The column headings were manually added. The olderColorado-DOLA-LocalGovt-IDs-Municipality.csv
was created similarly but is being phased out. See the automated step below to create the correspondingcsv
file.Colorado-GNIS-Civil.psv
was creating by manually downloading from https://geonames.usgs.gov/apex/f?p=138:1:::::: usingState=Colorado
andFeature Class:Civil
(all other fields have default values). After querying, save toColorado-GNIS-Civil.psv
(pipe-separated-value). This file contains municipalities, counties, and other local government entities. See the automated step below to create the correspondingcsv
file.Colorado-PWS-IDs.csv
was created by manually downloading from https://ofmpub.epa.gov/apex/sfdw/f?p=108:1:::NO::: usingSelect a Report=Water System Summary
andPrimacy Agency=Colorado
(all other fields have default values). The file was saved asColorado-PWS-IDs.csv
.
- Automate download and process
data-orig
files. Rundata-orig-process/process-original-data-to-csv.tstool
, which creates:data-orig/Colorado-DOLA-LocalGovernment-Municipalities.csv
- DOLA municipalitiesdata-orig/Colorado-FIPS-Places.csv
- FIPS places including municipalitiesdata-orig/Colorado-GNIS-Civil.csv
- GNIS civil places including municipalities and counties
OWF has attempted to create a complete statewide dataset of municipalities. OWF will attempt to fill data gaps as the dataset is used for analysis and funding allows for more data review. OWF provides no guarantee as to the accuracy of the data. Use this dataset at your own risk. OWF welcomes feedback to improve the dataset.
This dataset is distributed using the Creative Commons Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) License. Please provide attribution.
This dataset is maintained by the Open Water Foundation. The Open Water Foundation is adding value by cross-referencing datasets. If you use the dataset and have comments, please contact the maintainers and/or use the GitHub issues to provide feedback.