Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making

Kaiser, Kendra E.; Blasch, Kyle; Schmitz, Steven

doi:10.3390/w15040679

Open AccessArticle

Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making

by

Kendra E. Kaiser

^1,*

,

Kyle Blasch

²

and

Steven Schmitz

¹

Department of Geosciences, Boise State University, Boise, ID 83725, USA

²

Northwest-Pacific Island Region, U.S. Geological Survey, Boise, ID 83702, USA

^*

Author to whom correspondence should be addressed.

Water 2023, 15(4), 679; https://doi.org/10.3390/w15040679

Submission received: 24 December 2022 / Revised: 4 February 2023 / Accepted: 5 February 2023 / Published: 9 February 2023

(This article belongs to the Special Issue River Flow Monitoring: Needs, Advances and Challenges)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Streamflow data are critical for monitoring and managing water resources, yet there are significant spatial gaps in our federal monitoring networks with biases toward large perennial rivers. In some cases, streamflow monitoring exists in these spatial gaps, but information about these monitoring locations is challenging to obtain. Here, we present a streamflow catalog for the United States Pacific Northwest that includes current and historical streamflow monitoring location information obtained from 32 organizations (other than the U.S. Geological Survey), which includes 2661 continuous streamflow gaging locations (22% are currently active) and 30,557 discrete streamflow measurements. A stakeholder advisory board with representatives from organizations that operate streamflow monitoring networks identified metadata requirements and provided feedback on the Streamflow Data Catalog user interface. Engagement with the water resources community through this effort highlighted challenges that water professionals face in collecting and managing streamflow data so that data are findable, accessible, interoperable, and reusable (FAIR). Over 60% of the streamflow monitoring locations in the Streamflow Data Catalog are not available online and are thus not findable through web search engines. Providing organizations technical assistance with standard measurement procedures, metadata collection, and web accessibility could substantially increase the availability and utility of streamflow information to water resources communities.

Keywords:

streamflow data; FAIR data; metadata; water resources management

1. Introduction

Streamflow data are necessary for managing water resources for numerous societal and environmental uses as well as to support research and modeling efforts to address climate change impacts on water availability. While the U.S. Geological Survey (USGS) operates the Nation’s largest streamflow monitoring network [1], there are significant spatial gaps and stream order biases in monitoring locations [2,3,4]. The water resources community has been advocating for more streamflow monitoring to fill these spatial gaps [5], though the extent to which streamflow monitoring across the Nation has increased is unknown. This gap in monitoring is partially due to the lack of centralized information about existing streamflow monitoring outside of the USGS [6]. The USGS has an accurate count of streamflow gaging stations (i.e., gages) in its operational network, and in 2021 the number of gages operated by the USGS was 8215 [7], which is 273 more gages than the previous peak in 1967. Within the United States Pacific Northwest (PNW: Idaho, Oregon, and Washington), the peak number of gages operated in a single year by the USGS was 816 in 1968 and has declined to 639 in 2021. These opposing trends display the variability of current gage numbers across the Nation. In addition to the national USGS monitoring program, there are discrete streamflow measurement locations and continuous streamflow gaging stations operated by non-USGS organizations. The number of discrete and continuous streamflow monitoring locations operated by these organizations has not been quantified, nor is the water resources community aware of how many non-USGS networks currently exist. Thus, an important approach to increasing the availability of streamflow information is to identify streamflow monitoring data that already exist and evaluate the potential of making these data more findable and accessible.

States, tribes, counties, cities, universities, non-profits, private firms, and other federal agencies operate streamflow gaging stations and collect discrete streamflow data [8]. These data are distributed across organizations’ websites with varying operability and in disparate offices with limited accessibility (e.g., off-line, isolated digital storage, or paper records). For these reasons, it is currently difficult to identify all current and historical streamflow data that may exist for a given geographic area. This challenge is being addressed with efforts to make streamflow data findable, accessible, interoperable, and reusable (FAIR) [9,10], including increasing awareness of the importance of adhering to FAIR data standards [11]. There are a growing number of repositories in which individuals may upload streamflow data (CUASHI HydroClient, HydroShare, Environmental Data Initiative, and ScienceBase) and various efforts to create mechanisms to find these data once their metadata are available online (Network Linked Data Index, [12], Geoconnex, [13]). The remaining challenge is one that is unlikely to be automated via these workflows and community standards. That challenge is to initially find, characterize, and populate streamflow locations, location metadata, and streamflow data. For the PNW, it was uncertain how many of the organizations’ operating networks were either unaware of emerging streamflow data repositories and search mechanisms or were aware but did not have the resources to populate data in these repositories or make their data findable.

Here, we present results of a project to catalog non-USGS streamflow monitoring locations and accompanying metadata from across the PNW and qualitatively describe the challenges associated with making metadata findable, accessible, interoperable, and reusable. For this study, metadata was defined as data describing streamflow locations and associated streamflow data [14]. Additionally, throughout the cataloging process we identified obstacles that water professionals face regarding streamflow data and metadata management. While the goals of accessible streamflow data extend beyond the PNW, the authors focused on the PNW because it is sufficiently large enough to identify challenges with FAIR data practices, the authors had existing knowledge of the streamflow networks, and the catalog could be accomplished with available resources.

2. Methods

In 2020, a stakeholder advisory board (SAB) was created for this project. It included representatives from 11 organizations across Idaho, Oregon, and Washington that operated streamflow monitoring networks and used streamflow data daily. The SAB initially helped develop a survey to solicit information about organizations’ streamflow monitoring programs and metadata considered necessary to improve FAIR data practices (Figure 1). The SAB assembled a list of organizations across the PNW that may have collected streamflow information. Authors from Boise State University and the Idaho Policy Institute then distributed the survey along with a description of the Streamflow Data Catalog and its intended purpose. The SAB also developed a set of metadata considered to be the most important information to collect about streamflow monitoring locations (Table S1).

The metadata template and a description of the goals of the Streamflow Data Catalog were then distributed to the list of organizations via email (Figure 1). Organizations with small monitoring networks were able to add their data directly to the metadata template, while larger organizations generally sent their metadata in the structure and format that could easily be exported from their data management program. For these datasets, we extracted and reorganized the relevant information to fit into the metadata template. The process of obtaining metadata often required consistent contact with data managers and follow up emails to clarify information. Data were also obtained from various online repositories, such as the Forest Service Research Data Archive (Table S2).

Data quality assurance and quality control (QA/QC) was an ongoing component of the data compilation effort as new issues were identified. The primary metadata issues were missing or incomplete dates, missing information about status (active/inactive), measurement frequency, and inconsistent naming conventions. Many discrete measurement datasets did not have complete dates, for example, they might have provided month-year or only year. For these measurements, we set the date to either the first of the month or the first of the year and noted this in the notes section of the metadata. Some locations did not have start or end dates, we set these to “unknown” if they were a continuous dataset. While categories were provided for many of the categorical variables, typos and capitalizations resulted in duplicative categories. These duplications were identified, edited, and grouped into appropriate categories. Some organizations provided multiple datasets that they manage. For these datasets an additional label was added in the Streamflow Data Catalog to denote the internal naming convention of the organization’s dataset (e.g., IDWR has a telemetry dataset and daily time series dataset). Given the number of independent irrigation district datasets, the individual district name was set as the “organization dataset” and the “organization” was set as “Irrigation District.” This was done to improve the data visualization.

Once the largest known streamflow monitoring networks had provided information on their streamflow programs, including location data and accompanying metadata, additional geospatial information was derived (county and hydrologic unit codes). The Streamflow Data Catalog visualization was then developed using the provided and derived data. Roundtable discussions were organized for each state to engage with streamflow metadata providers and evaluate the utility of the data visualization and the metadata collected [15]. Roundtable discussions were modeled after the Aspen Institute Roundtable discussions that were held in California, Texas, the Great Lakes, and the Upper Colorado River Basin from 2017 to 2019 [16]. Participants (1) described comprehensive and detailed streamflow data uses and requirements for their organizational missions; (2) identified challenges with and opportunities to apply FAIR data practices; (3) shared knowledge of their network operations; and (4) provided feedback on features of the Streamflow Data Catalog.

3. Results

This effort has produced the largest inter-organizational dataset of streamflow monitoring metadata in the PNW and an interactive data visualization to archive and access the metadata (Table S1, Figure 2). The metadata represent 33,267 non-USGS streamflow monitoring locations across 32 organizations (Table S2) with data from 112 of 142 counties (79%) and 206 of 234 HUC8 watersheds (88%). There are 2661 known continuous monitoring locations, 30,557 discrete measurement locations, and 49 locations with an unknown measurement interval (Figure 3, Table 1). Of the continuous streamflow monitoring locations, 588 (22%) are currently active and 41% do not have end dates, indicating potentially more sites in the catalog that are active (Figure 4, Table 2). Idaho has a significant number of monitoring locations in irrigation canals that are monitored by water managers. These data are self-reported and make up most of the continuous monitoring locations that have unknown monitoring frequency or unknown status (Figure 3 and Figure 4). In 2021, there were a nearly equal number of non-USGS continuous streamflow monitoring locations compared to USGS streamflow monitoring locations in the PNW. Most of the non-USGS locations have fewer than 20 years of data and 139 locations with more than 50 years of data (Figure 5). While this is the largest collection of metadata in the region, critical gaps in the dataset remain. For example, 48% of the continuous monitoring locations do not have information about measurement interval (Figure 3), and 55% do not denote the stream characterization (Figure 6). Of the continuous data, only about 50% can be accessed online by way of weblinks for this project. Organizations providing online access included state agencies, private hydro power, and a few cities, counties, research watersheds, and watershed councils/associations (Table S1).

The streamflow monitoring networks varied in size, with the state organizations providing the largest continuous monitoring networks, and both federal and state organizations provided the largest number of discrete monitoring locations. The U.S. Forest Service (USFS) has by far the largest collection of discrete streamflow measurements (17,787), which at this point still does not include all the USFS streamflow information in the PNW, followed by the Idaho Department of Environmental Quality Beneficial Use Reconnaissance Project (12,671) [17]. The Oregon and Washington Departments of Fish and Wildlife had the next largest set of discrete measurements (69 and 37 respectively) in the Streamflow Data Catalog.

4. Discussion

The need for additional and standardized streamflow data to support water resources management has been suggested by many organizations [5,18], though the authors are unaware of evaluations of the number of non-centralized streamflow monitoring locations prior to this study. Metadata assembled in the Streamflow Data Catalog show an increase in the number of non-USGS streamflow monitoring locations within the PNW in the last twenty years (Figure 5a). By engaging stakeholders across the PNW, this project compiled streamflow monitoring location metadata for 30,557 locations from 32 organizations (Table 1 and Table S1). The resulting interactive metadata visualization allows users to easily identify potential sources of data for a given area of interest (Figure 2). While this effort captured data from many organizations, including those operating the largest known networks, we know there are additional streamflow data collected by other organizations (e.g., consulting firms and irrigation districts) not represented in this catalog. The Streamflow Data Catalog and feedback generated during the roundtable discussions can inform future data integration efforts and demonstrate the value of aggregating observational data. The Streamflow Data Catalog showcases the significant amount of existing data sources available to integrate into the next generation of advanced databases and web crawlers and be used to provide additional clarity on data gaps and priority locations for new monitoring efforts within the PNW.

A critical component of this work was bringing water resources professionals together to discuss the challenges of operating streamflow monitoring networks, collecting discrete measurements, and managing associated metadata [15]. While the organizations operating networks differ, there are many commonalities in the challenges they face including finding open-source metadata standards specific to streamflow data, training on how to collect streamflow data to increase interoperability and reusability, and resource capacity to perform multi-step quality assurance. The attendees reported learning about additional educational tools and funding opportunities from the roundtable discussions. Providing forums like these are important for sharing information that can improve FAIR data within the water resources communities because of the gap that remains between understanding the importance of FAIR data and implementing FAIR data practices [19,20]. The 2022 National Water Use Data Workshop identified similar challenges across the country as well as structural limitations regarding legislation or laws that impede sharing data, the impacts of centralized state-wide information technology systems, and data privacy issues [19].

The findability of streamflow data will remain a significant challenge as many organizations do not have the resources to provide data online. The streamflow data survey was used to help identify organizations that had streamflow data, but as with many surveys, the response rate was relatively low [15]. The most challenging part of this endeavor was finding the right offices and individuals within organizations to solicit streamflow metadata and transferring the metadata from those offices to the catalog. Personal communications and meetings were often the only way to obtain the information needed to add monitoring locations and associated metadata to the catalog. For example, while many of the state agencies operating large networks have online databases, shapefiles of their monitoring locations with pertinent metadata did not exist. This required staff to extract the relevant information from their data management systems (e.g., WISKY, AQUARIUS, and MS Excel; any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government) with subsequent manual and time-intensive formatting and editing.

USGS Water Mission Area and the Internet of Water (IoW) are working toward increasing findability of streamflow data [21]. They have developed the Network Linked Data Index (NLDI) and Geoconnex, which are mechanisms to facilitate finding streamflow data. The NLDI indexes data to the National Hydrography Dataset (NHDPlus V2) [12,22], and Geoconnex is a registry system that creates a unique identifier that is linked to the URL that hosts the original data [13]. The utility of these tools is that once a measurement location has a permanent landing page, the Geoconnex web crawler can harvest links between hydrologic features and the associated data. Full utility of these tools will require organizations that collect continuous datasets to create webpages measurement locations and will decrease the effort needed to maintain a centralized database. As these groups and others (e.g., Water Data Exchange Program (WADE) [23] and Consortium of Universities for the Advancement of Hydrological Sciences, [CUASHI]) continue to collaboratively work toward open-source data management tools, the scaffolding for interoperable streamflow data will emerge and build on the successes of similar efforts like the Water Quality Portal and the Community Collaborative Rain, Hail, and Snow Network (CoCoRaHS) [24].

The ultimate success of these search engines requires data to be online, which is not the case for at least 60% of the network data identified within the catalog. The attendees identified a range of organizational abilities and capacities to consolidate streamflow data and metadata into communicable formats and into accessible platforms. Organizations operating smaller networks or collecting a smaller number of discrete measurements were less likely to prioritize these efforts or have the necessary resources to accomplish these tasks. While these organizations may want to make their data findable and available, external resources will be necessary to achieve this outcome. The attendees identified the need for no/low-cost tools for data formatting and metadata description to assure data are easily transferred to online repositories.

While multiple organizations provided geographic metadata for their continuous and discrete streamflow gaging locations, other important types of metadata were missing. Some of these metadata were accessible on organizational websites or internal databases but were not provided directly, so the authors obtained this information from online or other organizational records. Of the metadata entries in this catalog, 94% included the complete dataset including start and end dates, 100% included latitude and longitude, 53% included measurement intervals, and only 8% of metadata included information about instrumentation. Basic stream characterization (e.g., “natural”, “canal”, etc.) was only included in 45% of the metadata associated with the continuous gages. Knowing measurement instrumentation and intervals (Figure 5a) are critical to understanding the usability of the data, and while the individual organizations likely recorded this information, it is not regularly integrated into their existing data structures or easily obtained from their data management systems. None of the organizations provided geospatial accuracy metadata for the sites or methods used to determine geolocations. Very few of the data providers included information about their streamflow measurement techniques. Several organizations operating the largest networks provide data collection protocols through their online data portals [25]. Characterizing which metadata were easily obtained, or lacking, highlights the information that will be readily available and also highlights those metadata which the water resources community may need to provide assistance if it is to be obtainable in the future.

Interoperability and reusability of streamflow data are primary objectives for the data identified within the Streamflow Data Catalog. Creating and implementing controlled vocabularies when sourcing information from many organizations could substantially improve automated interoperability. For example, many stream type categories were provided (e.g., “constructed channel”, “canal”, “diversion”, and “weir”) that later were grouped into a smaller set of stream types (Figure 5b). These vocabularies are well defined in the CUASHI HydroClient (http://his.cuahsi.org/mastercvreg/cv11.aspx (accessed on 19 January 2023)) [26], a subset of which could be included with metadata templates to increase efficiencies and provide some initial standardization. Relational databases, such as CUASHIs HydroClient, can facilitate data interoperability [27], yet the effort to contribute data to these repositories can be a limiting factor for an organization that is staff and/or resource limited [15,19].

Reproducibility of data collection and quality assurance methods, which can improve confidence in reusability, is considered a more challenging task given that streamflow is a calculated value from multiple parameters that can themselves be measured using varying techniques and procedures. Metadata about streamflow measurement methods are important for characterizing uncertainty [28,29] and confirming the applicability of data for its intended purpose. Methods for data collection were reported for only 8% of the continuously monitored gage locations. About 35% of responding organizations used a provisional QA/QC method, meaning data were checked when downloaded but not reviewed or analyzed for errors, and only 13% of the organizations audit their data on an annual basis [8]. Training and education on both streamflow measurement methods and QA/QC processes were identified as important for improving reproducibility within the streamflow monitoring community, especially as they relate to preferred techniques for small versus large order streams. Creation of open-source QA/QC tools could be particularly beneficial for small organizations and could be a mechanism to incentivize data contributions to online platforms.

The challenges we will continue to face as we seek to integrate streamflow data are (1) obtaining additional data from known and unknown networks, (2) addressing incomplete metadata, (3) obtaining sustained resources for data management, and (4) developing streamlined processes and tools for organizations to update their network metadata. The water resources community can help address these challenges in many ways. First, individuals and organizations can ensure that streamflow metadata are complete for each location and supporting organizations can coordinate common metadata templates (see metadata template, Table S1). The primary weakness of the Streamflow Data Catalog is that it is incomplete, both in total number and location of measurements and associated metadata for each location. Small organizations may be the most challenging groups to gather metadata from as they are limited in their capacity to compile information if it is not already centralized. Support for, and adoption of, available and emerging data management strategies and workflows will improve organizational capacity to maintain complete metadata [30]. Supporting organizations can support community education and outreach around data management standards (e.g., Internet of Water Coalition). Secondly, matching the funding for data management with funding for additional monitoring efforts would likely improve overall data fidelity for the entire network. Keeping information sourced from many organizations current will be a significant challenge given the time required to gather and QA/QC the data provided. This is not an insurmountable task given the many examples of data compilation efforts, such as the Water Quality Portal [31], which provides an example of how data providers maintaining ownership can distribute the effort of data maintenance, and the NorWest stream temperature database, which compiled data from over 100 agencies in the western United States [32]. Identifying mechanisms to continue ongoing efforts to centralize streamflow data and improve associated metadata and data management workflows will have valuable and widespread impacts on water resources research and management.

5. Conclusions

As shown through this investigation, a substantial amount of non-USGS streamflow data exist within the Pacific Northwest that had not previously been easily findable. The streamflow gages and discrete streamflow measurement locations captured in the Streamflow Data Catalog will provide numerous benefits to water resources and land management organizations. The existing metadata are far from complete but are sufficient to identify where data exist and, often, the period of record. Organizations operating streamflow networks and collecting discrete data are often open to sharing their data but need additional data management resources and technical support. Stakeholder engagement will be necessary to ensure proposed data solutions are appropriately developed and distributed. While time-consuming, the benefits of the Streamflow Data Catalog development outweigh the resources devoted to this project and may provide a more cost-effective solution compared to funding new streamflow gages. A state-by-state approach for these types of catalog efforts can unearth significant amounts of streamflow data that will provide a valuable resource for the Nation’s water resources and land management communities.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/w15040679/s1. Table S1: Streamflow Metadata Template; Table S2: Organizations that provided data for the Streamflow Catalog.

Author Contributions

Conceptualization and methodology, funding acquisition, writing—original draft, review and editing, K.E.K. and K.B.; data curation, formal analysis, software and visualization, S.S. and K.E.K.; supervision, project administration, investigation, K.E.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by U.S. Geological Survey Northwest Climate Adaptation Science Center under Grant/Cooperative Agreement No. G20AC00439-A.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The metadata for the streamflow data catalog is available through Boise State University ScholarWorks 10.18122/redi_data.2.boisestate.

Acknowledgments

The authors would like to acknowledge the contributions from the individuals that comprised the Stakeholder Advisory Board, Jennifer Villa from the Oklahoma-Texas Water Science Center for making the Streamflow Data Catalog Visualization, as well as Paul Rydlund, Adam Stonewall, and Roy Bartholomay who reviewed this document.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results. This journal article has been peer reviewed and approved for publication consistent with USGS Fundamental Science Practices (https://pubs.usgs.gov/circ/1367/ (accessed on 3 February 2023).

References

Eberts, S.M.; Wagner, C.R.; Woodside, M.D. Water Priorities for the Nation—The U.S. Geological Survey Next Generation Water Observing System: U.S. Geological Survey Fact Sheet 2019–3046; U.S. Geological Survey: Reston, VA, USA, 2019; p. 2. [CrossRef]
King, T.; Hundt, S.; Simonson, A.; Blasch, K. Evaluation of Select Velocity Measurement Techniques for Estimating Discharge in Small Streams across the United States. J. Am. Water Resour. Assoc. 2022, 58, 1510–1530. [Google Scholar] [CrossRef]
Krabbenhoft, C.A.; Allen, G.H.; Lin, P.; Godsey, S.E.; Allen, D.C.; Burrows, R.M.; DelVecchia, A.G.; Fritz, K.M.; Shanafield, M.; Burgin, A.J.; et al. Assessing placement bias of the global river gauge network. Nat. Sustain. 2022, 5, 586–592. [Google Scholar] [CrossRef] [PubMed]
Deweber, J.T.; Tsang, Y.P.; Krueger, D.M.; Whittier, J.B.; Wagner, T.; Infante, D.M.; Whelan, G. Importance of understanding landscape biases in USGS gage locations: Implications and solutions for managers. Fisheries 2014, 39, 155–163. [Google Scholar]
Larson, K.B.; Wong, M.A.Y.L.; Mitchel, K.C.; Tagestad, J.D.; Saulsbury, J.W.; Bellgraph, B.J.; Reicher, D.W. Improving Discovery, Sharing, and Use of Water Data: Initial findings and Suggested Future Work (PNNL-29737); Pacific Northwest National Laboratory: Richland, WA, USA, 2021. Available online: https://www.pnnl.gov/main/publications/external/technical_reports/PNNL-29737.pdf (accessed on 20 December 2022).
Jaeger, K.L.; Hafen, K.C.; Dunham, J.B.; Fritz, K.M.; Kampf, S.K.; Barnhart, T.B.; Kaiser, K.E.; Sando, R.; Johnson, S.L.; McShane, R.R.; et al. Beyond Streamflow: Call for a National Data Repository of Streamflow Presence for Streams and Rivers in the United States. Water 2021, 13, 1627. [Google Scholar] [CrossRef] [PubMed]
Bucknell, M.; Briggs, A.; Bechtel, E.; DeCicco, L.; Hines, M.; Nell, C.; Platt, L.; Read, E.; Read, J.; Walker, J.; et al. Gages Through the Ages: U.S. Geological Survey VizLab. 2020. Available online: https://labs.waterdata.usgs.gov/visualizations/gages-through-the-ages/index.html#/ (accessed on 20 December 2022).
Kaiser, K.E.; Blasch, K.; Hall, M. A survey of non-USGS continuous streamflow gaging networks in the Pacific Northwest. J. Am. Water Resour. Assoc. 2022, submitted.
Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed]
Boeckhout, M.; Zielhuis, G.A.; Bredenoord, A.L. The FAIR guiding principles for data stewardship: Fair enough? Eur. J. Hum. Genet. 2018, 26, 931–936. [Google Scholar] [CrossRef]
National Science Foundation-Dissemination and Sharing of Research Results-NSF Data Management Plan Requirements. Available online: https://www.nsf.gov/bfa/dias/policy/dmp.jsp (accessed on 9 December 2022).
Networked Linked Data Index and geoconnex, USGS Water Data for the Nation Blog. 2022. Available online: https://waterdata.usgs.gov/blog/nldi-geoconnex/ (accessed on 13 December 2022).
Geoconnex. Internet of Water [IoW]. Available online: https://internetofwater.org/geoconnex/ (accessed on 9 December 2022).
Obuch, R.C.; Carlino, J.; Zhang, L.; Blythe, J.; Dietrich, C.; Hawkinson, C. Department of the Interior metadata implementation guide—Framework for developing the metadata component for data resource management. In U.S. Geological Survey Techniques and Methods; Book 16, Chapter A1; U.S. Geological Survey: Reston, VA, USA, 2018. [Google Scholar]
Kaiser, K.E. Pacific Northwest Streamflow Data Landscape, A report from the 2022 Streamflow Data Roundtables. Northwest Clim. Adapt. Sci. Cent. Rep. 2022, submitted.
Patterson, L.; Doyle, M.; Gershuny, G.; Rogers, J.E. Building an Internet of Water—A Report from the 2017 to 2019 Internet of Water Roundtables; The Aspen Institute and the Nicholas Institute for Environmental Policy Solutions: Durham, NC, USA, 2019. [Google Scholar]
Idaho Department of Environmental Quality [IDEQ]. Monitoring and Assessment. Available online: https://www.deq.idaho.gov/water-quality/surface-water/monitoring-and-assessment/ (accessed on 9 December 2022).
Miller, S.; Eldred, P.; Muldoon, A.; Anlauf-Dunn, K.; Stein, C.; Hubler, S.; Merrick, L.; Haxton, N.; Larson, C.; Rehn, A.; et al. A Large-Scale, Multiagency Approach to Defining a Reference Network for Pacific Northwest Streams. Environ. Manag. 2016, 58, 1091–1104. [Google Scholar] [CrossRef]
Abdallah, A.; Willardson, T.; James, R. The 2022 National Water Use Data Workshop Summary Report. HydroShare 2022. [Google Scholar] [CrossRef]
Kinkade, D.; Shepherd, A. Geoscience data publication: Practices and perspectives on enabling the FAIR guiding principles. Geosci. Data J. 2021, 9, 177–186. [Google Scholar] [CrossRef]
Internet of Water Coalition [IoW]. Available online: https://internetofwater.org/ (accessed on 9 December 2022).
Wieczorek, M.E.; Jackson, S.E.; Schwarz, G.E. Select Attributes for NHDPlus Version 2.1 Reach Catchments and Modified Network Routed Upstream Watersheds for the Conterminous United States (ver. 2.0, November 2019); U.S. Geological Survey: Reston, VA, USA, 2018. [CrossRef]
Western States Water Council. Water Data Exchange Program [WADE]. Available online: https://westernstateswater.org/wade/ (accessed on 13 December 2022).
Community Collaborative Rain. Hail & Snow Network [CoCoRaHS]. Available online: https://www.cocorahs.org/ (accessed on 9 December 2022).
Shedd, J.R. Measuring and Calculating Stream Discharge, Standard Operating Procedure EAP056; Version 1.3; Publication No. 18-03-203; Washington State Department of Ecology: Lacey, WA, USA, 2018.
CUASHI Hydrologic Information System (HIS). Master Controlled Vocabulary Registry for ODM 1.1. Available online: http://his.cuahsi.org/mastercvreg/cv11.aspx (accessed on 19 January 2023).
Horsburgh, J.S.; Tarboton, D.G.; Maidment, D.R.; Zaslavsky, I. A relational model for environmental and water resources data. Water Resour. Res. 2008, 44, 6392. [Google Scholar] [CrossRef]
Horner, I.; Renard, B.; Le Coz, J.; Branger, F.; McMillan, H.K.; Pierrefeu, G. Impact of Stage Measurement Errors on Streamflow Uncertainty. Water Resour. Res. 2018, 54, 1952–1976. [Google Scholar] [CrossRef]
Hamilton, A.S.; Moore, R.D. Quantifying Uncertainty in Streamflow Records. Can. Water Resour. J. Rev. Can. Des Ressour. Hydr. 2012, 37, 3–21. [Google Scholar] [CrossRef]
Volk, C.J.; Lucero, Y.; Barnas, K. Why is Data Sharing in Collaborative Natural Resource Efforts so Hard and What can We Do to Improve it? Environ. Manag. 2014, 53, 883–893. [Google Scholar] [CrossRef]
Water Quality Portal. Washington (DC): National Water Quality Monitoring Council, United States Geological Survey (USGS), Environmental Protection Agency (EPA). 2021. Available online: https://doi.org/10.5066/P9QRKUVJ (accessed on 17 January 2023).
Isaak, D.J.; Wenger, S.J.; Peterson, E.E.; Hoef, J.M.V.; Nagel, D.E.; Luce, C.H.; Hostetler, S.W.; Dunham, J.B.; Roper, B.B.; Wollrab, S.P.; et al. The NorWeST Summer Stream Temperature Model and Scenarios for the Western U.S.: A Crowd—Sourced Database and New Geospatial Tools Foster a User Community and Predict Broad Climate Warming of Rivers and Streams. Water Resour. Res. 2017, 53, 9181–9205. [Google Scholar] [CrossRef]

Figure 1. Workflow for distributing metadata template, compiling and QA/QC of metadata, and development of Streamflow Data Catalog. Blue boxes denote stakeholder advisory board input, and arrows denote the aspects of the workflow that required continual revision.

Figure 2. The Streamflow Data Catalog online visualization (https://tableau.usgs.gov/views/Streamflow_Catalog/Introduction (accessed on 3 Faburary 2023)) shows all of the sites in the catalog and provides users the ability to select and filter the sites based on available metadata. Circles represent each monitoring location in the dataset and are scaled to the length of the data record. Note that incomplete metadata, such as missing start and end dates results in displaying the number of monitoring years as zero. While 32 individual organization’s datasets are in the catalog, some organizations are grouped to improve the visualization (e.g., irrigation districts and Oregon Watershed Enhancement Board grantees). The U.S. Forest Service and Idaho Department of Environmental Quality discrete measurements are not shown in this figure as the number of locations precludes the ability to see other locations.

Figure 3. Number of streamflow monitoring locations with continuous, discrete, or unknown measurement intervals in each state. Note, unknown intervals are small percentages but account for 13 locations in Idaho and 36 locations in Oregon.

Figure 4. Streamflow monitoring location status (active, inactive, or unknown) for each continuous monitoring location in each state.

Figure 5. (a) Number of active, continuous streamflow monitoring locations over time operated by the USGS and Streamflow Data Catalog providers in the Pacific Northwest (Idaho, Oregon, and Washington). Note, for the Streamflow Data Catalog, about 1000 locations do not currently have start and end dates, so the number of active gages in the catalog is likely higher; (b) histogram of the period of record for continuous monitoring locations in the dataset. The horizontal bars represent a four-year interval.

Figure 6. Stream characterization represented by the continuous measurement locations.

Table 1. Number of streamflow monitoring locations and interval of measurements in each state.

State	Total	Measurement Interval
State	Total	Continuous	Discrete	Unknown
Idaho	14,000	1321	12,666	13
Oregon	12,976	972	11,968	36
Washington	6291	368	5923	0

Table 2. Status of continuous monitoring locations recorded in the Streamflow Data Catalog in each state.

State	Total	Active	Inactive	Unknown
Idaho	1321	215	37	1069
Oregon	972	254	598	120
Washington	368	119	184	65
PNW Total	2661	588	819	1254

Note: Unknown includes locations that do not have end dates.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kaiser, K.E.; Blasch, K.; Schmitz, S. Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making. Water 2023, 15, 679. https://doi.org/10.3390/w15040679

AMA Style

Kaiser KE, Blasch K, Schmitz S. Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making. Water. 2023; 15(4):679. https://doi.org/10.3390/w15040679

Chicago/Turabian Style

Kaiser, Kendra E., Kyle Blasch, and Steven Schmitz. 2023. "Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making" Water 15, no. 4: 679. https://doi.org/10.3390/w15040679

APA Style

Kaiser, K. E., Blasch, K., & Schmitz, S. (2023). Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making. Water, 15(4), 679. https://doi.org/10.3390/w15040679

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Integration of Distributed Streamflow Measurement Metadata for Improved Water Resource Decision-Making

Abstract

1. Introduction

2. Methods

3. Results

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI