Title: | Example hydrologic data to support statistical methods in water resources |
---|---|
Description: | Datasets as data.frames and as text files from U.S. Geological Survey publications. |
Authors: | Dave Lorenz [aut], Laura DeCicco [cre] |
Maintainer: | Laura DeCicco <[email protected]> |
License: | CC0 |
Version: | 1.1.2 |
Built: | 2024-11-04 21:42:19 UTC |
Source: | https://github.com/USGS-R/smwrData |
The smwrData package provides example hydrologic data to support statistical methods in water resources.
Package: | USGSwsData |
Type: | Package |
Version: | 1.1.1 |
Date: | 2015-11-05 |
License: | CC0 |
This package contains the following datasets.
AppalachianSpecCap | Well Characteristics |
ChoptankFlow | Daily Streamflow |
ChoptankNH3 | Stream Water Quality |
ConecuhFlows | Annual Streamflow |
CuyahogaTDS | Stream Water Quality |
EasternIowaNO3 | Groundwater Quality |
GlacialRidge | Daily Groundwater Levels |
IonBalance | Stream Water Quality |
KlamathTP | Stream Water Quality |
MayflyNymph | Stream Water Quality |
MC11_1993 | Soil Temperature |
MenomineeMajorIons | Stream Water Quality |
MiningIron | Stream Water Quality |
MiscGW | Groundwater Quality |
PrecipNitrogen | Precipitation Water Quality |
PugetNitrate | Groundwater Quality |
Q05078470 | Daily Streamflow |
Q05078770 | Daily Streamflow |
Qall | Daily Streamflow |
QW05078470 | Stream Water Quality |
QWall | Stream Water Quality |
QWstacked | Stream Water Quality |
SaddlePeaks | Annual Peak Streamflow |
TNLoads | Stream Water Quality |
UraniumTDS | Groundwater Quality |
The folder named misc contains three additional ASCII files. The file TestFull.rdb is an ASCII relational database (RDB) file constructed by James R. Slack (retired USGS). The use of dots as a delimiter for date data is not standard and is no longer supported. The file TestPart.csv is a comma-separated variable file that contains three columns from the TestFull.rdb file. The file TestPart.meta contains meta-information about the columns in TestPart.csv.
Dave Lorenz, with data contributions from James R. Slack and Jim
Tesoriero.
Maintainer: Dave Lorenz <[email protected]>
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in
water resources: U.S. Geological Survey Techniques of Water-Resources
Investigations, book 4, chap. A3, 522 p. Available at
http://pubs.usgs.gov/twri/twri4a3/.
Lorenz, D.L., 2015, smwrData—An R package of example hydrologic data,
version 1.1.1, U.S. Geological Survey Open-File Report 2015–1103, 5 p. Available
at http://pubs.usgs.gov/publication/ofr20151103.
Natural logarithms of specific capacity of wells in four rock types within the Appalachian Mountain region of Pennsylvania.
AppalachianSpecCap
AppalachianSpecCap
Data frame with 200 rows and 2 columns
Name | Type | Description |
LogSpecCap | numeric | Natural log of the specific capacity |
RockType | factor | Type of rock |
Appendix C7 in Helsel and Hirsch (2002).
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(AppalachianSpecCap) # Create simple boxplots by Rock type with(AppalachianSpecCap, boxplot(split(LogSpecCap, RockType), range=0))
data(AppalachianSpecCap) # Create simple boxplots by Rock type with(AppalachianSpecCap, boxplot(split(LogSpecCap, RockType), range=0))
Selected daily flow data for Choptank River near Greensboro, Maryland, USGS station number 01491000.
ChoptankFlow
ChoptankFlow
Data frame with 8035 rows and 5 columns
Name | Type | Description |
agency_cd | character | Agency code |
site_no | character | USGS station number |
datetime | Date | Day |
Flow | numeric | Daily mean streamflow |
Flow_cd | character | Daily mean streamflow code |
Note: the daily mean streamflow rates are in cubic feet per second.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/md/nwis/sw) on 2012-06-19.
data(ChoptankFlow) # The hydrograph for the entire record retrieved par(mfrow=c(2,1), mar=c(4.1, 4.1, 1.1, 1.1)) with(ChoptankFlow, plot(datetime, Flow, type='l')) # The hydrograph for a "typical water year" with(subset(ChoptankFlow, datetime >= "2003-10-01" & datetime <= "2004-09-30"), plot(datetime, Flow, type='l'))
data(ChoptankFlow) # The hydrograph for the entire record retrieved par(mfrow=c(2,1), mar=c(4.1, 4.1, 1.1, 1.1)) with(ChoptankFlow, plot(datetime, Flow, type='l')) # The hydrograph for a "typical water year" with(subset(ChoptankFlow, datetime >= "2003-10-01" & datetime <= "2004-09-30"), plot(datetime, Flow, type='l'))
Selected ammonia concentration data for Choptank River near Greensboro, Maryland, USGS station number 01491000.
ChoptankNH3
ChoptankNH3
Data frame with 182 rows and 13 columns
Name | Type | Description |
agency_cd | character | Agency code |
site_no | character | USGS station number |
sample_dt | Date | Sample date |
sample_tm | character | Sample time |
time_datum_cd | character | Time zone datum |
parm_cd | character | Parameter code |
remark_cd | character | Remark code for the value |
result_va | character | Numeric value of the result |
val_qual_tx | character | Result value qualifier code |
meth_cd | character | Lab method code |
dqi_cd | character | Data-quality indicator code |
rpt_lev_va | character | Reporting level |
rpt_lev_cd | character | Reporting level type |
Note: all concentrations in the column
result_va are in milligrams per liter as nitrogen.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/md/nwis/qwdata) on 2012-06-18. Column names are preserved as retrieved, but unnecessary columns were deleted.
data(ChoptankNH3) # Concentration values plotted over time--black is uncensored, red represents # a less-than value and green are E-coded (less than the reporting level, but # greater than the detection limit). with(ChoptankNH3, plot(sample_dt, result_va, col=as.factor(remark_cd)))
data(ChoptankNH3) # Concentration values plotted over time--black is uncensored, red represents # a less-than value and green are E-coded (less than the reporting level, but # greater than the detection limit). with(ChoptankNH3, plot(sample_dt, result_va, col=as.factor(remark_cd)))
Annual streamflows from 1941 through 1960 for the Conecuh River at Brantley, Alabama, USGS station number 02371500.
ConecuhFlows
ConecuhFlows
Data frame with 20 rows and 2 columns
Name | Type | Description |
Year | integer | Water year |
Flow | integer | Annual mean flow |
Note: the annual mean flow rates are cubic feet per second.
Appendix C2 in Helsel and Hirsch (2002).
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(ConecuhFlows) # The apparent "break" beginning in 1950 is climatological. with(ConecuhFlows, plot(Year, Flow))
data(ConecuhFlows) # The apparent "break" beginning in 1950 is climatological. with(ConecuhFlows, plot(Year, Flow))
U.S. Geological Survey National Stream Quality Accounting Network (NASQAN) station: Cuyahoga River at Independence, Ohio (04208000).
CuyahogaTDS
CuyahogaTDS
Data frame with 80 rows and 4 columns
Name | Type | Description |
MONTH | integer | Month number |
TIME | numeric | Sample collection time, in decimal years |
TDS | integer | Total dissolved solids concentration, milligrams per liter |
Q | integer | Streamflow, in cubic feet per second |
Appendix C9 in Helsel and Hirsch (2002).
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(CuyahogaTDS) # TDS concentration is inversely related to flow with(CuyahogaTDS, plot(Q, TDS, log='xy'))
data(CuyahogaTDS) # TDS concentration is inversely related to flow with(CuyahogaTDS, plot(Q, TDS, log='xy'))
Facilitate the constructing of files documenting R data.frames.
dataRd(obj, file = NULL, obname = deparse(substitute(obj)), format = c("roxygen", "man"))
dataRd(obj, file = NULL, obname = deparse(substitute(obj)), format = c("roxygen", "man"))
obj |
the name of the dataset to document. |
file |
the output filename. |
obname |
a character string giving the name of the object. |
format |
a character string specifying the format of the output; "roxygen" generates a file that is an R script with a roxygen header and should be placed in the R folder and "man" generates a file that is an Rd file and should be placed in the man folder. |
When dataRd
is used within a loop, use of obname
facilitates using character strings for objects.
The object name, from obname
.
The prototype file must be edited to include descriptions of the columns and complete the source and example.
Selected nitrite plus nitrate concentration data from wells in the National Water-Quality Assessment (NAWQA) Program in the eastern Iowa study unit survey.
EasternIowaNO3
EasternIowaNO3
Data frame with 30 rows and 12 columns
Name | Type | Description |
su_code | character | Study unit code |
site_no | character | USGS station number |
C1sample_dt | Date | Cycle 1 sample date |
C1sample_tm | character | Cycle 1 sample time |
C1time_zone | character | Cycle 1 time zone datum |
C1Nitrate.rmk | character | Cycle 1 remark code for nitrite plus nitrate concentration |
C1Nitrate | numeric | Cycle 1 nitrite plus nitrate concentration |
C2sample_dt | Date | Cycle 2 sample date |
C2sample_tm | character | Cycle 2 sample time |
C2time_zone | character | Cycle 2 time zone datum |
C2Nitrate.rmk | character | Cycle 2 remark code for nitrite plus nitrate concentration |
C2Nitrate | numeric | Cycle 2 nitrite plus nitrate concentration |
Note: all
concentrations are in milligrams per liter as nitrogen.
Data retrieved from NWISWeb
(http://nwis.waterdata.usgs.gov/ia/nwis/qw) on 2012-06-18. The data
were rearranged and columns renamed to create matching data from each well.
These data represent the nitrite plus nitrate concentration in a survey in
eastern Iowa. The cycle 1 samples were collected during the summer of 1998,
and the cycle 2 samples were collected during the summer of 2007.
The description of the study unit and the cycle 1 data are described in Sadorf and Linart (2000).
Sadorf, E.M, and Linart, S.M., 2000, Ground-water quality in alluvial aquifers in the eastern Iowa basins, Iowa and Minnesota: U.S. Geological Survey Water-Resources Investigations Report 00-4106, 46 p.
data(EasternIowaNO3) # Show the change from cycle 1 to 2, ignoring censoring par(mar=c(5.1, 12.1, 1.1, 1.1)) plot(EasternIowaNO3$C1Nitrate, seq(nrow(EasternIowaNO3)), ylab="", xlab="Nitrate", yaxt='n', xlim=c(0, 25)) points(EasternIowaNO3$C2Nitrate, seq(nrow(EasternIowaNO3)), col='green') axis(2, at=seq(nrow(EasternIowaNO3)), labels=EasternIowaNO3$site_no, las=1)
data(EasternIowaNO3) # Show the change from cycle 1 to 2, ignoring censoring par(mar=c(5.1, 12.1, 1.1, 1.1)) plot(EasternIowaNO3$C1Nitrate, seq(nrow(EasternIowaNO3)), ylab="", xlab="Nitrate", yaxt='n', xlim=c(0, 25)) points(EasternIowaNO3$C2Nitrate, seq(nrow(EasternIowaNO3)), col='green') axis(2, at=seq(nrow(EasternIowaNO3)), labels=EasternIowaNO3$site_no, las=1)
Daily groundwater data for water year 2008 (October 1, 2007 to September 30, 2008) for selected wells in the Glacial Ridge National Wildlife Refuge in northwestern Minnesota.
GlacialRidge
GlacialRidge
Data frame with 366 rows and 6 columns
Name | Type | Description |
datetime | Date | Day |
G01 | numeric | Data for well G01-R |
G12 | numeric | Data for well G12-R |
G20 | numeric | Data for well G20S-R |
G25 | numeric | Data for well G25-R |
G22 | numeric | Data for well G22S-R |
Note all groundwater levels are depth below land surface in feet.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/gw) on 2012-06-18. The data were rearranged and the columns renamed to the current format.
Cowdery, T. K., Lorenz, D. L, Arntson, D., 2008, Hydrology prior to wetland and prairie restoration in and around the Glacial Ridge National Wildlife Refuge, northwestern Minnesota, 2002–5: U.S Geological Survey Scientific Investigations Report 2007-5200, 68 p.
data(GlacialRidge) # Hydrographs showing the change in variability with depth with(GlacialRidge, plot(datetime, G01, type='l', ylim=c(25, 0), ylab="Depth below landsurface, in feet")) with(GlacialRidge, lines(datetime, G12)) with(GlacialRidge, lines(datetime, G20)) with(GlacialRidge, lines(datetime, G25)) with(GlacialRidge, lines(datetime, G22))
data(GlacialRidge) # Hydrographs showing the change in variability with depth with(GlacialRidge, plot(datetime, G01, type='l', ylim=c(25, 0), ylab="Depth below landsurface, in feet")) with(GlacialRidge, lines(datetime, G12)) with(GlacialRidge, lines(datetime, G20)) with(GlacialRidge, lines(datetime, G25)) with(GlacialRidge, lines(datetime, G22))
Selected ion balance data for County Ditch 65 near Maple Bay, Minnesota (SW2), USGS station number 05079250.
IonBalance
IonBalance
Data frame with 22 rows and 28 columns
Name | Type | Description |
SAMPLE | character | Sample record number |
AGNCY | character | Agency code |
STAID | character | USGS station number |
DATES | Date | Sample date |
TIMES | character | Sample time |
EDATE | Date | Sample end date (all missing) |
ETIME | character | Sample end time (all missing) |
MEDIUM | character | Sample medium code (surface water) |
H_ion | numeric | Hydrogen ion concentration |
Ca | numeric | Calcium ion concentration |
Mg | numeric | Magnesium ion concentration |
Na | numeric | Sodium ion concentration |
K | numeric | Potassium ion concentration |
Fe | numeric | Iron ion concentration (all missing) |
Mn | numeric | Manganese ion concentration (all missing) |
Cation_sum | numeric | Sum of all available cation concentrations |
Alk | numeric | Alkalinity concentration |
Hydrox | numeric | Hydroxide ion concentration (all missing) |
Bicarb | numeric | Bicarbonate ion concentration (all missing) |
Carb | numeric | Carbonate ion concentration (all missing) |
NO2NO3 | numeric | Nitrite plus nitrate ion concentration |
NO2 | numeric | Nitrite ion concentration (all missing) |
NO3 | numeric | Nitrate ion concentration (all missing) |
Cl | numeric | Chloride ion concentration |
SO4 | numeric | Sulfate ion concentration |
F | numeric | Fluoride ion concentration |
Anion_sum | numeric | Sum of all available anion concentrations |
Pct_Diff | numeric | Percent difference beteeen Cation_sum and Anion_sum
|
Note: all
concentrations are in milli-equivalents per liter.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/qw) on 2012-01-19. The data were modified to represt the concentrations in milli-equivalents per liter.
data(IonBalance) # Create a scatter plot matrix of the cations # Shows the correlation amongst those variables. # Note that potassium (K) is negatively correlated to the others! pairs(IonBalance[,c("Ca", "Mg", "Na", "K")])
data(IonBalance) # Create a scatter plot matrix of the cations # Shows the correlation amongst those variables. # Note that potassium (K) is negatively correlated to the others! pairs(IonBalance[,c("Ca", "Mg", "Na", "K")])
Total phosphorus concentrations and streamflow data from the Klamath River near Klamath, California, USGS station number 11530500.
KlamathTP
KlamathTP
Data frame with 80 rows and 6 columns
Name | Type | Description |
site_no | character | USGS station number |
sample_dt | Date | Sample date |
TP_rmk | character | Remark code for total phosphorus concentration |
TP | numeric | Total phosphorus concetration |
Flow | numeric | Streamflow, may be either daily mean or measured |
TP_ss | numeric | Total phosphorus concetration, modified for censoring |
Note: all concentrations are in milligrams per liter as phosphorus, the streamflow rates are in cubic feet per second.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/ca/nwis/qw) on 2012-07-13. The data were retrieved and modified to replicate the results in Smith and others (1982).
Smith, R.A., Hirsch, R.M, and Slack, J.R., 1982, A study of trends in total phosphorus measurements at NASQAN stations: U.S. Geological Circular 2190, 34 p. Also available at http://pubs.usgs.gov/wsp/2190/report.pdf.
data(KlamathTP) par(mfrow=c(2,1), mar=c(5.1, 4.1, .5, .5)) # TP concentration over time (figure 11A in Smith and others (1982)) with(KlamathTP, plot(sample_dt, TP_ss)) abline(.15, -0.0000137) # Coefficients modified for actual graph units # Flow (or discharge) and TP concentration (figure 3A in Smith and others, 1982) with(KlamathTP, plot(Flow, TP_ss)) abline(0.014, 0.0000054) # Note difference in intercept
data(KlamathTP) par(mfrow=c(2,1), mar=c(5.1, 4.1, .5, .5)) # TP concentration over time (figure 11A in Smith and others (1982)) with(KlamathTP, plot(sample_dt, TP_ss)) abline(.15, -0.0000137) # Coefficients modified for actual graph units # Flow (or discharge) and TP concentration (figure 3A in Smith and others, 1982) with(KlamathTP, plot(Flow, TP_ss)) abline(0.014, 0.0000054) # Note difference in intercept
Mayfly nymph counts in small streams above and below an industrial outfall.
MayflyNymph
MayflyNymph
Data frame with 12 rows and 2 columns
Name | Type | Description |
Above | integer | Mayfly nymph counts above the industrial outfall |
Below | integer | Mayfly nymph counts below the industrial outfall |
Table 6.1 in Helsel and Hirsch (2002).
Helsel, D.R. and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(MayflyNymph) # Compare the counts above and below the industrial outfalls boxplot(MayflyNymph, range=0, log='y')
data(MayflyNymph) # Compare the counts above and below the industrial outfalls boxplot(MayflyNymph, range=0, log='y')
Daily soil- and reference-temperature data for observation well MC11 for March 1, 1993 through November 30, 1993.
MC11_1993
MC11_1993
Data frame with 271 rows and 10 columns
Name | Type | Description |
YEAR | numeric | Year of measurement |
JULIAN | numeric | The Julian day of the year of the measurement |
MONTH | numeric | Month of measurement |
DAY | numeric | Day of month of measurement |
TEMP.REF | numeric | Reference temperature in shelter |
TEMP.0.5 | numeric | Soil temperature at 0.5 meter depth |
TEMP.1.0 | numeric | Soil temperature at 1.0 meter depth |
TEMP.1.5 | numeric | Soil temperature at 1.5 meter depth |
TEMP.2.0 | numeric | Soil temperature at 2.0 meter depth |
TEMP.2.5 | numeric | Soil temperature at 2.5 meter depth |
Note: all temperatures are in degrees Celsius.
Extracted from the MC11 file on the CDROM in Landon and others (1997).
Landon, M.K., Delin, G.N., Nelson, K.J., Regan, C.P., Lamb, J.A., Larson, S.J., Capel, P.D., Anderson, J.L., and Dowdy, R.H., 1997, Water-quality and hydrogeologic data used to evaluate the effects of farming systems on ground-water quality at the Management Systems Evaluation Area near Princeton, Minnesota, 1991-95: U.S. Geological Survey Open-File Report 97-22, 31 p.
## Not run: data(MC11_1993) # plot the reference temperature with(MC11_1993, plot(JULIAN, TEMP.REF, type='l')) ## End(Not run)
## Not run: data(MC11_1993) # plot the reference temperature with(MC11_1993, plot(JULIAN, TEMP.REF, type='l')) ## End(Not run)
Concentrations of selected major ions in the Menominee River near McAllister, Wisconsin, USGS station number 04067500.
MenomineeMajorIons
MenomineeMajorIons
Data frame with 37 rows and 25 columns
Name | Type | Description |
agency.cd | character | Agency code |
site.no | character | USGS station number |
sample.dt | Date | Sample date |
medium.cd | character | Sample medium code |
CO3.rmk | character | Remark code for carbonate concentration |
CO3 | numeric | Carbonate concentration |
HCO3.rmk | character | Remark code for bicarbonate concentration |
HCO3 | numeric | Bicarbonate concentration |
Nitrate.rmk | character | Remark code for nitrate concentration |
Nitrate | numeric | Nitrate concentration |
Calcium.rmk | character | Remark code for calcium concentration |
Calcium | numeric | Calcium concentration |
Magnesium.rmk | character | Remark code for magnesium concentration |
Magnesium | numeric | Magnesium concentration |
Sodium.rmk | character | Remark code for sodium concentration |
Sodium | numeric | Sodium concentration |
Potassium.rmk | character | Remark code for potassium concentration |
Potassium | numeric | Potassium concentration |
Chloride.rmk | character | Remark code for chloride concentration |
Chloride | numeric | Chloride concentration |
Sulfate.rmk | character | Remark code for sulfate concentration |
Sulfate | numeric | Sulfate concentration |
Fluoride.rmk | character | Remark code for fluoride concentration |
Fluoride | numeric | Fluoride concentration |
season | factor | Season |
Note: all
concentrations are in milligrams per liter as the constituent, except
nitrate concentrations are in milligrams per liter as nitrogen.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/wi/nwis/qw) on 2002-12-30. Data were rearranged and columns renamed to the current format.
data(MenomineeMajorIons) # Create a scatter plot matrix of the major anions # Nitrate and Fluoride have censored values and single high values that do not plot well pairs(MenomineeMajorIons[,c("HCO3", "Chloride", "Sulfate")])
data(MenomineeMajorIons) # Create a scatter plot matrix of the major anions # Nitrate and Fluoride have censored values and single high values that do not plot well pairs(MenomineeMajorIons[,c("HCO3", "Chloride", "Sulfate")])
Iron concentrations at low flow for small eastern Ohio streams.
MiningIron
MiningIron
Data frame with 241 rows and 4 columns
Name | Type | Description |
Iron | numeric | Iron concentration |
Rock | factor | Rock type of the stream |
MineType | factor | Description of mining activity |
C5 | factor | Used to construct table C5 |
Appendix C6 in Helsel and Hirsch (2002).
Note: A different set of
randomly selected values are used to reconstruct table C5 than what was
selected in Helsel and Hirsch (2002).
Helsel, D.R., 1983, Mine drainage and rock type influences on
eastern Ohio stream water quality: Water Resources Bulletin, v. 19, no. 6,
p. 881–887.
Helsel, D.R., and Hirsch, R.M., 2002, Statistical
methods in water resources: U.S. Geological Survey Techniques of
Water-Resources Investigations, book 4, chap. A3, 522 p.
data(MiningIron) # Create simple boxplots by rock type and by mining activity par(mfrow=c(2,1), las=1) with(MiningIron, boxplot(split(Iron, Rock), range=0, log='y')) with(MiningIron, boxplot(split(Iron, MineType), range=0, log='y'))
data(MiningIron) # Create simple boxplots by rock type and by mining activity par(mfrow=c(2,1), las=1) with(MiningIron, boxplot(split(Iron, Rock), range=0, log='y')) with(MiningIron, boxplot(split(Iron, MineType), range=0, log='y'))
Groundwater-quality data from miscellaneous wells in the United States.
MiscGW
MiscGW
Data frame with 4 rows and 10 columns
Name | Type | Description |
Calcium | numeric | Calcium concentration |
Magnesium | numeric | Magnesium concentration |
Sodium | numeric | Sodium concentration |
Potassium | numeric | Potassium concentration |
Carbonate | numeric | Carbonate concentration |
Bicarbonate | numeric | Bicarbonate concentration |
Sulfate | numeric | Sulfate concentration |
Chloride | numeric | Chloride concentration |
Fluoride | numeric | Fluoride concentration |
Nitrate | numeric | Nitrate concentration |
Notes: all concentrations are in milligrams per liter; for combined sodium and potassium in the original tables, the concentration was assumed to be sodium in these data and potassium is set to 0.
Extracted from Hem (1989), to replicate the data in figure 37.
Hem J.D., 1989, Study and interpretation of the chemical characteristics of natural water: U.S. Geological Survey Water-Supply Paper 2254, 263 p.
## Not run: data(MiscGW) # The sodium-potassium data MiscGW[, c("Sodium", "Potassium")] ## End(Not run)
## Not run: data(MiscGW) # The sodium-potassium data MiscGW[, c("Sodium", "Potassium")] ## End(Not run)
Ammonia plus organic nitrogen concentrations in precipitation.
PrecipNitrogen
PrecipNitrogen
Data frame with 20 rows and 2 columns
Name | Type | Description |
NH3plusOrganic | numeric | Concentration of ammonia plus organic nitrogen |
Site | factor | Land-use type |
Note: all concentrations are in milligrams per liter as nitrogen.
Example data in chapter 5 in Helsel and Hirsch (2002)
Helsel, D.R. and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(PrecipNitrogen) # Compare concentrations between residential and industrial land uses with(PrecipNitrogen, boxplot(split(NH3plusOrganic, Site), range=0, log='y'))
data(PrecipNitrogen) # Compare concentrations between residential and industrial land uses with(PrecipNitrogen, boxplot(split(NH3plusOrganic, Site), range=0, log='y'))
Selected groundwater nitrate and ancillary data collected near Puget Sound, Washington.
PugetNitrate
PugetNitrate
Data frame with 1967 rows and 8 columns
Name | Type | Description |
wellid | integer | Well identification number |
l10 | numeric | Percent urban land use wthin 3.2 km of well |
l20 | numeric | Percent agricultural land use wthin 3.2 kilometers of well |
l40 | numeric | Percent Forested land wthin 3.2 kilometers of well |
surfgeo | character | Description of surficial geology at well |
date | Date | Sample date |
nitrate | numeric | Nitrate concentration, in milligrams per liter |
wellmet | numeric | Well depth, in meters |
For surfgeo
, "Fine" and "Coarse"
refer to fine-grained and coarse-grained glacial deposits.
Some censored
nitrate concentrations are recorded as 0 in nitrate
but the original
data had multiple reporting levels of 0.05, 0.1, an 0.2. Therefore, all
values less than or equal to 0.2 should be considered potentially censored.
Data provided by Jim Tesoriero.
Tesoriero, A.J., and Voss, F.D., 1997, Predicting the probability of elevated nitrate concentrations in the Puget Sound Basin—Implications for aquifer susceptibility and vulnerability: Groundwater, v. 35, no. 6, p. 1029–1039.
## Not run: data(PugetNitrate) # Suppress plotting the high value of 732 in Coarse with(PugetNitrate, boxplot(split(pmin(nitrate, 50), surfgeo))) ## End(Not run)
## Not run: data(PugetNitrate) # Suppress plotting the high value of 732 in Coarse with(PugetNitrate, boxplot(split(pmin(nitrate, 50), surfgeo))) ## End(Not run)
Daily mean flow for Judicial Ditch 64 near Mentor, Minnesota (SW4), USGS station number 05078470 for calendar year 2003.
Q05078470
Q05078470
Data frame with 365 rows and 2 columns
Name | Type | Description |
DATES | Date | Day |
FLOW | numeric | Daily mean streamflow |
Note: the daily mean streamflow rates are in cubic feet per second.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/sw) on 2005-01-05.
data(Q05078470) # Plot the data for station 05078470 with(Q05078470, plot(DATES, FLOW, type='l'))
data(Q05078470) # Plot the data for station 05078470 with(Q05078470, plot(DATES, FLOW, type='l'))
Daily mean flow for Judicial Ditch 66 near Marcoux Corners, Minnesota (SW6), USGS station number 05078770 for calendar year 2003.
Q05078770
Q05078770
Data frame with 365 rows and 2 columns
Name | Type | Description |
DATES | Date | Day |
FLOW | numeric | Daily mean streamflow |
Note: the daily mean streamflow rates are in cubic feet per second.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/sw) on 2005-01-05.
data(Q05078770) # Plot the data for station 05078770 with(Q05078770, plot(DATES, FLOW, type='l', log='y'))
data(Q05078770) # Plot the data for station 05078770 with(Q05078770, plot(DATES, FLOW, type='l', log='y'))
Daily mean flow for selected USGS streamgages for calendar year 2003.
Qall
Qall
Data frame with 365 rows and 3 columns
Name | Type | Description |
DATES | Date | Day |
Flow | numeric | Daily mean streamflow |
STAID | character | USGS station number |
Note: the daily mean streamflow rates are in cubic feet per second.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/sw) on 2005-01-05.
data(Qall) # Plot the data for station 05078770 with(subset(Qall, STAID == "05078770"), plot(DATES, Flow, type='l')) # Overlay the data for station 05078470, some zero flows in this station with(subset(Qall, STAID == "05078470"), lines(DATES, Flow, col='red'))
data(Qall) # Plot the data for station 05078770 with(subset(Qall, STAID == "05078770"), plot(DATES, Flow, type='l')) # Overlay the data for station 05078470, some zero flows in this station with(subset(Qall, STAID == "05078470"), lines(DATES, Flow, col='red'))
Selected water-quality data for Judicial Ditch 64 near Mentor, Minnesota (SW4), USGS station number 05078470 for calendar year 2003.
QW05078470
QW05078470
Data frame with 11 rows and 4 columns
Name | Type | Description |
DATES | Date | Sample date |
TIMES | character | Sample time |
R00665 | character | Remark code for total phosphorus concentration |
P00665 | numeric | Total phosphorus concentration |
Note: all concentrations are in milligrams per liter as
phosphorus.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/qw) on 2005-01-05.
data(QW05078470) # Plot the data with(QW05078470, plot(DATES, P00665, log='y'))
data(QW05078470) # Plot the data with(QW05078470, plot(DATES, P00665, log='y'))
Whole-water or total phosphorus data for selected USGS streamgages for calendar year 2003.
QWall
QWall
Data frame with 23 rows and 4 columns
Name | Type | Description |
STAID | character | USGS station number |
DATES | Date | Sample date |
TIMES | character | Sample time |
R00665 | character | Remark code for total phosphorus concentration |
P00665 | numeric | Total phosphorus concentration |
Note: all
concentrations are in milligrams per liter as phosphorus.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/qw) on 2005-01-05.
data(QWall) # Plot the data for station 05078770 with(subset(QWall, STAID == "05078770"), plot(DATES, P00665, log='y', ylim=c(0.001, 0.2))) # Overlay the data for station 05078470, smaller values and missing the sample in late April. with(subset(QWall, STAID == "05078470"), points(DATES, P00665, col='red'))
data(QWall) # Plot the data for station 05078770 with(subset(QWall, STAID == "05078770"), plot(DATES, P00665, log='y', ylim=c(0.001, 0.2))) # Overlay the data for station 05078470, smaller values and missing the sample in late April. with(subset(QWall, STAID == "05078470"), points(DATES, P00665, col='red'))
Selected water-quality data for Judicial Ditch 64 near Mentor, Minnesota (SW4), USGS station number 05078470 for calendar year 2003.
QWstacked
QWstacked
Data frame with 46 rows and 22 columns
Name | Type | Description |
agency_cd | character | Agency code |
site_no | character | USGS station number |
sample_dt | Date | Sample date |
sample_tm | character | Sample time |
sample_end_dt | Date | Sample end date |
sample_end_tm | character | Sample end time |
sample_start_time_datum_cd | character | Time zone datum |
tm_datum_rlbty_cd | character | Time datum reliability code |
coll_ent_cd | character | Collecting agency code |
medium_cd | character | Sample medium code |
tu_id | character | Taxonomic unit code |
body_part_id | character | Body part code |
parm_cd | character | Analyte parameter code |
remark_cd | character | Remark code for the result |
result_va | character | Numeric value of the result |
val_qual_tx | character | Result value qualifier code |
meth_cd | character | Lab method code |
dqi_cd | character | Data-quality indicator code |
rpt_lev_va | character | Reporting level |
rpt_lev_cd | character | Reporting level type |
lab_std_va | character | Lab standard deviation |
anl_ent_cd | character | Analyzing agency code |
Note: all concentrations in the
column result_va are in milligrams per liter as phosphorus.
Data retrieved from NWISWeb (http://nwis.waterdata.usgs.gov/mn/nwis/qwdata) on 2012-04-06. The format of water-quality data is often retrieved by result–each row represents the data for a single analyte. Note that many columns are part of the generic data retrieval and do not pertain specifically to these data.
data(QWstacked) # Plot the data for whole-water phosphorus with(subset(QWstacked, parm_cd == "00665"), plot(sample_dt, result_va)) # Overlay the data for dissolved phosphorus (should be smaller values) with(subset(QWstacked, parm_cd == "00666"), points(sample_dt, result_va, col='red'))
data(QWstacked) # Plot the data for whole-water phosphorus with(subset(QWstacked, parm_cd == "00665"), plot(sample_dt, result_va)) # Overlay the data for dissolved phosphorus (should be smaller values) with(subset(QWstacked, parm_cd == "00666"), points(sample_dt, result_va, col='red'))
Annual peak discharges, 1925–1989, for the Saddle River at Lodi, New Jersey, USGS station number 01391500.
SaddlePeaks
SaddlePeaks
Data frame with 65 rows and 2 columns
Name | Type | Description |
Year | integer | Water year |
Flow | integer | Annual peak flow |
Note: the annual peak flow rates are in cubic feet per second.
Appendix C1 in Helsel and Hirsch (2002). Channelization occurred in 1968.
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(SaddlePeaks) # The data are approximately log-normally distributed qqnorm(SaddlePeaks$Flow, log='y') with(SaddlePeaks, abline(mean(log10(Flow)), sd(log10(Flow))))
data(SaddlePeaks) # The data are approximately log-normally distributed qqnorm(SaddlePeaks$Flow, log='y') with(SaddlePeaks, abline(mean(log10(Flow)), sd(log10(Flow))))
Total nitrogen loads and basin characteristics for urban runoff.
TNLoads
TNLoads
Data frame with 42 rows and 9 columns
Name | Type | Description |
LOGTN | numeric | Log10 of (total nitrogen load, in pounds) |
LOGDA | numeric | Log10 of (drainage area, in square miles) |
LOGIMP | numeric | Log10 of (percentage impervious surface area [unitless]) |
MJTEMP | numeric | Mean minimum January temperature, in degrees Fahrenheit |
MSRAIN | numeric | Mean seasonal rainfall, in inches |
PRES | integer | Percentage of basin that is residential [unitless] |
PNON | integer | Percentage of basin that is non-urban [unitless] |
PCOMM | integer | Percentage of basin that is commercial [unitless] |
PIND | integer | Percentage of basin that is industrial [unitless] |
Appendix C15 in Helsel and Hirsch (2002), from Mustard and others (1987).
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in
water resources: U.S. Geological Survey Techniques of Water-Resources
Investigations, book 4, chap. A3, 522 p.
Mustard, D.C., Driver, N.E.,
Chyr, J., and Hansen, B.G., 1987, U.S. Geological Survey urban stormwater
data base of constituent storm loads: U.S. Geological Survey Water-Resources
Investigations Report 87-4036, 328 p.
data(TNLoads) # Create a scatter plot matrix of the first 5 columns # Shows the correlation amongst those variables. # The remaining variables are even more discretized than MSRAIN pairs(TNLoads[,1:5])
data(TNLoads) # Create a scatter plot matrix of the first 5 columns # Shows the correlation amongst those variables. # The remaining variables are even more discretized than MSRAIN pairs(TNLoads[,1:5])
Uranium and total dissolved solids in groundwaters of differing bicarbonate concentrations.
UraniumTDS
UraniumTDS
Data frame with 44 rows and 3 columns
Name | Type | Description |
TDS | numeric | Total dissolved solids concentration, in milligrams per liter |
Uranium | numeric | Uranium concentration, in parts per billion |
HCO3 | factor | Bicarbonate concentration relative to total anions |
Appendix C16 in Helsel and Hirsch (2002).
Helsel, D.R., and Hirsch, R.M., 2002, Statistical methods in water resources: U.S. Geological Survey Techniques of Water-Resources Investigations, book 4, chap. A3, 522 p.
data(UraniumTDS) # The concentration of bicarbonate affects the relation between Uranium and TDS # Bicarbonate concentration > 50% is plotted in red with(UraniumTDS, plot(Uranium, TDS, col=HCO3, log='y'))
data(UraniumTDS) # The concentration of bicarbonate affects the relation between Uranium and TDS # Bicarbonate concentration > 50% is plotted in red with(UraniumTDS, plot(Uranium, TDS, col=HCO3, log='y'))