Blog

Data analytics, statistics, and more

Creating Static Maps Using R

Use the functionality of R and R packages to create both simple maps and complex maps containing many different layers.

September 18, 2019

Testing Group Differences with Data Containing Non-detects

Often data from more than two groups needs to be evaluated, usually on the basis of a representative value from each group. This post examines the use of survival analysis techniques to test whether surface water samples containing a high frequency of censored (non-detect) values differ in dissolved lead concentration between various watersheds.

September 13, 2019

Outlier Detection Using Machine Learning

There is no precise way to define and identify outliers in general because of the specifics of each dataset. This post evaluates three methods for multivariate outlier detection, including Mahalanobis distance (a multivariate extension to standard univariate tests) and two machine learning (clustering) techniques.

September 9, 2019

Introduction to Statistical Intervals

The issue of uncertainty in estimating population parameters from data samples is often addressed using statistical intervals. The three types of statistical interval differ in their definitions as well as their typical applications. It is important to fully understand the assumptions and limitations underlying the use, interpretation, and calculation of statistical intervals before applying them.

August 6, 2019

Power of the Mann-Kendall Test

An important objective of many environmental monitoring programs is to detect changes or trends in constituent concentrations over time. The Mann-Kendell test is one of the most popular nonparametric tests for determining temporal trend. This post evaluates the power of the Mann-Kendall test to identify a trend for various sample sizes and variability in the data using Monte Carlo simulation.

July 21, 2019