Blog

Data analytics, statistics, and more

Trend Detection Using Survival Analysis

Non-detects in environmental data can complicate analysis if not handled properly, leading to incorrect conclusions. The mathematical structure of survival analysis is general enough that it can be used in diverse fields examining various types of data not typically associated with survival/death events or failure analysis. In this post, survival analysis methods will be applied to fit a censored linear regression model to weekly ammonium deposition data to assess temporal trends.

March 7, 2024

Arctic Sea Ice Time Series Analysis

The modeltime R library offers a wide range of features for model evaluation, selection, and forecasting using the tidymodels ecosystem. Time-series analysis of sea ice in the Arctic polar regions performed using the modeltime library suggests that the Arctic sea will be nearly ice-free in the very near future.

February 28, 2024

Temporal Behavior of Arctic Sea Ice

Passive-microwave instrumentation on satellites has allowed for the monitoring of Arctic sea ice coverage since the late 1970s, showing a long-term downward trend due to both natural variability and climate change. The rate of decline in Arctic sea ice has varied over the past 43 years, with some periods showing faster rates of loss than others. This blog post explores the temporal changes in Arctic sea ice extent using the R language for statistical computing and visualization.

February 22, 2024

Comparison of Random and Geographically Stratified Sampling

The post compares differences between simple random sampling and geographically stratified sampling. Stratified random sampling improves the spatial distribution of sample point locations by stratifying the field geographically and sampling randomly within each stratum. Different sampling patterns are compared using Monte Carlo simulations on a simulated population.

December 20, 2023

Analyses of Pesticide Concentrations in California Surface Waters

Environmental data are frequently left-censored, indicating that some values are less than the limit of quantification for the analytical methods employed. These data are problematic because censored (non-detect) values are known only to range between zero and the censoring limit. This complicates analysis of the data, including estimating statistical parameters, characterizing data distributions, and conducting inferential statistics. This post demonstrates various procedures and methods that are available in R for analyzing data containing a mixture of detects and non-detects. These methods make few or no assumptions about the data, or substitute arbitrary values (e.g., one-half the detection or reporting limit) for the non-detects.

September 16, 2023