Computing SE for PSD indices

Posted on December 9, 2016 by fishR Blog in R bloggers | 0 Comments

[This article was first published on fishR Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

A user of my Introductory Fisheries Analyses with R book recently asked me how to compute standard errors (SE) for the various PSD indices by using the usual equation for the SE of a proportion (square root of p times (1-p) divided by n). Below is my response using the Inch Lake data from the book.

Load required packages.

library(FSA)
library(magrittr)
library(dplyr)

Load (and prepare) the Inch Lake data used in the Size Structure chapter of the book (this and next step is from this script).

inchAlls <- read.csv("http://derekogle.com/IFAR/scripts/InchLake1113.csv") %>%
  mutate(gcat=psdAdd(tl,species)) %>%
  filterD(gcat!="substock")

## No known Gabelhouse (PSD) lengths for Iowa Darter

Compute the PSD-X values.

( freq <- xtabs(~species+gcat,data=inchAlls) )

##                  gcat
## species           stock quality preferred memorable
##   Black Crappie      33      40         8        81
##   Bluegill          571     199       223        43
##   Largemouth Bass    26      36         0         0
##   Pumpkinseed         5       2         0         0
##   Yellow Perch        4       0         0         1

iPSDs <- prop.table(freq,margin=1)*100
( PSDs <- t(apply(iPSDs,MARGIN=1,FUN=rcumsum)) )

##                  gcat
## species           stock  quality preferred memorable
##   Black Crappie     100 79.62963  54.93827 50.000000
##   Bluegill          100 44.88417  25.67568  4.150579
##   Largemouth Bass   100 58.06452   0.00000  0.000000
##   Pumpkinseed       100 28.57143   0.00000  0.000000
##   Yellow Perch      100 20.00000  20.00000 20.000000

Compute the row sums for the freq table to get the total number of fish (that are stock size or greater). These will be n in the SE calculations.

( sums <- rowSums(freq) )

##   Black Crappie        Bluegill Largemouth Bass     Pumpkinseed 
##             162            1036              62               7 
##    Yellow Perch 
##               5

Compute the SE for PSD-Q.

SEs <- sqrt(PSDs[,"quality"]*(100-iPSDs[,"quality"])/sums)
round(SEs,1)

##   Black Crappie        Bluegill Largemouth Bass     Pumpkinseed 
##             6.1             1.9             6.3            17.1 
##    Yellow Perch 
##            20.0

Repeat for other size indices. For example, for PSD-P.

SEs <- sqrt(PSDs[,"preferred"]*(100-iPSDs[,"preferred"])/sums)
round(SEs,1)

##   Black Crappie        Bluegill Largemouth Bass     Pumpkinseed 
##             5.7             1.4             0.0             0.0 
##    Yellow Perch 
##            20.0

Note that the SE formula is usually used for proportions (and then multiplied by 100 to get a SE for percentages). However, it can also be shown algebraically to be useful for percentages.

I would caution against using these standard errors to produce confidence intervals for the PSD-X values. If confidence intervals are the end-product for this analysis, then I suggest using the methods described in my book.

To leave a comment for the author, please follow the link and comment on their blog: fishR Blog.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

R-bloggers

R news and tutorials contributed by hundreds of R bloggers

Computing SE for PSD indices

Related

Related

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)