Files for use with the R script accompanying the paper Cooper et al. (2018). Note that this script also uses files from https://doi.org/10.14466/CefasDataHub.34 (details provided in script). Cooper, K.M., Bolam, S.G., Downie, A. Callaway, A., Barry, J. (2018). Biological-based habitat classification approaches promote cost-efficient monitoring: an example using seabed assemblages. Journal of Applied Ecology. Files include: R SCRIPT FINAL.R (R script)
C5922DATASETFAM13022017REDACTED.csv (see below for description) UKSeaMap2016_SedimentsDissClip.shp (UK Seamap data clipped to study area. These data are available from http://jncc.defra.gov.uk/ukseamap under an Open Government Licence)) StudyArea.shp (polygon for study area) FaunalCluster.tif (faunal cluster habitat map in raster format) PhysicalCluster.tif (physical cluster habitat map in raster format) FaunalClusterClip.tif (faunal cluster habitat map, clipped to study area, in raster format) PhysicalClusterClip.tif (physical cluster habitat map, clipped to study area, in raster format) Description of C5922DATASETFAM13022017REDACTED.csv This file is based on the RSMP dataset (see https://www.cefas.co.uk/cefas-data-hub/dois/rsmp-baseline-dataset/), but with macrofaunal data output at the level of family or above. A variety of gear types have been used for sample collection including grabs (0.1m2 Hamon, 0.2m2 Hamon, 0.1m2 Day, 0.1m2 Van Veen and 0.1m2 Smith McIntrye) and cores. Of these various devices, 93% of samples were acquired using either a 0.1m2 Hamon grab or a 0.1m2 Day grab. Sieve sizes used in sample processing include 1mm and 0.5mm, reflecting the conventional preference for 1mm offshore and 0.5mm inshore (see Figure 2). Of the samples collected using either a 0.1m2 Hamon grab or a 0.1m2 Day grab, 88% were processed using a 1mm sieve. Taxon names were standardised according to the WoRMS (World Register of Marine Species) list using the Taxon Match Tool (http://www.marinespecies.org/aphia.php?p=match). Of the initial 13,449 taxon names, only 774 remained after correction and aggregation to family level. The final dataset comprises of a single sheet comma-separated values (.csv) file. Colonials accounted for less than 20% of the total number of taxa and, where present, were given a value of 1 in the dataset. This component of the fauna was missing from 325 out of the 777 surveys, reflecting either a true absence, or simply that colonial taxa were ignored by the analyst. Sediment particle size data were provided as percentage weight by sieve mesh size, with the dataset including 99 different sieve sizes. Sediment samples have been processed using sieve, and a combination of sieve and laser diffraction techniques. Key metadata fields include: Sample coordinates (Latitude & Longitude), Survey Name, Gear, Date, Grab Sample Volume (litres) and Water Depth (m). A number of additional explanatory variables are also provided (salinity, temperature, chlorophyll a, Suspended particulate matter, Water depth, Wave Orbital Velocity, Average Current, Bed Stress). In total, the dataset dimensions are 33,198 rows (samples) x 900 columns (variables/factors), yielding a matrix of 29,878,200 individual data values.