Galiano Island BC Canada Marine Zoology 1893–2021

Latest version published by Biodiversity Data Journal on Jun 23, 2021 Biodiversity Data Journal

Catalogue synthesizing various sources of marine animal occurrence data, documenting species reported for waters around Galiano Island, British Columbia, Canada. Data aggregated from the following sources: 1. British Columbia Cetacean Sightings Network; 2. Canadian Museum of Nature; 3. Chu & Leys (2010, 2012); 4. Erickson (2000); 5. iNaturalist; 6. Pacific Marine Life Surveys; and, 7. Royal British Columbia Museum, dataset extracted from

Data Records

The data in this occurrence resource has been published as a Darwin Core Archive (DwC-A), which is a standardized format for sharing biodiversity data as a set of one or more data tables. The core data table contains 19,651 records.

This IPT archives the data and thus serves as the data repository. The data and resource metadata are available for download in the downloads section. The versions table lists other versions of the resource that have been made publicly available and allows tracking changes made to the resource over time.


Download the latest version of this resource data as a Darwin Core Archive (DwC-A) or the resource metadata as EML or RTF:

Data as a DwC-A file download 19,651 records in English (529 KB) - Update frequency: not planned
Metadata as an EML file download in English (18 KB)
Metadata as an RTF file download in English (15 KB)


The table below shows only published versions of the resource that are publicly accessible.

How to cite

Researchers should cite this work as follows:

Simon A, Basman A (2021): Galiano Island BC Canada Marine Zoology 1893–2021. v1.0. Biodiversity Data Journal. Dataset/Occurrence.


Researchers should respect the following rights statement:

The publisher and rights holder of this work is Biodiversity Data Journal. This work is licensed under a Creative Commons Attribution (CC-BY) 4.0 License.

GBIF Registration

This resource has been registered with GBIF, and assigned the following GBIF UUID: 56687a3d-b48f-44e5-b2b8-d55bafff2f33.  Biodiversity Data Journal publishes this resource, and is itself registered in GBIF as a data publisher endorsed by Participant Node Managers Committee.


Occurrence; Observation


Who created the resource:

Andrew Simon
Institute for Multidisciplinary Ecological Research in the Salish Sea 281 Highland Road V0N 1P0 Galiano Island BC CA 12505395089
Antranig Basman
Research Affilliate
Institute for Multidisciplinary Ecological Research in the Salish Sea

Who can answer questions about the resource:

Andrew Simon
Institute for Multidisciplinary Ecological Research in the Salish Sea 281 Highland Road V0N 1P0 Galiano Island BC CA 12505395089

Who filled in the metadata:

Andrew Simon
Institute for Multidisciplinary Ecological Research in the Salish Sea 281 Highland Road V0N 1P0 Galiano Island BC CA 12505395089

Who else was associated with the resource:

Point Of Contact
Andrew Simon
Institute for Multidisciplinary Ecological Research in the Salish Sea 281 Highland Road V0N 1P0 Galiano Island British Columbia CA 12505395089

Geographic Coverage

Location: Galiano Island, British Columbia, Canada. For a GeoJSON file precisely designating the study area please see:

Bounding Coordinates South West [48.845, -123.618], North East [49.027, -123.291]

Taxonomic Coverage

Taxonomic groups covered include: Porifera, Cnidaria, Ctenophora, Nemertea, Platyhelminthes, Chaetognatha, Mollusca, Annelida, Sipuncula, Arthropoda, Entoprocta, Brachiopoda, Bryozoa, Phoronida, Echinodermata, Chordata

Phylum  Porifera,  Cnidaria,  Ctenophora,  Nemertea,  Platyhelminthes,  Chaetognatha,  Mollusca,  Annelida,  Sipuncula,  Arthropoda,  Entoprocta,  Brachiopoda,  Bryozoa,  Phoronida,  Echinodermata,  Chordata

Temporal Coverage

Living Time Period 1893–2021

Project Data

Over 19,000 marine animal occurrence records from waters around Galiano Island, BC, Canada, dating from 1893 to 2021. Occurrences include dive records collected through the Pacific Marine Life Surveys, museum voucher specimens, ecological data, and crowd-sources observations from the BC Cetacean Sightings Network and iNaturalist platform.

Title The Flora and Fauna of Galiano Island, British Columbia, Canada: Part I. Marine Zoology
Identifier FFGIBCCAN
Study Area Description Galiano Island (Hul'qumi'num: Swiikw) is located on the northwest coast of North America, in a bioregion known as the Salish Sea. The island lies within the traditional territories of Penelakut, Hwlitsum, and Tsawwassen First Nations, and other Hul’qumi’num-speaking peoples. Galiano Island is part of an archipelago that spans the Canada–USA border, falling in the rain shadow of mountains on Vancouver Island, British Columbia, Canada, and the Olympic Peninsula of Washington, USA.
Design Description We synthesized data from multiple sources, which were subject to critical review by contributing experts. Review of the dataset was facilitated through a series of data pipelines developed in JavaScript. Each dataset represents various forms of search effort. We quantified the proportional contributions of each source and evaluated their biases and limitations.

The personnel involved in the project:

Point Of Contact
Andrew Simon

Sampling Methods

Sampling procedures vary according to each data source, including systematic dive records (Pacific Marine Life Surveys), ecological data collected by ROV (Chu and Leys 2010, Chu and Leys 2012), museum voucher specimens (Canadian Museum of Nature, Royal British Columbia Museum), crowd-sourced citizen science observations (British Columbia Cetacean Sightings Network and iNaturalist observations on the Biodiversity Galiano Island Project), and additional reports from the literature (Agassiz 1862, McMurrich 1921, Erickson 2000). Museum specimens may represent collections made through heterogenous methods. All raw catalogues, processing scripts, and processed catalogues contributing to this paper have been tagged in GitHub at

Study Extent Study extent was roughly delimited by bathymetric and geographic boundaries, including: Porlier Pass (north of Galiano Island), Active Pass (south of Galiano Island), the Trincomali Channel (west of Galiano Island), and the Outer Island fault line, which lies in the Strait of Georgia (east of Galiano Island). A KML defining the study area is available: XXXXX.
Quality Control Curation of this dataset was facilitated through a rigorous review of taxonomic summaries and catalogues of occurrence data based around each phylum and data source. The algorithms we designed summarized all taxa represented in source catalogues by phylum. These summaries were then made available in Google Sheets for expert review. Based on the critical remarks added by experts to these taxonomic summaries, catalogues were reviewed and revised as necessary. This iterative process was continued until there was a one-to-one correspondence between taxonomic summaries and catalogues of occurrence records. Our algorithms conserved memory of all modifications, including typographic errors, and taxonomic and nomenclatural changes. iNaturalist observations were thoroughly reviewed and identifications added on the iNaturalist platform. Other sources were modified in collaboration with contributing authorities. Where disagreements arose between our critical review process and occurrence records published by sources such as the Royal BC Museum and Canadian Museum of Nature, we have added critical annotations identifying the discrepancy between species names reported in this dataset vs those reported by institutions. We also reported these discrepancies directly to museum curators. Georeferencing was also reviewed and corrected where appropriate based on the best available metadata. These change are recorded in our finalized catalogue of occurrence records.

Method step description:

  1. The data catalogues contributing to this dataset have been normalised, aligned, corrected, and rendered into visualisations by a collection of open source data processing scripts written in JavaScript. These scripts operate in the following stages: 1. The columns of each source catalogue, imported as CSV, are mapped onto a common core of fields drawn from a subset of the Darwin Core standard, as well as other project-specific fields 2. The taxon name is mapped onto a core backbone by means of a taxon resolution file which resolves preferred taxon names and accounts for typographical errors 3. A dataset id is assigned to every source catalogue, and they are then combined into a single master catalogue  4. This catalogue is filtered to include only the taxa of interest—marine fauna 5. Private or obscured coordinates held in project-specific fields are copied into the principal georeferencing fields 6. A patch of georeferencing corrections is then applied to the resulting coordinates, together with curational notes motivating the corrections 7. The resulting observations are then filtered by the polygon representing the project area 8. The resulting output produces two consolidated CSV files, a catalogue of all observations, and a master summary file 9. The master summary file is then divided into phyla according to the checklist divisions in this paper, for curation by subject matter experts—these are exported into a Google Sheets representation where they may edit them live 10. The subject matter experts add and check authorities for the taxa, add curation notes and resolve taxonomic discrepancies 11. After curation, the Google Sheets are then re-ingested, combined, and converted back into CSV, and compared with the original summary produced at Step 8 12. Any discrepancies between these summaries are fed into the taxon resolution file at Step 2, and, where appropriate, circulated amongst the managers of the source catalogues to incorporate corrections they find desirable 13. The process is then rerun from Step 1 until repeated passes of curation and reconciliation give rise to no further discrepancies at Step 12 The files output at Step 8 of the pipeline form the basis of the map-based data visualisations referenced from this paper, as well our our Darwin Core data submission to the Global Biodiversity Information Facility.

Collection Data

Collection Name Canadian Museum of Nature
Collection Identifier
Parent Collection Identifier CMN
Collection Name Royal BC Museum
Collection Identifier
Parent Collection Identifier RBCM

Additional Metadata

Alternative Identifiers