A workflow for standardising and integrating alien species distribution data

dc.contributor.author: Seebens, Hanno [en_ZA]
dc.contributor.author: Clarke, David A. [en_ZA]
dc.contributor.author: Groom, Quentin [en_ZA]
dc.contributor.author: Wilson, John R. U. [en_ZA]
dc.contributor.author: Garcia-Berthou, Emili [en_ZA]
dc.contributor.author: Kuhn, Ingolf [en_ZA]
dc.contributor.author: Roige, Mariona [en_ZA]
dc.contributor.author: Pagad, Shyama [en_ZA]
dc.contributor.author: Essl, Franz [en_ZA]
dc.contributor.author: Vicente, Joana [en_ZA]
dc.contributor.author: Winter, Marten [en_ZA]
dc.contributor.author: McGeoch, Melodie [en_ZA]
dc.date.accessioned: 2020-08-07T07:40:02Z
dc.date.accessioned: 2021-08-24T14:50:17Z
dc.date.available: 2020-08-07T07:40:02Z
dc.date.available: 2021-08-24T14:50:17Z
dc.date.issued: 2020-07-20
dc.description: CITATION: Seebens, H. et al. 2020. A workflow for standardising and integrating alien species distribution data. NeoBiota 59, 39-59, doi:10.3897/neobiota.59.53578.
dc.description.abstract: ENGLISH ABSTRACT: Biodiversity data are being collected at unprecedented rates. Such data often have significant value for purposes beyond the initial reason for which they were collected, particularly when they are combined and collated with other data sources. In the field of invasion ecology, however, integrating data represents a major challenge due to the notorious lack of standardisation of terminologies and categorisations, and the application of deviating concepts of biological invasions. Here, we introduce the SInAS workflow, short for Standardising and Integrating Alien Species data. The SInAS workflow standardises terminologies following Darwin Core, location names using a proposed translation table, taxon names based on the GBIF backbone taxonomy, and dates of first records based on a set of predefined rules. The output of the SInAS workflow provides various entry points that can be used both to improve coherence among the databases and to check and correct the original data. The workflow is flexible and can be easily adapted and extended to the needs of different users. We illustrate the workflow using a case-study integrating five widely used global databases of information on biological invasions. The comparison of the standardised databases revealed a surprisingly low degree of overlap, which indicates that the amount of data may currently not be fully exploited in the original databases. We highly recommend the use and development of publicly available workflows to ensure that the integration of databases is reproducible and transparent. Workflows, such as SInAS, ultimately increase trust in data, study results, and conclusions. [en_ZA]
dc.description.version: Publisher's version
dc.format.extent: 21 pages : illustrations, maps [en_ZA]
dc.identifier.citation: Seebens, H., Clarke, D.A., Groom, Q., Wilson, J.R.U., García-Berthou, E., Kühn, I., Roigé, M., Pagad, S., Essl, F., Vicente, J., Winter, M. and McGeoch, M. (2020). A workflow for standardising and integrating alien species distribution data. NeoBiota 59, 39-59. [en_ZA]
dc.identifier.issn: 1314-2488
dc.identifier.uri: http://hdl.handle.net/10019.1/112353
dc.language.iso: en [en_ZA]
dc.publisher: Pensoft
dc.rights.holder: Authors retain copyright
dc.subject.lcsh: Biodiversity -- Data processing [en_ZA]
dc.subject.lcsh: Darwin Core [en_ZA]
dc.subject.lcsh: Biological invasions -- Databases [en_ZA]
dc.subject.lcsh: Alien species -- Geographical distribution [en_ZA]
dc.subject.lcsh: Standardization [en_ZA]
dc.subject.lcsh: Workflow -- Data processing -- Evaluation [en_ZA]
dc.title: A workflow for standardising and integrating alien species distribution data [en_ZA]
dc.type: Article [en_ZA]
Files
Original bundle
Name: Wilson_NeoBiota_2020.pdf
Size: 1.08 MB
Format: Adobe Portable Document Format