<?xml version="1.0" encoding="UTF-8"?>
<resource xmlns="http://datacite.org/schema/kernel-4" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="DOI">10.18453/rosdok_id00002897</identifier>
  <creators>
    <creator>
      <creatorName nameType="Personal">Janßen, René</creatorName>
      <givenName>René</givenName>
      <familyName>Janßen</familyName>
      <nameIdentifier nameIdentifierScheme="GND" schemeURI="http://d-nb.info/gnd/">http://d-nb.info/gnd/1225131375</nameIdentifier>
      <nameIdentifier nameIdentifierScheme="ORCID" schemeURI="https://orcid.org/">https://orcid.org/0000-0002-0605-6860</nameIdentifier>
    </creator>
  </creators>
  <titles>
    <title>Machine learning classification of microbial community compositions to predict anthropogenic pollutants in the Baltic Sea</title>
  </titles>
  <publisher>Universität Rostock</publisher>
  <publicationYear>2020</publicationYear>
  <resourceType resourceTypeGeneral="Text" />
  <subjects>
    <subject xml:lang="en" schemeURI="http://dewey.info/" subjectScheme="dewey">004 Data processing Computer sciences</subject>
    <subject xml:lang="en" schemeURI="http://dewey.info/" subjectScheme="dewey">570 Life science</subject>
  </subjects>
  <dates>
    <date dateType="Created">2020</date>
  </dates>
  <language>en</language>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="PURL">http://purl.uni-rostock.de/rosdok/id00002897</alternateIdentifier>
    <alternateIdentifier alternateIdentifierType="URN">urn:nbn:de:gbv:28-rosdok_id00002897-8</alternateIdentifier>
  </alternateIdentifiers>
  <descriptions>
    <description descriptionType="Abstract">Microbial communities react rapidly and specifically to changing environments, indicating distinct microbial fingerprints for a given environmental state. Machine learning with community data predicted the Baltic Sea-detected pollutants glyphosate and 2,4,6-trinitrotoluene, using the developed R package “phyloseq2ML”. Predictions by Random Forest and Artificial Neural Network were accurate. Relevant taxa were identified. The interpretability of machine learning models was found of particular importance. Microbial communities predicted even minor influencing factors in complex environments.</description>
  </descriptions>
</resource>
