Check Dataset Metadata Quality

Assessment Reports provide automated dataset metadata quality checks based on FAIR (Findable, Accessible, Interoperable, Reusable) data principles.

ESS-DIVE’s Assessment Reports should be used by dataset creators to assess your dataset metadata quality based on FAIR (Findable, Accessible, Interoperable, Reusable) data principles during submission. In order for your dataset to be approved for publication, you’ll need to ensure you pass all required checks and resolve warnings within the assessment report (Figure 1).

Assessment Reports display the compiled outcome of the suite of automated checks that are performed whenever a dataset is submitted on ESS-DIVE. Assessment reports become available once a dataset is submitted on ESS-DIVE. The report, while the dataset is private, is only available to the dataset creator and those who have shared access to the private dataset. The assessment report becomes available to all users on ESS-DIVE once the dataset is published.

The assessment reports are used by ESS-DIVE reviewers to assess the quality of the dataset before publication. Not all dataset metadata checks are contained within the assessment report, but the report provides one form of feedback that you will receive during the dataset review process. For a complete list of dataset metadata checks and requirements, see the Dataset Requirements page.

To access an assessment report, navigate to a dataset landing page on ESS-DIVE and select the "Assessment report" button on the right-hand side of the landing page (Figure 2).

Review Assessment Reports when Submitting Data

Before requesting publication for a dataset, data submitters should review the assessment report to address any failed automated checks or warnings and receive automated feedback on their dataset quality based on FAIR data principles through the assessment report score (e.g., 93/100/100/50). By addressing any failed automated checks or warnings before requesting publication, data submitters can expedite the review and publication process.

Please note that assessment reports can take a few minutes, or up to 24 hours, to generate.

Failed checks indicate that a required field does not follow the automated check criteria (Figure 3). Details on requirements of the check and steps to address the issue will be provided for each failed check. For example, if a dataset does not contain an ORCiD for the dataset contact, the failed check will notify the user that there is no ORCiD present and one should be provided. Similarly if there is no methods section present, the failed check will alert the user that they need to add a methods section within their dataset.

Warning checks notify that an optional field does not follow the automated check criteria (Figure 4). For example, if a user does not provide a textual description of the geographic coverage, the geographic region automated check will alert the user to provide a geographic region description.

Resolving Failed Checks and Warnings

Utilize the instructions provided for each check to resolve any failed checks and warnings. The assessment report is run every time a dataset is submitted so once you revise any checks or warnings you can re-review the assessment report to ensure you’ve addressed all issues.

Note, the assessment report can take some time to re-generate after submitting the datasets. If you are seeing any other error other than the assessment report loading page (Figure 5), please contact ESS-DIVE support.

Assessment Report Checks

The below checks are run on each dataset upon submission as a part of the ESS-DIVE automated check suite. Informational checks appear on the assessment reports within their own section and are not pass/fail.

Review the Dataset Requirements Page for more detailed descriptions, formatting requirements, and examples for the automated checks.

CriteriaRequired/OptionalFAIR Category

Title length between between 7 and 40 words

Required

Findable

Abstract length is at least 100 words

Required

Findable

Keywords vary from title and at least 3 are present

Required

Findable

Publication date is present

Required

Findable

Creators, at least one is present

Required

Findable

Dataset Contact, ensure contact is present and ORCiD is provided

Required

Findable

URLs in metadata resolve correctly

Required

Findable

Start and End Dates are present

Required

Findable

Project name is from controlled list

Optional

Findable

Funding organization "U.S. DOE > Office of Science > Biological and Environmental Research (BER)" is present

Optional

Findable

Geographic Description is present

Optional

Findable

Coordinates describing the point location or geographic area of the dataset are present

Optional

Findable

Metadata Identifier Resolvable

Optional

Accessible

Methods description is more than 7 words in length

Required

Interoperable

Data file formats are non-proprietary

Optional

Reusable

Usage rights is set to Creative Commons CC-BY license

Optional

Reusable

Informational: Number of contacts with email addresses provided

Informational

Findable

Informational: Number of creators with email addresses provided

Informational

Findable

Informational: Count of data entities present

Informational

Interoperable

Last updated