Generally, all papers should follow this outline:
Title - A short title appropriate for the data.
Structured abstract - A ~300-word abstract describing the data, how it supports one or more of the U.N. Goals and suggested ways the data can be used.
ORCID, DOI, etc.
Key Words - Terms that facilitate indexing, metadata tagging, etc.
U.N. Goal ID - Select the single goal most relevant to the data. This is used for organizing the journal by goal topics. Add any secondary goals for indexing.
Introduction - Sales pitch for the value of the data.
Data Provenance - Discuss the origin of the data, the purpose for which it was collected, etc. Similar to the first half of a research paper. Should include the steps taken to protect privacy while preserving usefulness.
Source Data Description - A normalized version of the original source data that has been tidied, but not yet transformed. Expect a table with key definitions and summary reports for each table in the source data set.
Enhanced Analysis Data - Data that has been prepped for analysis, including cleaning, addressing missing data, transformation, feature extraction and feature enrichments designed to facilitate analysis. Think of this as the steps a data scientist would take before analysis. Expect a clear report of how the data has been tidied, cleaned and prepared for analysis.
Data Usefulness - A deeper discussion of how the data can help meet one or more U.N. Sustainable Development Goals (SDGs), with suggested analysis methods and approaches to try.
A link to the complete data set, including both the source data and enhanced data. Should also link to other papers published on this data or related to it (updated later) to help build the ecosystem.
References, acknowledgments, disclosures
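
As an illustration of what the Source Data Description and Enhanced Analysis Data sections might report, here is a minimal Python sketch using only the standard library. Everything in it is an assumption made for illustration: the table, the column names (region, households_surveyed, clean_water_access_pct), the values, and the helper functions are hypothetical, and median imputation is just one possible way of addressing missing data, not a required method.

```python
import csv
import io
import statistics

# Hypothetical source table; column names and values are illustrative only.
SOURCE_CSV = """region,year,households_surveyed,clean_water_access_pct
North,2020,120,81.5
South,2020,95,
East,2020,140,64.0
West,2020,,72.5
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, turning empty cells into None."""
    reader = csv.DictReader(io.StringIO(text))
    return [{k: (v if v != "" else None) for k, v in row.items()} for row in reader]


def summarize(rows):
    """Source Data Description: per-column row, missing, and distinct counts."""
    summary = {}
    for col in rows[0].keys():
        values = [r[col] for r in rows]
        present = [v for v in values if v is not None]
        summary[col] = {
            "rows": len(values),
            "missing": len(values) - len(present),
            "distinct": len(set(present)),
        }
    return summary


def impute_median(rows, col):
    """Enhanced Analysis Data: fill missing numeric values with the column
    median and flag imputed rows so the change stays auditable."""
    present = [float(r[col]) for r in rows if r[col] is not None]
    median = statistics.median(present)
    enhanced = []
    for r in rows:
        new_row = dict(r)  # copy, so the source rows are left untouched
        new_row[col + "_imputed"] = r[col] is None
        new_row[col] = median if r[col] is None else float(r[col])
        enhanced.append(new_row)
    return enhanced


source_rows = load_rows(SOURCE_CSV)
source_summary = summarize(source_rows)
enhanced_rows = impute_median(source_rows, "clean_water_access_pct")

for column, stats in source_summary.items():
    print(column, stats)
```

Keeping source_rows unchanged and writing the prepared values to a separate enhanced_rows list mirrors the outline's distinction between the source data and the enhanced data, and the added flag column is one way to make the preparation steps easy to report.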