13  Authorship and Licensing Guidelines

Our goal is to consistently, accurately and transparently attribute the contribution of each contributor to papers, software, data products and other scientific outputs that our group leads or contributes to. We seek to encourage participation in manuscripts by interested parties, not limited to scientists, developers, and students who are directly involved in our projects. We hope to recognize each contributor who has made a significant and defined contribution.

Below we define group protocols and expectations for being acknowledged and acknowledging contributions to different types of content.

We generally follow the Contributor Roles Taxonomy (CRediT) to define contributions, but are open when collaborators use other approaches.

13.1 Publications

For publications like peer-reviewed journal articles, white papers, conference papers, etc. CCT Data Science members who are substantially involved should be listed as authors. Authorship and author order should be discussed with collaborators early and often (e.g. during the project plan phase of incubator projects). Involvement need not include writing (see CRediT and MeRIT frameworks) to be considered for authorship.

13.2 Software / Code

Our group uses Zenodo for archiving code we produce in collaboration with researchers. Code authorship and author order need not be the same as the authorship and author order for publications or data that the code contributes to. Have discussions early and often about authorship of code.

13.2.1 Zenodo + GitHub integration

The easiest way to archive code on Zenodo is to take advantage of the Zenodo–GitHub integration. If there is a CITATION.cff file in a GitHub repository, it will be used to populate the metadata on Zenodo. We recommend creating a CITATION.cff file with a tool such as cffinit or the cffr R package. Use semantic versioning in GitHub releases and in the corresponding version: field of CITATION.cff.

13.3 Data

Zenodo can also be used to archive data and it is an especially good option when code and data are in the same GitHub repository. Other options for archiving data include Dryad, UA ReDATA, and domain-specific data repositories. On Zenodo (possibly other repositories), there is a metadata field for contributor which is distinct from author. This field may be appropriate for crediting CCT Data Science members who contributed to data management but were not involved in creation of the data. Discuss this with collaborators early and often.

13.4 Other

For guidelines on recognizing our group on web apps/websites, presentations, handouts, and other products, see group branding guidelines.

13.5 Licenses

Our group uses the following licenses:


  1. Why Dryad uses CC-0: https://blog.datadryad.org/2011/10/05/why-does-dryad-use-cc0/↩︎