{"id":31684,"date":"2019-10-22T10:37:31","date_gmt":"2019-10-22T09:37:31","guid":{"rendered":"http:\/\/cupblog.bluefusesystems.com\/?p=31684"},"modified":"2020-06-24T09:50:16","modified_gmt":"2020-06-24T08:50:16","slug":"the-challenges-of-openness-transparency-and-impact-in-a-data-driven-world","status":"publish","type":"post","link":"https:\/\/www.cambridge.org\/core\/blog\/2019\/10\/22\/the-challenges-of-openness-transparency-and-impact-in-a-data-driven-world\/","title":{"rendered":"The challenges of openness, transparency and impact in a data-driven world"},"content":{"rendered":"<div id=\"bsf_rt_marker\"><\/div><p>Data science is a broad, interdisciplinary field being that in the UK is being shaped by the activities of the <a href=\"https:\/\/www.turing.ac.uk\/\">Turing Institute<\/a>. It encompasses: the technologies that are enabling the capture of an ever-greater volume of data; the techniques (including machine learning and other artificial intelligence applications) that can be used to analyse data and deploy insights for useful purposes; the challenges involved with data at unprecedented scale and from heterogeneous sources; and the ethical considerations of a more data-driven world.<\/p>\n<p>Two newly-launched OA journals from Cambridge University Press \u2013 <em><a href=\"https:\/\/www.cambridge.org\/core\/journals\/data-and-policy\/information\">Data &amp; Policy<\/a><\/em> and <em><a href=\"https:\/\/www.cambridge.org\/core\/journals\/data-centric-engineering\/information\">Data-Centric Engineering<\/a><\/em> are both exploring the transformative impact of data science in different domains. \u00a0Both journals are open for submissions (see the Call for Papers for <em><a href=\"https:\/\/www.cambridge.org\/core\/journals\/data-centric-engineering\/information\/call-for-papers-2019\">DCE<\/a><\/em> and <em><a href=\"https:\/\/www.cambridge.org\/core\/journals\/data-and-policy\/information\/call-for-papers-2019\">D&amp;P<\/a><\/em>). As publishing experiments they are trying to address similar challenges, outlined below.<\/p>\n<p><strong>Enabling OA for all authors<\/strong><\/p>\n<p>We\u2019ve engaged key partnering organisations for both journals \u2013 important for helping shape these emerging areas, and for supporting authors to publish on an OA basis.<\/p>\n<p><em>Data &amp; Policy <\/em>is a collaboration with the Data for Policy conference (<a href=\"dataforpolicy.org\">dataforpolicy.org<\/a>), which has successfully networked researchers, policy-makers and others interested in the implications of data-driven technology for policy, governance and the public sector. The conference and journal are partnered by UCL, the Alan Turing Institute and the UK Office for National Statistics (ONS). All three organisations have an interest in the way in which data science can advance the public interest: the ONS, for example, has a <a href=\"https:\/\/datasciencecampus.ons.gov.uk\/\">Data Science Campus<\/a> in Newport that investigates the use of new data sources for public good.<\/p>\n<p><em>Data-Centric Engineering<\/em> is supported by the <a href=\"https:\/\/www.lrfoundation.org.uk\/en\/\">Lloyd\u2019s Register Foundation<\/a> (LRF): a global charity that\u2019s been at the forefront of the potential of new data-driven technology for making engineering systems safer, more secure and resilient. See the <a href=\"https:\/\/www.lrfoundation.org.uk\/en\/publications\/foresight-review-of-big-data\/\">LRF\u2019s Foresight Review of Big Data<\/a>. The LRF is also a partner of the Turing Institute\u2019s <a href=\"https:\/\/www.turing.ac.uk\/research\/research-programmes\/data-centric-engineering\">data-centric engineering programme<\/a>.<\/p>\n<p>As a consequence of these partnerships, we are able to remove the financial barriers to publishing as well as the financial barriers to reading. Authors with access to grant and institutional funds are asked to pay an article processing charge (APC) if their article is accepted for publication, but those who do not have access to these kind funds are not required to pay.<\/p>\n<p><strong>Nuances of data access and transparency<\/strong><\/p>\n<p>We\u2019re interested in exploring the potential of other trends towards openness and collaboration in these new areas of research, including open data and code. But this also involves recognising that in areas of research collecting data at scale and from diverse sources, there can be valid ethical, security and commercial reasons why data cannot be made openly available.<\/p>\n<p>The Open Data Institute and the Lloyd\u2019s Register Foundation recently launched <a href=\"https:\/\/theodi.org\/article\/engineering-data-for-the-public-good-a-manifesto\/#1565084667757-5e97b880-e130\">a manifesto for sharing engineering data<\/a>, which has been signed by a number of organisations including Cambridge University Press. This advances the notion of data \u201cas infrastructure that powers our societies and economies\u201d and calls for engineering data to be as open as possible. But it also recognises that data needs to be stewarded \u2013 managed as an asset in society\u2019s interest \u2013 and that legal and ethical frameworks are needed to minimise any harmful impacts that data use can bring.<\/p>\n<p>There have been a number of initiatives to power research collaborations through improved access to public and private sector data, whilst recognising the need for safeguards.\u00a0 Networks such as <a href=\"https:\/\/www.adruk.org\/\">Administrative Data UK<\/a> have been important for making public sector data available for research purposes, whilst putting in place measures to de-identify data where needed. The concept of <a href=\"https:\/\/theodi.org\/article\/what-is-a-data-trust\/#1531153930162-87dda9b8-fb58\">data trusts<\/a> have also been an important development for improving access to public and private sector data for new technologies, whilst recognising the need to retain public trust through clear governance structures and oversight.<\/p>\n<p>For the new journals we\u2019ve launched, transparency is perhaps the most important concept. Three ways in which we are asking authors to think about, and disclose, essential information about their submitted papers are below.<\/p>\n<p><em>Data availability statements <\/em><\/p>\n<p>In line with the <a href=\"https:\/\/cos.io\/top\/\">Transparency and Openness Promotion<\/a> (TOP) Guidelines, we\u2019re asking all authors submitting articles to <em>Data &amp; Policy<\/em> and <em>Data-Centric Engineering<\/em> to provide a Data Availability Statement with their study on submission. As a general principle authors should make available the resources (data, code and other materials) to allow others to understand, verify and replicate findings. These resources should be placed in a trusted repository and cited in the statement. But in the event that data cannot be made available for valid ethical, security or privacy reasons this should be explained in the statement.<\/p>\n<p><em>Contributor roles<\/em><\/p>\n<p>On submission, authors are required to outline the kinds of contributions that authors (and non-authors) have made to the study, using the <a href=\"https:\/\/www.casrai.org\/credit.html\">CRediT taxonomy<\/a> as a framework. The goal of the taxonomy is to bring greater transparency to the contributions that individuals have made to a work \u2013 including giving credit to roles that are not traditionally recognised in terms of authorship, such as funding acquisition, data visualisation and validation, and software programming and development.<\/p>\n<p><em>Impact statements <\/em><\/p>\n<p>One of the goals of open research is of course to reach new audiences. Both <em>Data &amp; Policy<\/em> and <em>Data-Centric Engineering<\/em> are actively trying to cultivate readers and authors outside of academic institutions: in policy, civil society and industry. In order to \u2018translate\u2019 the significance of their research to a wider audience, we\u2019re asking authors to provide an impact statement (<em>DCE<\/em>) and policy significance statement (<em>D&amp;P<\/em>) that succinctly summarises the article\u2019s findings.<\/p>\n<p>Visit the information pages of <em><a href=\"https:\/\/www.cambridge.org\/core\/journals\/data-centric-engineering\">Data-Centric Engineering<\/a><\/em> and<em> <a href=\"https:\/\/www.cambridge.org\/core\/journals\/data-and-policy\/information\">Data &amp; Policy<\/a><\/em> for more details about the journals \u2013 or contact <a href=\"mailto:dce@cambridge.org\">dce@cambridge.org<\/a> and <a href=\"mailto:dataandpolicy@cambridge.org\">dataandpolicy@cambridge.org<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Data science is a broad, interdisciplinary field being that in the UK is being shaped by the activities of the Turing Institute. It encompasses: the technologies that are enabling the capture of an ever-greater volume of data; the techniques (including machine learning and other artificial intelligence applications) that can be used to analyse data and [&hellip;]<\/p>\n","protected":false},"author":57,"featured_media":31653,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2262,4955,7],"tags":[505,1057,7627,62,308,46,45,1709,6088],"coauthors":[6569],"class_list":["post-31684","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-engineering","category-open-research","category-social-sciences","tag-academic","tag-data","tag-dce","tag-engineering","tag-journals","tag-oa","tag-open-access","tag-policy","tag-science"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/posts\/31684","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/users\/57"}],"replies":[{"embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/comments?post=31684"}],"version-history":[{"count":0,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/posts\/31684\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/media\/31653"}],"wp:attachment":[{"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/media?parent=31684"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/categories?post=31684"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/tags?post=31684"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.cambridge.org\/core\/blog\/wp-json\/wp\/v2\/coauthors?post=31684"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}