Hostname: page-component-89b8bd64d-ktprf Total loading time: 0 Render date: 2026-05-06T19:46:30.130Z Has data issue: false hasContentIssue false

Embedding data science innovations in organizations: a new workflow approach

Published online by Cambridge University Press:  03 November 2023

Keyao Li*
Affiliation:
Future of Work Institute, Curtin University, Perth, WA, Australia
Mark A. Griffin
Affiliation:
Future of Work Institute, Curtin University, Perth, WA, Australia
Tamryn Barker
Affiliation:
CORE Skills, Perth, WA, Australia
Zane Prickett
Affiliation:
CORE Skills, Perth, WA, Australia
Melinda R. Hodkiewicz
Affiliation:
School of Engineering, University of Western Australia, Perth, WA, Australia
Jess Kozman
Affiliation:
Katalyst Data Management, Perth, WA, Australia
Peta Chirgwin
Affiliation:
Chameleon Mettle Group, Perth, WA, Australia
*
Corresponding author: Keyao Li; Email: Keyao.li@curtin.edu.au

Abstract

There have been consistent calls for more research on managing teams and embedding processes in data science innovations. Widely used frameworks (e.g., the cross-industry standard process for data mining) provide a standardized approach to data science but are limited in features such as role clarity, skills, and cross-team collaboration that are essential for developing organizational capabilities in data science. In this study, we introduce a data workflow method (DWM) as a new approach to break organizational silos and create a multi-disciplinary team to develop, implement and embed data science. Different from current data science process workflows, the DWM is managed at the system level that shapes business operating model for continuous improvement, rather than as a function of a particular project, one single business unit, or isolated individuals. To further operationalize the DWM approach, we investigated an embedded data workflow at a mining operation that has been using geological data in a machine-learning model to stabilize daily mill production for the last 2 years. Based on the findings in this study, we propose that the DWM approach derives its capability from three aspects: (a) a systemic data workflow; (b) multi-disciplinary networks of collaboration and responsibility; and (c) clearly identified data roles and the associated skills and expertise. This study suggests a whole-of-organization approach and pathway to develop data science capability.

Information

Type
Translational Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NC
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial licence (http://creativecommons.org/licenses/by-nc/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Table 1. Data science process workflows

Figure 1

Figure 1. Data workflow method.

Figure 2

Figure 2. The development timeline of the machine learning workflow.

Figure 3

Table 2. Profile of the stakeholders in the case study

Figure 4

Table 3. Data structure of the stages in the data workflow

Figure 5

Figure 3. The process workflow of implementing a predictive machine learning model in an Australian mining company.

Figure 6

Table 4. Data roles, knowledge, skills, and abilities (KSAs) and demonstrated behaviors

Submit a response

Comments

No Comments have been published for this article.