Data-Driven Strategies for Project Pricing and Resource Allocation in Software Development

25 March 2026, Version 1

Abstract

B13.ai are a software development company specialising in developing bespoke solutions to clients’ needs. Key to estimating the cost of a software development project is forecasting the time required to deliver the project and address defects/improvements. We perform a data-driven analysis of this task, investigating the accuracy of B13.ai’s current (typically, linear) rubrics. We: • Perform predictive modelling to investigate hours worked by developers vs. other roles. We show that a linear model is sometimes adequate but often inadequate. • Use PCA to understand the roles which most influence total hours, finding that these are the Dev, Senior Dev, and QC roles. • Perform a life-cycle analysis to elucidate phases of projects and the involvements for each role in each phase. • Study the time spent on defects/improvements vs. development, finding that the former has a different skew and kurtosis to the latter, so the times cannot be linearly translated.

Content

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.