The present application provides a scalable system for managing requests for compute resources using a cloud computing architecture. The system estimates the total processing time of each computation in advance and monitors the progress of each computation to provide a more accurate estimate of remaining processing time. In this way, a determination may be made as each new computation request is received as to whether an additional resource is required or whether an existing resource would be suitable.