Financial solutions
Results
There are many possible Virtual Machine (VM) types or flavours provided by the various Cloud Providers. Properly choosing the most optimal flavour for specific task requires additional calculation. It is done automatically by MELODIC. The list of the most suitable VM flavours for AI Investments is depicted in table below:
VM type | VCPUs | CPU RAM | GPU | GPU RAM | Price/hour |
---|---|---|---|---|---|
Amazon Web Services – EU: Ireland – OS: Linux | |||||
p2.xlarge | 4 | 61 | NVIDIA K80 | – | 0.972$ |
g3.4xlarge | 16 | 122 | NVIDIA Tesla M60 | 8 | 1.210$ |
Azure—West Europe—OS: Ubuntu Advantage Standard | |||||
Nd6 | 6 | 122 | NVIDIA Tesla P40 | 24 | 2.287EUR |
Google Compute Engine—EU: Netherlands | |||||
nl-standard-8 | 8 | 30 | NVIDIA Tesla T4 | – | 1.3684$ |
In the table the analysis of the different infrastructure scenarios for AI Invest-ments is presented. Each row contains one scenario. The private infrastructure gives the number of servers each hosting two GPUs providing a certain GPU time for training the models. According to the above cost assumptions the cost of this private infrastructure is calculated. If the number of GPUs provided by the on-premise servers is less than what is needed for the average 10 models to be trained, the addi tional GPUs must be sourced from a Cloud base infrastructure. Finally, additional GPU resources must be acquired form a Cloud burst infrastructure when the burst of 50 models must be trained, i.e. 40 models more than normally handled by the private infrastructure. Finally, the table also shows the total monthly cost of all servers in the given scenario adding together the cost of the private infrastructure, the Cloud base and the Cloud burst servers. Assuming the above maintenance cost, the TCO over a three years’ period is computed. The cost is minimal for the cases of having 5 private servers, i.e. not being continuously dependent on using Cloud servers, and only use the Cloud to manage the bursts.
VM type | VCPUs | CPU RAM | Total monthly cost | TCO over 3 years | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Servers | GPU | Time | Cost | GPU | Time | Cost | GPU | Time | Cost | ||
1 | 2 | 1440 | 181 | 8 | 5760 | 5599 | 40 | 1133 | 1101 | 6880 | 247695 |
2 | 4 | 2880 | 361 | 6 | 4320 | 4199 | 40 | 1133 | 1101 | 5661 | 203807 |
3 | 6 | 4320 | 542 | 4 | 2880 | 2799 | 40 | 1133 | 1101 | 4442 | 159918 |
4 | 8 | 5760 | 722 | 2 | 1440 | 1400 | 40 | 1133 | 1101 | 3223 | 116030 |
5 | 10 | 7200 | 903 | 0 | 0 | 0 | 40 | 1133 | 1101 | 2004 | 72141 |
6 | 12 | 8640 | 1083 | 0 | 0 | 0 | 38 | 1076 | 1046 | 2129 | 76659 |
7 | 14 | 10080 | 1264 | 0 | 0 | 0 | 36 | 1020 | 991 | 2255 | 81177 |
8 | 16 | 11520 | 1444 | 0 | 0 | 0 | 34 | 963 | 936 | 2380 | 85695 |
9 | 18 | 12960 | 1625 | 0 | 0 | 0 | 32 | 906 | 881 | 2506 | 90213 |
10 | 20 | 14499 | 1806 | 0 | 0 | 0 | 30 | 850 | 826 | 2631 | 94731 |
25 | 50 | 36000 | 4514 | 0 | 0 | 0 | 0 | 0 | 0 | 4514 | 162500 |