A lot of Web apps perform inference of deep neural network (DNN) models within Web browsers to provide intelligent services for their users. Typically, GPU acceleration is required during DNN ...