momentumContrastiveLoss(k, W, q, queue, device, tau=1)
Compute the contrastive loss of the network from the current key (k), the query (q),
and the overall queue of keys (queue).
The computation follows Algorithm 1 of the MoCo paper:
https://arxiv.org/pdf/1911.05722.pdf
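
As a rough illustration of the loss step in Algorithm 1, the sketch below computes the InfoNCE loss in plain Python, without any tensor library. It is not the body of momentumContrastiveLoss: the name info_nce_loss is hypothetical, the W and device arguments are left out because their role is not described here, and tau is assumed to be the softmax temperature. Features are assumed to be plain lists of floats, with k[i] the positive key for q[i] and every entry of queue treated as a negative.

```python
import math


def info_nce_loss(q, k, queue, tau=1.0):
    """Mean InfoNCE loss over a batch (hypothetical helper, illustration only).

    q, k:  lists of N feature vectors of length C; k[i] is the positive for q[i].
    queue: list of K key vectors of length C, all used as negatives.
    tau:   temperature (assumed meaning of the tau argument).
    """
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    losses = []
    for q_i, k_i in zip(q, k):
        # Positive logit goes first, followed by one logit per queued key.
        logits = [dot(q_i, k_i) / tau] + [dot(q_i, n) / tau for n in queue]
        # Cross-entropy with target class 0, via log-sum-exp for stability.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
        losses.append(log_sum_exp - logits[0])
    return sum(losses) / len(losses)


# Usage: one query, its matching key, and a single orthogonal negative.
loss = info_nce_loss(q=[[1.0, 0.0]], k=[[1.0, 0.0]], queue=[[0.0, 1.0]], tau=1.0)
```

Placing the positive logit at index 0 is what lets the loss reduce to a standard cross-entropy with an all-zeros label vector, which is how Algorithm 1 expresses it.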