This experiment (#2) used the following hyperparameters:
        - Model: BERT
        - Embedding Operator: l2
        - Normalization: l1
        - Normalization2: none
        - Softmax enabled: False
        - CUDA enabled: True
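
For reproducibility, the settings listed above could be captured in a small typed record. The following is only a sketch: the class name `ExperimentConfig`, the constant `EXPERIMENT_2`, and all field names are hypothetical and are not taken from the code that produced this log. It records the values as reported and does not assume how the "l2" operator or the "l1" normalization are implemented.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass(frozen=True)
class ExperimentConfig:
    """Hypothetical container for the hyperparameters reported in the log."""
    experiment_id: int
    model: str
    embedding_operator: str
    normalization: Optional[str]    # None stands for "none" in the log
    normalization2: Optional[str]   # None stands for "none" in the log
    softmax_enabled: bool
    cuda_enabled: bool


# Values copied from the log for experiment #2.
EXPERIMENT_2 = ExperimentConfig(
    experiment_id=2,
    model="BERT",
    embedding_operator="l2",
    normalization="l1",
    normalization2=None,
    softmax_enabled=False,
    cuda_enabled=True,
)

if __name__ == "__main__":
    # Print one "name: value" line per setting.
    for name, value in asdict(EXPERIMENT_2).items():
        print(f"{name}: {value}")
```

Making the record frozen means the settings cannot be changed by accident after the run starts, and `asdict` gives a plain dictionary that can be written next to the results.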
    