Currently, we test with BERT_base, but we may as well use the smallest available model.