Run evaluation of a fine-tuned model on a dataset
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
adapter, merged, base conversation, qa, summarization, translation, classification, general rouge, bertscore, accuracy, exact_match, bleu, meteor, recall, precision, f1