Process a dataset into ChatML format
The processing converts the dataset to ChatML format using the provided configuration. The configuration can include field mappings that specify either direct column mappings or template strings with column references.
Example field mappings:
{
"system_field": {"type": "template", "value": "You are a helpful assistant."},
"user_field": {"type": "column", "value": "question"},
"assistant_field": {"type": "template", "value": "Answer: {answer}"}
}
upload, huggingface Specifies the preprocessing mode to format the dataset for a specific fine-tuning task.
language_modeling, prompt_only, preference Successful Response
upload, huggingface text, vision