-
Set
type
tolocal
. -
Use
source
to provide the path to the input directory.
-
Set
type
tohuggingface
. -
Use
source
to specify the dataset name from the Hugging Face hub. -
Use
split
to specify the dataset split.
load_dataset
API.
When calling the API, parameters are passed as keyword arguments and they must conform to the specifications outlined by HuggingFace. Refer to the
load_dataset
documentation here.Config Examples
Use the tabs to view examples:View example configs for various use cases here.