For local data:
- Set `type` to `local`.
- Use `source` to provide the path to the input directory.

For Hugging Face data:
- Set `type` to `huggingface`.
- Use `source` to specify the dataset name from the Hugging Face hub.
- Use `split` to specify the dataset split.

Hugging Face datasets are loaded through the `load_dataset` API. When calling the API, parameters are passed as keyword arguments, and they must conform to the specifications outlined by Hugging Face. Refer to the Hugging Face `load_dataset` documentation for details.
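
To make the two cases concrete, here is a minimal Python sketch of how a data configuration with `type`, `source`, and `split` might be turned into either a local input path or keyword arguments for `load_dataset`. The helper names `resolve_data_source` and `load_from_hub` and the shape of the returned dictionary are assumptions made for illustration; they are not part of the documented tooling. The only library call used is `datasets.load_dataset`, whose first parameter is `path`.

```python
from typing import Any, Dict


def resolve_data_source(data_cfg: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: interpret a data config by its `type`.

    Illustrative only; the real preprocessing pipeline may work differently.
    """
    cfg = dict(data_cfg)  # copy so the caller's dict is not mutated
    data_type = cfg.pop("type")
    source = cfg.pop("source")

    if data_type == "local":
        # `source` is the path to the input directory.
        return {"kind": "local", "input_dir": source}

    if data_type == "huggingface":
        # `source` is the dataset name on the Hugging Face hub. Everything
        # else (for example `split`) is forwarded as a keyword argument,
        # so it must be a parameter that `load_dataset` accepts.
        kwargs = {"path": source, **cfg}
        return {"kind": "huggingface", "load_dataset_kwargs": kwargs}

    raise ValueError(f"unsupported data type: {data_type!r}")


def load_from_hub(data_cfg: Dict[str, Any]):
    """Hypothetical helper: call `load_dataset` with the resolved kwargs."""
    # Imported lazily; requires the Hugging Face `datasets` package.
    from datasets import load_dataset

    resolved = resolve_data_source(data_cfg)
    if resolved["kind"] != "huggingface":
        raise ValueError("load_from_hub expects type 'huggingface'")
    return load_dataset(**resolved["load_dataset_kwargs"])
```

For example, `resolve_data_source({"type": "huggingface", "source": "squad", "split": "train"})` yields keyword arguments equivalent to calling `load_dataset(path="squad", split="train")`. The dataset name `squad` is only a placeholder here.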