SFT
Start Job
Starts an SFT job.
POST
Authorizations
Body
application/json
Examples to use in the SFT tuning process. The data is split 90/10 into training and evaluation sets.
An example for training or evaluation
The scoring spec to use in the SFT tuning process
The base model to start the SFT tuning process.
Available options: LLAMA_3.2_3B, LLAMA_3.1_8B
SFT learning rate
Example:
0.0002
The LoRA configuration.
Number of SFT training epochs. Must be <= 10.
Example:
10
A custom system prompt to use during the SFT tuning process
Example:
"An optional system prompt."
Response
200
application/json
Successful Response
SftStatus is the status of an SFT job.
Detailed status of the job
Example:
["Downloading model", "Tuning prompt"]
The job id
Example:
"1234abcd"
Current state of the job
Available options: QUEUED, RUNNING, DONE, ERROR, CANCELLED
A list of trained models selected based on the PI score.
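A client that polls a job usually needs to decide whether the job is still in progress. The sketch below classifies the states listed above. Treating DONE, ERROR, and CANCELLED as terminal is an assumption inferred from their names. The `state`, `job_id`, and `detailed_status` keys and the `is_finished` helper are hypothetical, since this page does not show the response's field names.

```python
from enum import Enum


class SftState(str, Enum):
    """Job states listed on this page."""
    QUEUED = "QUEUED"
    RUNNING = "RUNNING"
    DONE = "DONE"
    ERROR = "ERROR"
    CANCELLED = "CANCELLED"


# Assumption: these states mean the job will make no further progress.
TERMINAL_STATES = {SftState.DONE, SftState.ERROR, SftState.CANCELLED}


def is_finished(status: dict) -> bool:
    """Return True if the (hypothetical) 'state' field holds a terminal state."""
    return SftState(status["state"]) in TERMINAL_STATES


# Sample built from the example values on this page; key names are assumed.
sample = {
    "job_id": "1234abcd",
    "state": "RUNNING",
    "detailed_status": ["Downloading model", "Tuning prompt"],
}
print(is_finished(sample))
```

Parsing the state through the enum means an unrecognized value raises `ValueError` rather than being silently treated as still running.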