axolotl is a fine tuning framework for fine-tuning LLM type models
Alpaca Instruction Format
Documented here - you provide a table with columns instruction
, input
and output
. These are formatted into a prompt that is fed to the model like so:
Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.
### Instruction:
{instruction}
### Input:
{input}
### Response:
After this the generated response is compared with the output
and a signal generated for backprop.
flash_attn issues
Packaging
on initial install python package packaging
may be missing so pip install packaging
before installing requirements.txt
Missing Symbols
- Encountered an issue where
flash_attn
couldn’t load due to missing symbols in the c library. Runningpip install --upgrade flash_attn
seems to fix this.
AttributeError: ‘torch.dtype’ object has no attribute ‘itemsize’
- Issue caused by incompatibility between transformers and torch solved by downgrading transformers to
v4.38.2
as per this thread