Documentation for LoRAConfig. #2212

brynhayder · 2024-11-12T16:06:40Z

Lines 112 to 113 in 162d7e5

    
                       initialization scaled by the LoRA rank for linear and layers. Setting the initialization to False leads to 
        
                       completely random initialization and is discouraged. Pass `'loftq'` to use LoftQ initialization. Pass

Documentation for False is not clear. Presumably 'completely random' means the arrays will be uninitialized and hence contain whatever the contents of the relevant memory locations are?

The text was updated successfully, but these errors were encountered:

BenjaminBossan · 2024-11-12T22:44:45Z

To explain further: The default implementation initializes the LoRA A parameter randomly and the LoRA B parameter to zeros. This results in LoRA being an identity transform at initialization, which can help with training. When setting init_lora_weights=False, the LoRA B weight is instead also randomly initialized, resulting in a non-identity transform.

For real LoRA training, you almost never want that, which is why we discourage it. However, the weights are not initialized as random memory as in torch.empty, which seems to be what you suspected.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Documentation for LoRAConfig. #2212

Documentation for LoRAConfig. #2212

brynhayder commented Nov 12, 2024 •

edited

Loading

BenjaminBossan commented Nov 12, 2024

Documentation for LoRAConfig. #2212

Documentation for LoRAConfig. #2212

Comments

brynhayder commented Nov 12, 2024 • edited Loading

BenjaminBossan commented Nov 12, 2024

brynhayder commented Nov 12, 2024 •

edited

Loading