mindformers.pet.models.LoraModel
- class mindformers.pet.models.LoraModel(config: LoraConfig, base_model: PreTrainedModel)[source]
LoRA model for LLM. Provide a flexible and efficient way to adjust and optimize pre-trained models by adding LoRA structures to the base pre-trained models.
- Parameters
config (LoraConfig) – Pet config, defines Parameter-Efficient Tuning (Pet) algorithm.
base_model (PreTrainedModel) – Pre-trained model for tuning.
- Inputs:
*inputs (Tensor) - Input arguments of original base model.
- Outputs:
Outputs of original base model.
Examples
>>> import mindspore as ms >>> from mindformers.pet import LoraModel, LoraConfig >>> from mindformers.models import LlamaConfig, LlamaForCausalLM >>> ms.set_context(mode=0) >>> config = LlamaConfig(num_layers=2) >>> lora_config = LoraConfig(target_modules='.*wq|.*wk|.*wv|.*wo') >>> model = LlamaForCausalLM(config) >>> lora_model = LoraModel(lora_config,model) >>> print(lora_model.lora_model) LlamaForCausalLM< (model): LlamaModel< (freqs_mgr): FreqsMgr<> (casual_mask): LowerTriangularMaskWithDynamic<> (tok_embeddings): LlamaEmbedding<> (layers): CellList< (0): LLamaDecodeLayer< (ffn_norm): LlamaRMSNorm<> (attention_norm): LlamaRMSNorm<> (attention): LLamaAttention< (wq): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (wk): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (wv): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (wo): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (apply_rotary_emb): RotaryEmbedding<> > (feed_forward): LlamaFeedForward< (w1): Linear< (activation): LlamaSiLU<> > (w2): Linear<> (w3): Linear<> > > (1): LLamaDecodeLayer< (ffn_norm): LlamaRMSNorm<> (attention_norm): LlamaRMSNorm<> (attention): LLamaAttention< (wq): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (wk): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (wv): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (wo): LoRADense< input_channels=4096, output_channels=4096 (lora_dropout): Dropout<p=0.01> > (apply_rotary_emb): RotaryEmbedding<> > (feed_forward): LlamaFeedForward< (w1): Linear< (activation): LlamaSiLU<> > (w2): Linear<> (w3): Linear<> > > > (norm_out): LlamaRMSNorm<> > (lm_head): Linear<> (loss): CrossEntropyLoss< (_log_softmax): _LogSoftmax<> (_nllloss): _NLLLoss<> > >