mindformers.models.LlamaForCausalLM

class mindformers.models.LlamaForCausalLM(config: LlamaConfig = None)[source]

Provides Llama training loss or logits through the network.

Parameters

config (LlamaConfig, optional) – The config of the Llama model. Default: None.

Inputs:
  • input_ids (Tensor) - the indices of input sequence tokens in the vocabulary with data type Int64/Int32, Tensor of shape \((batch, seq\_length)\).

  • labels (Tensor, optional) - the labels of inputs with data type Int64/Int32, Tensor of shape \((batch, seq\_length)\) . Default: None.

  • input_position (Tensor, optional) - the position ids of the inputs in incremental inference mode, which is an increasing sequence with data type Int64/Int32, Tensor of shape \((batch, seq\_length)\). Default: None.

  • position_ids (Tensor, optional) - the position ids of the inputs, which is an increasing sequence with data type Int64/Int32, Tensor of shape \((batch, seq\_length)\). Default: None.

  • attention_mask (Tensor, optional) - the padding mask of the input sentences, where 0 indicates a padding position, with data type Int64/Int32, Tensor of shape \((batch, seq\_length)\). Default: None.

  • input_embeds (Tensor, optional) - the embedding of inputs with data type Float32/Float16, Tensor of shape \((batch, seq\_length, hidden\_size)\). Default: None.

  • init_reset (Tensor, optional) - a Bool Tensor of shape \((1)\), used to clear the past key and past value parameters in incremental prediction. Only valid when use_past is True. Default: Tensor([True]).

  • batch_valid_length (Tensor, optional) - an Int32 Tensor of shape \((batch\_size)\), recording, for each sequence, the length that has already been computed. Used for incremental prediction when use_past is True. Default: None.

  • batch_index (Tensor, optional) - Deprecated argument. Will be removed in the future. Default: None.

  • zactivate_len (Tensor, optional) - Deprecated argument. Will be removed in the future. Default: None.

  • block_tables (Tensor, optional) - Int64 type Tensor, storing the mapping tables for each sequence. Default: None.

  • slot_mapping (Tensor, optional) - Int32 type Tensor, storing the physical slot indices of the token cache. Default: None.

  • prefix_keys_values (Tensor, optional) - Deprecated argument. Will be removed in the future. Default: None.

  • llm_boost_inputs (Tensor, optional) - Deprecated argument. Will be removed in the future. Default: None.

  • q_seq_lens (Tensor, optional) - In parallel decoding, the query may be flattened. The Paged Attention operator needs q_seq_lens to obtain the length information. Default: None.

  • loss_mask (Tensor, optional) - Float32/Int32 type Tensor, used to determine whether the corresponding token position participates in the loss calculation: if the value is 1, the loss at that position is calculated; if it is 0, it is not. Default: None.

  • gather_index (Tensor, optional) - Int32 type Tensor, used to obtain the last latent vector of each sequence. Default: None.

  • seq_range (Tensor, optional) - Int32 type Tensor, used to obtain the mask and positional encoding of the valid tokens for each sequence. Default: None.

Outputs:

Tensor. In training mode, the output Tensor contains the loss; in prediction mode, it contains the logits; in evaluation mode, it contains the logits, tokens, and input masks.
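
A minimal sketch of a forward call exercising these inputs and outputs. The tiny configuration values and random token ids below are illustrative assumptions, not part of the official example, and the call assumes a backend on which the model's operators are supported:

>>> import numpy as np
>>> import mindspore as ms
>>> from mindformers.models.llama import LlamaConfig, LlamaForCausalLM
>>> ms.set_context(mode=0)
>>> # Deliberately tiny, illustrative config; real checkpoints use much larger values.
>>> config = LlamaConfig(batch_size=1, seq_length=16, num_layers=2,
...                      hidden_size=64, num_heads=4, vocab_size=32000)
>>> network = LlamaForCausalLM(config=config)
>>> # Random token ids of shape (batch, seq_length); use tokenizer output in practice.
>>> input_ids = ms.Tensor(np.random.randint(0, config.vocab_size, (1, 16)), ms.int32)
>>> network.set_train(True)
>>> loss = network(input_ids)        # training mode: loss
>>> network.set_train(False)
>>> outputs = network(input_ids)     # evaluation mode: logits, tokens, input mask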

Examples

>>> from mindformers.models.llama import LlamaConfig, LlamaForCausalLM
>>> import mindspore as ms
>>> ms.set_context(mode=0)
>>> config = LlamaConfig(batch_size=2)
>>> network = LlamaForCausalLM(config=config)
>>> type(network)
<class 'mindformers.models.llama.llama.LlamaForCausalLM'>
>>> from mindformers import LlamaForCausalLM
>>> network = LlamaForCausalLM.from_pretrained('llama2_7b')
>>> type(network)
<class 'mindformers.models.llama.llama.LlamaForCausalLM'>
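
As a follow-up, a typical text-generation flow pairs the model with its tokenizer. This is a hedged sketch rather than part of the official example; it assumes the standard mindformers from_pretrained and generate interfaces and a locally available 'llama2_7b' checkpoint:

>>> from mindformers import LlamaForCausalLM, LlamaTokenizer
>>> tokenizer = LlamaTokenizer.from_pretrained('llama2_7b')
>>> model = LlamaForCausalLM.from_pretrained('llama2_7b')
>>> model.set_train(False)
>>> # Tokenize a prompt and decode the generated ids back to text.
>>> input_ids = tokenizer("I love Beijing, because")["input_ids"]
>>> output_ids = model.generate(input_ids, max_length=32)
>>> print(tokenizer.decode(output_ids[0]))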