# Models

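The following table lists the models supported by MindFormers, along with each model's parameter specifications, its model type, and the latest MindFormers version in which it is available.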

| Model                                                                                                   | Specifications                |    Model Type    |     Latest Version     |
|:--------------------------------------------------------------------------------------------------------|:------------------------------|:----------------:|:----------------------:|
| [CodeLlama](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/codellama.md)             | 34B                           |    Dense LLM     | In-development version |
| [CogVLM2-Image](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/cogvlm2_image.md)     | 19B                           |        MM        | In-development version |
| [CogVLM2-Video](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/cogvlm2_video.md)     | 13B                           |        MM        | In-development version |
| [DeepSeek-V3](https://gitee.com/mindspore/mindformers/tree/dev/research/deepseek3)                      | 671B                          |    Sparse LLM    | In-development version |
| [DeepSeek-V2](https://gitee.com/mindspore/mindformers/tree/dev/research/deepseek2)                      | 236B                          |    Sparse LLM    | In-development version |
| [DeepSeek-Coder-V1.5](https://gitee.com/mindspore/mindformers/tree/dev/research/deepseek1_5)            | 7B                            |    Dense LLM     | In-development version |
| [DeepSeek-Coder](https://gitee.com/mindspore/mindformers/tree/dev/research/deepseek)                    | 33B                           |    Dense LLM     | In-development version |
| [GLM4](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/glm4.md)                       | 9B                            |    Dense LLM     | In-development version |
| [GLM3-32K](https://gitee.com/mindspore/mindformers/tree/dev/research/glm32k)                            | 6B                            |    Dense LLM     | In-development version |
| [GLM3](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/glm3.md)                       | 6B                            |    Dense LLM     | In-development version |
| [InternLM2](https://gitee.com/mindspore/mindformers/tree/dev/research/internlm2)                        | 7B/20B                        |    Dense LLM     | In-development version |
| [Llama3.1](https://gitee.com/mindspore/mindformers/tree/dev/research/llama3_1)                          | 8B/70B                        |    Dense LLM     | In-development version |
| [Llama3](https://gitee.com/mindspore/mindformers/tree/dev/research/llama3)                              | 8B/70B                        |    Dense LLM     | In-development version |
| [Llama2](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/llama2.md)                   | 7B/13B/70B                    |    Dense LLM     | In-development version |
| [Mixtral](https://gitee.com/mindspore/mindformers/tree/dev/research/mixtral)                            | 8x7B                          |    Sparse LLM    | In-development version |
| [Qwen2](https://gitee.com/mindspore/mindformers/tree/dev/research/qwen2)                                | 0.5B/1.5B/7B/57B/57B-A14B/72B | Dense/Sparse LLM | In-development version |
| [Qwen1.5](https://gitee.com/mindspore/mindformers/tree/dev/research/qwen1_5)                            | 7B/14B/72B                    |    Dense LLM     | In-development version |
| [Qwen-VL](https://gitee.com/mindspore/mindformers/tree/dev/research/qwenvl)                             | 9.6B                          |        MM        | In-development version |
| [Whisper](https://gitee.com/mindspore/mindformers/blob/dev/docs/model_cards/whisper.md)                 | 1.5B                          |        MM        | In-development version |
| [Yi](https://gitee.com/mindspore/mindformers/tree/dev/research/yi)                                      | 6B/34B                        |    Dense LLM     | In-development version |
| [Baichuan2](https://gitee.com/mindspore/mindformers/blob/r1.3.0/research/baichuan2/baichuan2.md)        | 7B/13B                        |    Dense LLM     |         1.3.2          |
| [GLM2](https://gitee.com/mindspore/mindformers/blob/r1.3.0/docs/model_cards/glm2.md)                    | 6B                            |    Dense LLM     |         1.3.2          |
| [GPT2](https://gitee.com/mindspore/mindformers/blob/r1.3.0/docs/model_cards/gpt2.md)                    | 124M/13B                      |    Dense LLM     |         1.3.2          |
| [InternLM](https://gitee.com/mindspore/mindformers/blob/r1.3.0/research/internlm/internlm.md)           | 7B/20B                        |    Dense LLM     |         1.3.2          |
| [Qwen](https://gitee.com/mindspore/mindformers/blob/r1.3.0/research/qwen/qwen.md)                       | 7B/14B                        |    Dense LLM     |         1.3.2          |
| [CodeGeex2](https://gitee.com/mindspore/mindformers/blob/r1.1.0/docs/model_cards/codegeex2.md)          | 6B                            |    Dense LLM     |         1.1.0          |
| [WizardCoder](https://gitee.com/mindspore/mindformers/blob/r1.1.0/research/wizardcoder/wizardcoder.md)  | 15B                           |    Dense LLM     |         1.1.0          |
| [Baichuan](https://gitee.com/mindspore/mindformers/blob/r1.0/research/baichuan/baichuan.md)             | 7B/13B                        |    Dense LLM     |          1.0           |
| [Blip2](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/blip2.md)                    | 8.1B                          |        MM        |          1.0           |
| [Bloom](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/bloom.md)                    | 560M/7.1B/65B/176B            |    Dense LLM     |          1.0           |
| [Clip](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/clip.md)                      | 149M/428M                     |        MM        |          1.0           |
| [CodeGeex](https://gitee.com/mindspore/mindformers/blob/r1.0/research/codegeex/codegeex.md)             | 13B                           |    Dense LLM     |          1.0           |
| [GLM](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/glm.md)                        | 6B                            |    Dense LLM     |          1.0           |
| [iFlytekSpark](https://gitee.com/mindspore/mindformers/blob/r1.0/research/iflytekspark/iflytekspark.md) | 13B                           |    Dense LLM     |          1.0           |
| [Llama](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/llama.md)                    | 7B/13B                        |    Dense LLM     |          1.0           |
| [MAE](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/mae.md)                        | 86M                           |        MM        |          1.0           |
| [Mengzi3](https://gitee.com/mindspore/mindformers/blob/r1.0/research/mengzi3/mengzi3.md)                | 13B                           |    Dense LLM     |          1.0           |
| [PanguAlpha](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/pangualpha.md)          | 2.6B/13B                      |    Dense LLM     |          1.0           |
| [SAM](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/sam.md)                        | 91M/308M/636M                 |        MM        |          1.0           |
| [Skywork](https://gitee.com/mindspore/mindformers/blob/r1.0/research/skywork/skywork.md)                | 13B                           |    Dense LLM     |          1.0           |
| [Swin](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/swin.md)                      | 88M                           |        MM        |          1.0           |
| [T5](https://gitee.com/mindspore/mindformers/blob/r1.0/docs/model_cards/t5.md)                          | 14M/60M                       |    Dense LLM     |          1.0           |
| [VisualGLM](https://gitee.com/mindspore/mindformers/blob/r1.0/research/visualglm/visualglm.md)          | 6B                            |        MM        |          1.0           |
| [Ziya](https://gitee.com/mindspore/mindformers/blob/r1.0/research/ziya/ziya.md)                         | 13B                           |    Dense LLM     |          1.0           |
| [Bert](https://gitee.com/mindspore/mindformers/blob/r0.8/docs/model_cards/bert.md)                      | 4M/110M                       |    Dense LLM     |          0.8           |

* ***LLM:*** *Large Language Model;* ***MM:*** *Multi-Modal;* ***Sparse LLM:*** *Mixture-of-Experts (MoE) model, as opposed to a Dense LLM*