初始化项目,由ModelHub XC社区提供模型

Model: clowman/Llama-3.2-3B-Instruct-GPTQ-Int8
Source: Original Platform
This commit is contained in:
ModelHub XC
2026-05-12 14:31:39 +08:00
commit 0b8046e604
13 changed files with 2876 additions and 0 deletions

36
.gitattributes vendored Normal file
View File

@@ -0,0 +1,36 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
tokenizer.json filter=lfs diff=lfs merge=lfs -text

326
README.md Normal file

File diff suppressed because one or more lines are too long

52
USE_POLICY.md Normal file
View File

@@ -0,0 +1,52 @@
**Llama 3.2** **Acceptable Use Policy**
Meta is committed to promoting safe and fair use of its tools and features, including Llama 3.2. If you access or use Llama 3.2, you agree to this Acceptable Use Policy (“**Policy**”). The most recent copy of this policy can be found at [https://www.llama.com/llama3_2/use-policy](https://www.llama.com/llama3_2/use-policy).
**Prohibited Uses**
We want everyone to use Llama 3.2 safely and responsibly. You agree you will not use, or allow others to use, Llama 3.2 to:
1. Violate the law or others rights, including to:
1. Engage in, promote, generate, contribute to, encourage, plan, incite, or further illegal or unlawful activity or content, such as:
1. Violence or terrorism
2. Exploitation or harm to children, including the solicitation, creation, acquisition, or dissemination of child exploitative content or failure to report Child Sexual Abuse Material
3. Human trafficking, exploitation, and sexual violence
4. The illegal distribution of information or materials to minors, including obscene materials, or failure to employ legally required age-gating in connection with such information or materials.
5. Sexual solicitation
6. Any other criminal activity
1. Engage in, promote, incite, or facilitate the harassment, abuse, threatening, or bullying of individuals or groups of individuals
2. Engage in, promote, incite, or facilitate discrimination or other unlawful or harmful conduct in the provision of employment, employment benefits, credit, housing, other economic benefits, or other essential goods and services
3. Engage in the unauthorized or unlicensed practice of any profession including, but not limited to, financial, legal, medical/health, or related professional practices
4. Collect, process, disclose, generate, or infer private or sensitive information about individuals, including information about individuals identity, health, or demographic information, unless you have obtained the right to do so in accordance with applicable law
5. Engage in or facilitate any action or generate any content that infringes, misappropriates, or otherwise violates any third-party rights, including the outputs or results of any products or services using the Llama Materials
6. Create, generate, or facilitate the creation of malicious code, malware, computer viruses or do anything else that could disable, overburden, interfere with or impair the proper working, integrity, operation or appearance of a website or computer system
7. Engage in any action, or facilitate any action, to intentionally circumvent or remove usage restrictions or other safety measures, or to enable functionality disabled by Meta 
2. Engage in, promote, incite, facilitate, or assist in the planning or development of activities that present a risk of death or bodily harm to individuals, including use of Llama 3.2 related to the following:
8. Military, warfare, nuclear industries or applications, espionage, use for materials or activities that are subject to the International Traffic Arms Regulations (ITAR) maintained by the United States Department of State or to the U.S. Biological Weapons Anti-Terrorism Act of 1989 or the Chemical Weapons Convention Implementation Act of 1997
9. Guns and illegal weapons (including weapon development)
10. Illegal drugs and regulated/controlled substances
11. Operation of critical infrastructure, transportation technologies, or heavy machinery
12. Self-harm or harm to others, including suicide, cutting, and eating disorders
13. Any content intended to incite or promote violence, abuse, or any infliction of bodily harm to an individual
3. Intentionally deceive or mislead others, including use of Llama 3.2 related to the following:
14. Generating, promoting, or furthering fraud or the creation or promotion of disinformation
15. Generating, promoting, or furthering defamatory content, including the creation of defamatory statements, images, or other content
16. Generating, promoting, or further distributing spam
17. Impersonating another individual without consent, authorization, or legal right
18. Representing that the use of Llama 3.2 or outputs are human-generated
19. Generating or facilitating false online engagement, including fake reviews and other means of fake online engagement 
4. Fail to appropriately disclose to end users any known dangers of your AI system
5. Interact with third party tools, models, or software designed to generate unlawful content or engage in unlawful or harmful conduct and/or represent that the outputs of such tools, models, or software are associated with Meta or Llama 3.2
With respect to any multimodal models included in Llama 3.2, the rights granted under Section 1(a) of the Llama 3.2 Community License Agreement are not being granted to you if you are an individual domiciled in, or a company with a principal place of business in, the European Union. This restriction does not apply to end users of a product or service that incorporates any such multimodal models.
Please report any violation of this Policy, software “bug,” or other problems that could lead to a violation of this Policy through one of the following means:
* Reporting issues with the model: [https://github.com/meta-llama/llama-models/issues](https://l.workplace.com/l.php?u=https%3A%2F%2Fgithub.com%2Fmeta-llama%2Fllama-models%2Fissues&h=AT0qV8W9BFT6NwihiOHRuKYQM_UnkzN_NmHMy91OT55gkLpgi4kQupHUl0ssR4dQsIQ8n3tfd0vtkobvsEvt1l4Ic6GXI2EeuHV8N08OG2WnbAmm0FL4ObkazC6G_256vN0lN9DsykCvCqGZ)
* Reporting risky content generated by the model: [developers.facebook.com/llama_output_feedback](http://developers.facebook.com/llama_output_feedback)
* Reporting bugs and security concerns: [facebook.com/whitehat/info](http://facebook.com/whitehat/info)
* Reporting violations of the Acceptable Use Policy or unlicensed uses of Llama 3.2: LlamaUseReport@meta.com

1
args-lambda-quant.json Normal file
View File

@@ -0,0 +1 @@
{"quantization": "GPTQ-Int8", "model": "meta-llama/Llama-3.2-3B-Instruct", "dataset": "HuggingFaceH4/ultrachat_200k", "dataset_split": "train_sft", "dataset_name": null, "num_samples": 512, "seq_length": 2048, "batch_size": 32, "update_metadata_only": false}

61
config.json Normal file
View File

@@ -0,0 +1,61 @@
{
"_name_or_path": "/home/ubuntu/.cache/huggingface/hub/models--meta-llama--Llama-3.2-3B-Instruct/snapshots/0cb88a4f764b7a12671c53f0838cd831a0843b95",
"architectures": [
"LlamaForCausalLM"
],
"attention_bias": false,
"attention_dropout": 0.0,
"bos_token_id": 128000,
"eos_token_id": [
128001,
128008,
128009
],
"head_dim": 128,
"hidden_act": "silu",
"hidden_size": 3072,
"initializer_range": 0.02,
"intermediate_size": 8192,
"max_position_embeddings": 131072,
"mlp_bias": false,
"model_type": "llama",
"num_attention_heads": 24,
"num_hidden_layers": 28,
"num_key_value_heads": 8,
"pretraining_tp": 1,
"quantization_config": {
"bits": 8,
"checkpoint_format": "gptq",
"desc_act": true,
"group_size": 128,
"lm_head": false,
"meta": {
"damp_auto_increment": 0.0025,
"damp_percent": 0.01,
"mse": 0.0,
"quantizer": [
"gptqmodel:2.1.0"
],
"static_groups": false,
"true_sequential": true,
"uri": "https://github.com/modelcloud/gptqmodel"
},
"pack_dtype": "int32",
"quant_method": "gptq",
"sym": true
},
"rms_norm_eps": 1e-05,
"rope_scaling": {
"factor": 32.0,
"high_freq_factor": 4.0,
"low_freq_factor": 1.0,
"original_max_position_embeddings": 8192,
"rope_type": "llama3"
},
"rope_theta": 500000.0,
"tie_word_embeddings": true,
"torch_dtype": "bfloat16",
"transformers_version": "4.47.1",
"use_cache": true,
"vocab_size": 128256
}

12
generation_config.json Normal file
View File

@@ -0,0 +1,12 @@
{
"bos_token_id": 128000,
"do_sample": true,
"eos_token_id": [
128001,
128008,
128009
],
"temperature": 0.6,
"top_p": 0.9,
"transformers_version": "4.47.1"
}

3
model.safetensors Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:89206bfcfbdb886aa62df791311424a51257947cb6aaa6412a6caf2428f27f43
size 3676063968

197
quant_log.csv Normal file
View File

@@ -0,0 +1,197 @@
layer,module,loss,damp,time
0,self_attn.k_proj,0.00492,0.01000,0.988
0,self_attn.v_proj,0.00013,0.01000,0.606
0,self_attn.q_proj,0.00857,0.01000,0.632
0,self_attn.o_proj,0.00001,0.01000,0.676
0,mlp.up_proj,0.00566,0.01000,0.702
0,mlp.gate_proj,0.00634,0.01000,0.621
0,mlp.down_proj,0.00008,0.01000,1.867
1,self_attn.k_proj,0.00454,0.01000,0.713
1,self_attn.v_proj,0.00039,0.01000,0.591
1,self_attn.q_proj,0.00758,0.01000,0.585
1,self_attn.o_proj,0.00004,0.01000,0.665
1,mlp.up_proj,0.00745,0.01000,0.690
1,mlp.gate_proj,0.00848,0.01000,0.596
1,mlp.down_proj,0.00088,0.01000,1.865
2,self_attn.k_proj,0.02247,0.01000,0.714
2,self_attn.v_proj,0.00188,0.01000,0.591
2,self_attn.q_proj,0.03480,0.01000,0.594
2,self_attn.o_proj,0.00005,0.01000,0.666
2,mlp.up_proj,0.01189,0.01000,0.691
2,mlp.gate_proj,0.01393,0.01000,0.606
2,mlp.down_proj,0.00023,0.01000,1.854
3,self_attn.k_proj,0.01629,0.01000,0.718
3,self_attn.v_proj,0.00280,0.01000,0.603
3,self_attn.q_proj,0.02945,0.01000,0.594
3,self_attn.o_proj,0.00009,0.01000,0.669
3,mlp.up_proj,0.01539,0.01000,0.692
3,mlp.gate_proj,0.02018,0.01000,0.790
3,mlp.down_proj,0.00035,0.01000,1.852
4,self_attn.k_proj,0.01446,0.01000,0.714
4,self_attn.v_proj,0.00258,0.01000,0.601
4,self_attn.q_proj,0.02753,0.01000,0.599
4,self_attn.o_proj,0.00016,0.01000,0.795
4,mlp.up_proj,0.01831,0.01000,0.903
4,mlp.gate_proj,0.02698,0.01000,0.596
4,mlp.down_proj,0.00051,0.01000,2.028
5,self_attn.k_proj,0.02399,0.01000,0.714
5,self_attn.v_proj,0.00246,0.01000,0.592
5,self_attn.q_proj,0.03819,0.01000,0.592
5,self_attn.o_proj,0.00019,0.01000,0.658
5,mlp.up_proj,0.02186,0.01000,0.686
5,mlp.gate_proj,0.03038,0.01000,0.593
5,mlp.down_proj,0.00070,0.01000,2.023
6,self_attn.k_proj,0.01911,0.01000,0.711
6,self_attn.v_proj,0.00268,0.01000,0.591
6,self_attn.q_proj,0.03526,0.01000,0.589
6,self_attn.o_proj,0.00027,0.01000,0.752
6,mlp.up_proj,0.02338,0.01000,0.705
6,mlp.gate_proj,0.03217,0.01000,0.588
6,mlp.down_proj,0.00082,0.01000,1.958
7,self_attn.k_proj,0.01870,0.01000,0.703
7,self_attn.v_proj,0.00263,0.01000,0.595
7,self_attn.q_proj,0.03133,0.01000,0.591
7,self_attn.o_proj,0.00039,0.01000,0.658
7,mlp.up_proj,0.02524,0.01000,0.686
7,mlp.gate_proj,0.03204,0.01000,0.594
7,mlp.down_proj,0.00095,0.01000,1.821
8,self_attn.k_proj,0.02325,0.01000,0.711
8,self_attn.v_proj,0.00323,0.01000,0.595
8,self_attn.q_proj,0.03917,0.01000,0.592
8,self_attn.o_proj,0.00049,0.01000,0.658
8,mlp.up_proj,0.02657,0.01000,0.691
8,mlp.gate_proj,0.03434,0.01000,0.592
8,mlp.down_proj,0.00103,0.01000,1.832
9,self_attn.k_proj,0.02297,0.01000,0.711
9,self_attn.v_proj,0.00412,0.01000,0.593
9,self_attn.q_proj,0.03884,0.01000,0.596
9,self_attn.o_proj,0.00058,0.01000,0.665
9,mlp.up_proj,0.02708,0.01000,0.686
9,mlp.gate_proj,0.03375,0.01000,0.591
9,mlp.down_proj,0.00104,0.01000,1.832
10,self_attn.k_proj,0.02378,0.01000,0.711
10,self_attn.v_proj,0.00322,0.01000,0.593
10,self_attn.q_proj,0.03907,0.01000,0.588
10,self_attn.o_proj,0.00048,0.01000,0.660
10,mlp.up_proj,0.02872,0.01000,0.682
10,mlp.gate_proj,0.03349,0.01000,0.594
10,mlp.down_proj,0.00114,0.01000,1.838
11,self_attn.k_proj,0.01896,0.01000,0.819
11,self_attn.v_proj,0.00400,0.01000,0.594
11,self_attn.q_proj,0.03398,0.01000,0.588
11,self_attn.o_proj,0.00062,0.01000,0.651
11,mlp.up_proj,0.03075,0.01000,0.681
11,mlp.gate_proj,0.03419,0.01000,0.597
11,mlp.down_proj,0.00131,0.01000,1.842
12,self_attn.k_proj,0.02746,0.01000,0.721
12,self_attn.v_proj,0.00425,0.01000,0.592
12,self_attn.q_proj,0.04649,0.01000,0.597
12,self_attn.o_proj,0.00078,0.01000,0.660
12,mlp.up_proj,0.03298,0.01000,0.915
12,mlp.gate_proj,0.03604,0.01000,0.605
12,mlp.down_proj,0.00151,0.01000,1.828
13,self_attn.k_proj,0.02951,0.01000,0.712
13,self_attn.v_proj,0.00474,0.01000,0.772
13,self_attn.q_proj,0.04644,0.01000,0.604
13,self_attn.o_proj,0.00097,0.01000,0.658
13,mlp.up_proj,0.03625,0.01000,0.684
13,mlp.gate_proj,0.04115,0.01000,0.589
13,mlp.down_proj,0.00190,0.01000,1.824
14,self_attn.k_proj,0.02592,0.01000,0.882
14,self_attn.v_proj,0.00574,0.01000,0.610
14,self_attn.q_proj,0.05315,0.01000,0.591
14,self_attn.o_proj,0.00114,0.01000,0.668
14,mlp.up_proj,0.03979,0.01000,0.774
14,mlp.gate_proj,0.04587,0.01000,0.649
14,mlp.down_proj,0.00234,0.01000,1.837
15,self_attn.k_proj,0.03013,0.01000,0.714
15,self_attn.v_proj,0.00584,0.01000,0.597
15,self_attn.q_proj,0.05586,0.01000,0.584
15,self_attn.o_proj,0.00078,0.01000,0.665
15,mlp.up_proj,0.04097,0.01000,0.686
15,mlp.gate_proj,0.05158,0.01000,0.597
15,mlp.down_proj,0.00249,0.01000,1.823
16,self_attn.k_proj,0.03388,0.01000,0.713
16,self_attn.v_proj,0.00642,0.01000,0.593
16,self_attn.q_proj,0.05719,0.01000,0.583
16,self_attn.o_proj,0.00052,0.01000,0.658
16,mlp.up_proj,0.04162,0.01000,0.687
16,mlp.gate_proj,0.05441,0.01000,0.588
16,mlp.down_proj,0.00239,0.01000,1.835
17,self_attn.k_proj,0.03119,0.01000,0.717
17,self_attn.v_proj,0.00635,0.01000,0.597
17,self_attn.q_proj,0.05598,0.01000,0.596
17,self_attn.o_proj,0.00049,0.01000,0.667
17,mlp.up_proj,0.04332,0.01000,0.681
17,mlp.gate_proj,0.05766,0.01000,0.600
17,mlp.down_proj,0.00255,0.01000,1.820
18,self_attn.k_proj,0.03518,0.01000,0.716
18,self_attn.v_proj,0.00752,0.01000,0.711
18,self_attn.q_proj,0.06074,0.01000,0.696
18,self_attn.o_proj,0.00060,0.01000,0.664
18,mlp.up_proj,0.04720,0.01000,0.693
18,mlp.gate_proj,0.06175,0.01000,0.605
18,mlp.down_proj,0.00280,0.01000,2.011
19,self_attn.k_proj,0.03650,0.01000,0.725
19,self_attn.v_proj,0.00793,0.01000,0.599
19,self_attn.q_proj,0.05829,0.01000,0.591
19,self_attn.o_proj,0.00079,0.01000,0.659
19,mlp.up_proj,0.05115,0.01000,0.683
19,mlp.gate_proj,0.06564,0.01000,0.617
19,mlp.down_proj,0.00335,0.01000,2.026
20,self_attn.k_proj,0.03698,0.01000,0.707
20,self_attn.v_proj,0.00968,0.01000,0.591
20,self_attn.q_proj,0.06097,0.01000,0.608
20,self_attn.o_proj,0.00060,0.01000,0.662
20,mlp.up_proj,0.05315,0.01000,0.796
20,mlp.gate_proj,0.06568,0.01000,0.599
20,mlp.down_proj,0.00337,0.01000,1.832
21,self_attn.k_proj,0.03651,0.01000,0.800
21,self_attn.v_proj,0.01240,0.01000,0.681
21,self_attn.q_proj,0.06071,0.01000,0.594
21,self_attn.o_proj,0.00070,0.01000,0.659
21,mlp.up_proj,0.05718,0.01000,0.823
21,mlp.gate_proj,0.07113,0.01000,0.720
21,mlp.down_proj,0.00357,0.01000,2.004
22,self_attn.k_proj,0.03448,0.01000,0.710
22,self_attn.v_proj,0.01272,0.01000,0.679
22,self_attn.q_proj,0.06173,0.01000,0.687
22,self_attn.o_proj,0.00060,0.01000,0.659
22,mlp.up_proj,0.06175,0.01000,0.693
22,mlp.gate_proj,0.07729,0.01000,0.600
22,mlp.down_proj,0.00403,0.01000,1.825
23,self_attn.k_proj,0.03840,0.01000,0.703
23,self_attn.v_proj,0.01170,0.01000,0.585
23,self_attn.q_proj,0.06099,0.01000,0.758
23,self_attn.o_proj,0.00093,0.01000,0.663
23,mlp.up_proj,0.06775,0.01000,0.686
23,mlp.gate_proj,0.08832,0.01000,0.594
23,mlp.down_proj,0.00459,0.01000,1.829
24,self_attn.k_proj,0.04202,0.01000,0.789
24,self_attn.v_proj,0.01762,0.01000,0.602
24,self_attn.q_proj,0.06620,0.01000,0.592
24,self_attn.o_proj,0.00143,0.01000,0.743
24,mlp.up_proj,0.07604,0.01000,0.684
24,mlp.gate_proj,0.10110,0.01000,0.592
24,mlp.down_proj,0.00545,0.01000,1.949
25,self_attn.k_proj,0.03398,0.01000,0.714
25,self_attn.v_proj,0.01647,0.01000,0.591
25,self_attn.q_proj,0.06667,0.01000,0.590
25,self_attn.o_proj,0.00123,0.01000,0.661
25,mlp.up_proj,0.08287,0.01000,0.698
25,mlp.gate_proj,0.10945,0.01000,0.724
25,mlp.down_proj,0.00673,0.01000,1.947
26,self_attn.k_proj,0.03643,0.01000,0.713
26,self_attn.v_proj,0.02137,0.01000,0.718
26,self_attn.q_proj,0.06157,0.01000,0.709
26,self_attn.o_proj,0.00276,0.01000,0.666
26,mlp.up_proj,0.08697,0.01000,0.689
26,mlp.gate_proj,0.11678,0.01000,0.597
26,mlp.down_proj,0.00841,0.01000,1.851
27,self_attn.k_proj,0.02694,0.01000,0.729
27,self_attn.v_proj,0.01448,0.01000,0.666
27,self_attn.q_proj,0.05014,0.01000,0.680
27,self_attn.o_proj,0.00507,0.01000,0.813
27,mlp.up_proj,0.09256,0.01000,0.749
27,mlp.gate_proj,0.11142,0.01000,0.640
27,mlp.down_proj,0.01908,0.01000,1.888
1 layer module loss damp time
2 0 self_attn.k_proj 0.00492 0.01000 0.988
3 0 self_attn.v_proj 0.00013 0.01000 0.606
4 0 self_attn.q_proj 0.00857 0.01000 0.632
5 0 self_attn.o_proj 0.00001 0.01000 0.676
6 0 mlp.up_proj 0.00566 0.01000 0.702
7 0 mlp.gate_proj 0.00634 0.01000 0.621
8 0 mlp.down_proj 0.00008 0.01000 1.867
9 1 self_attn.k_proj 0.00454 0.01000 0.713
10 1 self_attn.v_proj 0.00039 0.01000 0.591
11 1 self_attn.q_proj 0.00758 0.01000 0.585
12 1 self_attn.o_proj 0.00004 0.01000 0.665
13 1 mlp.up_proj 0.00745 0.01000 0.690
14 1 mlp.gate_proj 0.00848 0.01000 0.596
15 1 mlp.down_proj 0.00088 0.01000 1.865
16 2 self_attn.k_proj 0.02247 0.01000 0.714
17 2 self_attn.v_proj 0.00188 0.01000 0.591
18 2 self_attn.q_proj 0.03480 0.01000 0.594
19 2 self_attn.o_proj 0.00005 0.01000 0.666
20 2 mlp.up_proj 0.01189 0.01000 0.691
21 2 mlp.gate_proj 0.01393 0.01000 0.606
22 2 mlp.down_proj 0.00023 0.01000 1.854
23 3 self_attn.k_proj 0.01629 0.01000 0.718
24 3 self_attn.v_proj 0.00280 0.01000 0.603
25 3 self_attn.q_proj 0.02945 0.01000 0.594
26 3 self_attn.o_proj 0.00009 0.01000 0.669
27 3 mlp.up_proj 0.01539 0.01000 0.692
28 3 mlp.gate_proj 0.02018 0.01000 0.790
29 3 mlp.down_proj 0.00035 0.01000 1.852
30 4 self_attn.k_proj 0.01446 0.01000 0.714
31 4 self_attn.v_proj 0.00258 0.01000 0.601
32 4 self_attn.q_proj 0.02753 0.01000 0.599
33 4 self_attn.o_proj 0.00016 0.01000 0.795
34 4 mlp.up_proj 0.01831 0.01000 0.903
35 4 mlp.gate_proj 0.02698 0.01000 0.596
36 4 mlp.down_proj 0.00051 0.01000 2.028
37 5 self_attn.k_proj 0.02399 0.01000 0.714
38 5 self_attn.v_proj 0.00246 0.01000 0.592
39 5 self_attn.q_proj 0.03819 0.01000 0.592
40 5 self_attn.o_proj 0.00019 0.01000 0.658
41 5 mlp.up_proj 0.02186 0.01000 0.686
42 5 mlp.gate_proj 0.03038 0.01000 0.593
43 5 mlp.down_proj 0.00070 0.01000 2.023
44 6 self_attn.k_proj 0.01911 0.01000 0.711
45 6 self_attn.v_proj 0.00268 0.01000 0.591
46 6 self_attn.q_proj 0.03526 0.01000 0.589
47 6 self_attn.o_proj 0.00027 0.01000 0.752
48 6 mlp.up_proj 0.02338 0.01000 0.705
49 6 mlp.gate_proj 0.03217 0.01000 0.588
50 6 mlp.down_proj 0.00082 0.01000 1.958
51 7 self_attn.k_proj 0.01870 0.01000 0.703
52 7 self_attn.v_proj 0.00263 0.01000 0.595
53 7 self_attn.q_proj 0.03133 0.01000 0.591
54 7 self_attn.o_proj 0.00039 0.01000 0.658
55 7 mlp.up_proj 0.02524 0.01000 0.686
56 7 mlp.gate_proj 0.03204 0.01000 0.594
57 7 mlp.down_proj 0.00095 0.01000 1.821
58 8 self_attn.k_proj 0.02325 0.01000 0.711
59 8 self_attn.v_proj 0.00323 0.01000 0.595
60 8 self_attn.q_proj 0.03917 0.01000 0.592
61 8 self_attn.o_proj 0.00049 0.01000 0.658
62 8 mlp.up_proj 0.02657 0.01000 0.691
63 8 mlp.gate_proj 0.03434 0.01000 0.592
64 8 mlp.down_proj 0.00103 0.01000 1.832
65 9 self_attn.k_proj 0.02297 0.01000 0.711
66 9 self_attn.v_proj 0.00412 0.01000 0.593
67 9 self_attn.q_proj 0.03884 0.01000 0.596
68 9 self_attn.o_proj 0.00058 0.01000 0.665
69 9 mlp.up_proj 0.02708 0.01000 0.686
70 9 mlp.gate_proj 0.03375 0.01000 0.591
71 9 mlp.down_proj 0.00104 0.01000 1.832
72 10 self_attn.k_proj 0.02378 0.01000 0.711
73 10 self_attn.v_proj 0.00322 0.01000 0.593
74 10 self_attn.q_proj 0.03907 0.01000 0.588
75 10 self_attn.o_proj 0.00048 0.01000 0.660
76 10 mlp.up_proj 0.02872 0.01000 0.682
77 10 mlp.gate_proj 0.03349 0.01000 0.594
78 10 mlp.down_proj 0.00114 0.01000 1.838
79 11 self_attn.k_proj 0.01896 0.01000 0.819
80 11 self_attn.v_proj 0.00400 0.01000 0.594
81 11 self_attn.q_proj 0.03398 0.01000 0.588
82 11 self_attn.o_proj 0.00062 0.01000 0.651
83 11 mlp.up_proj 0.03075 0.01000 0.681
84 11 mlp.gate_proj 0.03419 0.01000 0.597
85 11 mlp.down_proj 0.00131 0.01000 1.842
86 12 self_attn.k_proj 0.02746 0.01000 0.721
87 12 self_attn.v_proj 0.00425 0.01000 0.592
88 12 self_attn.q_proj 0.04649 0.01000 0.597
89 12 self_attn.o_proj 0.00078 0.01000 0.660
90 12 mlp.up_proj 0.03298 0.01000 0.915
91 12 mlp.gate_proj 0.03604 0.01000 0.605
92 12 mlp.down_proj 0.00151 0.01000 1.828
93 13 self_attn.k_proj 0.02951 0.01000 0.712
94 13 self_attn.v_proj 0.00474 0.01000 0.772
95 13 self_attn.q_proj 0.04644 0.01000 0.604
96 13 self_attn.o_proj 0.00097 0.01000 0.658
97 13 mlp.up_proj 0.03625 0.01000 0.684
98 13 mlp.gate_proj 0.04115 0.01000 0.589
99 13 mlp.down_proj 0.00190 0.01000 1.824
100 14 self_attn.k_proj 0.02592 0.01000 0.882
101 14 self_attn.v_proj 0.00574 0.01000 0.610
102 14 self_attn.q_proj 0.05315 0.01000 0.591
103 14 self_attn.o_proj 0.00114 0.01000 0.668
104 14 mlp.up_proj 0.03979 0.01000 0.774
105 14 mlp.gate_proj 0.04587 0.01000 0.649
106 14 mlp.down_proj 0.00234 0.01000 1.837
107 15 self_attn.k_proj 0.03013 0.01000 0.714
108 15 self_attn.v_proj 0.00584 0.01000 0.597
109 15 self_attn.q_proj 0.05586 0.01000 0.584
110 15 self_attn.o_proj 0.00078 0.01000 0.665
111 15 mlp.up_proj 0.04097 0.01000 0.686
112 15 mlp.gate_proj 0.05158 0.01000 0.597
113 15 mlp.down_proj 0.00249 0.01000 1.823
114 16 self_attn.k_proj 0.03388 0.01000 0.713
115 16 self_attn.v_proj 0.00642 0.01000 0.593
116 16 self_attn.q_proj 0.05719 0.01000 0.583
117 16 self_attn.o_proj 0.00052 0.01000 0.658
118 16 mlp.up_proj 0.04162 0.01000 0.687
119 16 mlp.gate_proj 0.05441 0.01000 0.588
120 16 mlp.down_proj 0.00239 0.01000 1.835
121 17 self_attn.k_proj 0.03119 0.01000 0.717
122 17 self_attn.v_proj 0.00635 0.01000 0.597
123 17 self_attn.q_proj 0.05598 0.01000 0.596
124 17 self_attn.o_proj 0.00049 0.01000 0.667
125 17 mlp.up_proj 0.04332 0.01000 0.681
126 17 mlp.gate_proj 0.05766 0.01000 0.600
127 17 mlp.down_proj 0.00255 0.01000 1.820
128 18 self_attn.k_proj 0.03518 0.01000 0.716
129 18 self_attn.v_proj 0.00752 0.01000 0.711
130 18 self_attn.q_proj 0.06074 0.01000 0.696
131 18 self_attn.o_proj 0.00060 0.01000 0.664
132 18 mlp.up_proj 0.04720 0.01000 0.693
133 18 mlp.gate_proj 0.06175 0.01000 0.605
134 18 mlp.down_proj 0.00280 0.01000 2.011
135 19 self_attn.k_proj 0.03650 0.01000 0.725
136 19 self_attn.v_proj 0.00793 0.01000 0.599
137 19 self_attn.q_proj 0.05829 0.01000 0.591
138 19 self_attn.o_proj 0.00079 0.01000 0.659
139 19 mlp.up_proj 0.05115 0.01000 0.683
140 19 mlp.gate_proj 0.06564 0.01000 0.617
141 19 mlp.down_proj 0.00335 0.01000 2.026
142 20 self_attn.k_proj 0.03698 0.01000 0.707
143 20 self_attn.v_proj 0.00968 0.01000 0.591
144 20 self_attn.q_proj 0.06097 0.01000 0.608
145 20 self_attn.o_proj 0.00060 0.01000 0.662
146 20 mlp.up_proj 0.05315 0.01000 0.796
147 20 mlp.gate_proj 0.06568 0.01000 0.599
148 20 mlp.down_proj 0.00337 0.01000 1.832
149 21 self_attn.k_proj 0.03651 0.01000 0.800
150 21 self_attn.v_proj 0.01240 0.01000 0.681
151 21 self_attn.q_proj 0.06071 0.01000 0.594
152 21 self_attn.o_proj 0.00070 0.01000 0.659
153 21 mlp.up_proj 0.05718 0.01000 0.823
154 21 mlp.gate_proj 0.07113 0.01000 0.720
155 21 mlp.down_proj 0.00357 0.01000 2.004
156 22 self_attn.k_proj 0.03448 0.01000 0.710
157 22 self_attn.v_proj 0.01272 0.01000 0.679
158 22 self_attn.q_proj 0.06173 0.01000 0.687
159 22 self_attn.o_proj 0.00060 0.01000 0.659
160 22 mlp.up_proj 0.06175 0.01000 0.693
161 22 mlp.gate_proj 0.07729 0.01000 0.600
162 22 mlp.down_proj 0.00403 0.01000 1.825
163 23 self_attn.k_proj 0.03840 0.01000 0.703
164 23 self_attn.v_proj 0.01170 0.01000 0.585
165 23 self_attn.q_proj 0.06099 0.01000 0.758
166 23 self_attn.o_proj 0.00093 0.01000 0.663
167 23 mlp.up_proj 0.06775 0.01000 0.686
168 23 mlp.gate_proj 0.08832 0.01000 0.594
169 23 mlp.down_proj 0.00459 0.01000 1.829
170 24 self_attn.k_proj 0.04202 0.01000 0.789
171 24 self_attn.v_proj 0.01762 0.01000 0.602
172 24 self_attn.q_proj 0.06620 0.01000 0.592
173 24 self_attn.o_proj 0.00143 0.01000 0.743
174 24 mlp.up_proj 0.07604 0.01000 0.684
175 24 mlp.gate_proj 0.10110 0.01000 0.592
176 24 mlp.down_proj 0.00545 0.01000 1.949
177 25 self_attn.k_proj 0.03398 0.01000 0.714
178 25 self_attn.v_proj 0.01647 0.01000 0.591
179 25 self_attn.q_proj 0.06667 0.01000 0.590
180 25 self_attn.o_proj 0.00123 0.01000 0.661
181 25 mlp.up_proj 0.08287 0.01000 0.698
182 25 mlp.gate_proj 0.10945 0.01000 0.724
183 25 mlp.down_proj 0.00673 0.01000 1.947
184 26 self_attn.k_proj 0.03643 0.01000 0.713
185 26 self_attn.v_proj 0.02137 0.01000 0.718
186 26 self_attn.q_proj 0.06157 0.01000 0.709
187 26 self_attn.o_proj 0.00276 0.01000 0.666
188 26 mlp.up_proj 0.08697 0.01000 0.689
189 26 mlp.gate_proj 0.11678 0.01000 0.597
190 26 mlp.down_proj 0.00841 0.01000 1.851
191 27 self_attn.k_proj 0.02694 0.01000 0.729
192 27 self_attn.v_proj 0.01448 0.01000 0.666
193 27 self_attn.q_proj 0.05014 0.01000 0.680
194 27 self_attn.o_proj 0.00507 0.01000 0.813
195 27 mlp.up_proj 0.09256 0.01000 0.749
196 27 mlp.gate_proj 0.11142 0.01000 0.640
197 27 mlp.down_proj 0.01908 0.01000 1.888

21
quantize_config.json Normal file
View File

@@ -0,0 +1,21 @@
{
"bits": 8,
"group_size": 128,
"desc_act": true,
"sym": true,
"lm_head": false,
"quant_method": "gptq",
"checkpoint_format": "gptq",
"pack_dtype": "int32",
"meta": {
"quantizer": [
"gptqmodel:2.1.0"
],
"uri": "https://github.com/modelcloud/gptqmodel",
"damp_percent": 0.01,
"damp_auto_increment": 0.0025,
"static_groups": false,
"true_sequential": true,
"mse": 0.0
}
}

View File

@@ -0,0 +1,83 @@
accelerate==1.5.2
aiohappyeyeballs==2.6.1
aiohttp==3.11.14
aiosignal==1.3.2
annotated-types==0.7.0
async-timeout==5.0.1
attrs==25.3.0
autoawq==0.2.8
certifi==2025.1.31
charset-normalizer==3.4.1
click==8.1.8
compressed-tensors==0.9.2
datasets==3.4.1
device-smi==0.4.1
dill==0.3.8
einops==0.8.1
filelock==3.18.0
flash_attn==2.7.4.post1
frozenlist==1.5.0
fsspec==2024.12.0
gekko==1.2.1
gptqmodel==2.1.0
hf_transfer==0.1.9
huggingface-hub==0.29.3
idna==3.10
Jinja2==3.1.6
llmcompressor==0.4.1
logbar==0.0.3
loguru==0.7.3
MarkupSafe==3.0.2
mpmath==1.3.0
multidict==6.2.0
multiprocess==0.70.16
networkx==3.4.2
numpy==1.26.4
nvidia-cublas-cu12==12.4.5.8
nvidia-cuda-cupti-cu12==12.4.127
nvidia-cuda-nvrtc-cu12==12.4.127
nvidia-cuda-runtime-cu12==12.4.127
nvidia-cudnn-cu12==9.1.0.70
nvidia-cufft-cu12==11.2.1.3
nvidia-curand-cu12==10.3.5.147
nvidia-cusolver-cu12==11.6.1.9
nvidia-cusparse-cu12==12.3.1.170
nvidia-cusparselt-cu12==0.6.2
nvidia-ml-py==12.570.86
nvidia-nccl-cu12==2.21.5
nvidia-nvjitlink-cu12==12.4.127
nvidia-nvtx-cu12==12.4.127
packaging==24.2
pandas==2.2.3
peft==0.14.0
pillow==11.1.0
propcache==0.3.0
protobuf==6.30.1
psutil==7.0.0
pyarrow==19.0.1
pydantic==2.10.6
pydantic_core==2.27.2
pynvml==12.0.0
python-dateutil==2.9.0.post0
pytz==2025.1
PyYAML==6.0.2
regex==2024.11.6
requests==2.32.3
rouge==1.0.1
safetensors==0.5.3
sentencepiece==0.2.0
six==1.17.0
sympy==1.13.1
threadpoolctl==3.6.0
tokenicer==0.0.4
tokenizers==0.21.1
torch==2.6.0
tqdm==4.67.1
transformers==4.47.1
triton==3.2.0
typing_extensions==4.12.2
tzdata==2025.1
urllib3==2.3.0
xxhash==3.5.0
yarl==1.18.3
zstandard==0.23.0

17
special_tokens_map.json Normal file
View File

@@ -0,0 +1,17 @@
{
"bos_token": {
"content": "<|begin_of_text|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"eos_token": {
"content": "<|eot_id|>",
"lstrip": false,
"normalized": false,
"rstrip": false,
"single_word": false
},
"pad_token": "<|finetune_right_pad_id|>"
}

3
tokenizer.json Normal file
View File

@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:384a7e7c676f7be2e5d2e8449c508be9b00e5b18c5b3c39ebc626e96b3f4b988
size 17210019

2064
tokenizer_config.json Normal file

File diff suppressed because it is too large Load Diff