commit 59259e6a4bd14c97b53f98b3c029cbd3b0811bdc Author: ModelHub XC Date: Thu Apr 30 08:36:58 2026 +0800 初始化项目,由ModelHub XC社区提供模型 Model: mradermacher/Llama-3.2-3B-i1-GGUF Source: Original Platform diff --git a/.gitattributes b/.gitattributes new file mode 100644 index 0000000..85434a6 --- /dev/null +++ b/.gitattributes @@ -0,0 +1,57 @@ +*.7z filter=lfs diff=lfs merge=lfs -text +*.arrow filter=lfs diff=lfs merge=lfs -text +*.bin filter=lfs diff=lfs merge=lfs -text +*.bz2 filter=lfs diff=lfs merge=lfs -text +*.ckpt filter=lfs diff=lfs merge=lfs -text +*.ftz filter=lfs diff=lfs merge=lfs -text +*.gz filter=lfs diff=lfs merge=lfs -text +*.h5 filter=lfs diff=lfs merge=lfs -text +*.joblib filter=lfs diff=lfs merge=lfs -text +*.lfs.* filter=lfs diff=lfs merge=lfs -text +*.mlmodel filter=lfs diff=lfs merge=lfs -text +*.model filter=lfs diff=lfs merge=lfs -text +*.msgpack filter=lfs diff=lfs merge=lfs -text +*.npy filter=lfs diff=lfs merge=lfs -text +*.npz filter=lfs diff=lfs merge=lfs -text +*.onnx filter=lfs diff=lfs merge=lfs -text +*.ot filter=lfs diff=lfs merge=lfs -text +*.parquet filter=lfs diff=lfs merge=lfs -text +*.pb filter=lfs diff=lfs merge=lfs -text +*.pickle filter=lfs diff=lfs merge=lfs -text +*.pkl filter=lfs diff=lfs merge=lfs -text +*.pt filter=lfs diff=lfs merge=lfs -text +*.pth filter=lfs diff=lfs merge=lfs -text +*.rar filter=lfs diff=lfs merge=lfs -text +*.safetensors filter=lfs diff=lfs merge=lfs -text +saved_model/**/* filter=lfs diff=lfs merge=lfs -text +*.tar.* filter=lfs diff=lfs merge=lfs -text +*.tar filter=lfs diff=lfs merge=lfs -text +*.tflite filter=lfs diff=lfs merge=lfs -text +*.tgz filter=lfs diff=lfs merge=lfs -text +*.wasm filter=lfs diff=lfs merge=lfs -text +*.xz filter=lfs diff=lfs merge=lfs -text +*.zip filter=lfs diff=lfs merge=lfs -text +*.zst filter=lfs diff=lfs merge=lfs -text +*tfevents* filter=lfs diff=lfs merge=lfs -text +imatrix.dat filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text +Llama-3.2-3B.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text diff --git a/Llama-3.2-3B.i1-IQ1_M.gguf b/Llama-3.2-3B.i1-IQ1_M.gguf new file mode 100644 index 0000000..2c62ec2 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ1_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:4267573576eb9cf68202cb294ec80e94581d920aa7338115a8bcbf37f364b48e +size 924187744 diff --git a/Llama-3.2-3B.i1-IQ1_S.gguf b/Llama-3.2-3B.i1-IQ1_S.gguf new file mode 100644 index 0000000..0bdaf34 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ1_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:eae3fd81b408c3c2c0e140ef5c436898eefc01078f9cfa204537be9ce8d0f31a +size 868154464 diff --git a/Llama-3.2-3B.i1-IQ2_M.gguf b/Llama-3.2-3B.i1-IQ2_M.gguf new file mode 100644 index 0000000..24d28e5 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ2_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:25aa167460ec665e823ff04c7e6c8b8004d241616b8fe3ee3746d34eff84fbec +size 1229028448 diff --git a/Llama-3.2-3B.i1-IQ2_S.gguf b/Llama-3.2-3B.i1-IQ2_S.gguf new file mode 100644 index 0000000..19b8013 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ2_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9dab439b5494b4d7bef8ae2e7005b70552ca6223d41074a0bb17bb3ce17f7be4 +size 1154317408 diff --git a/Llama-3.2-3B.i1-IQ2_XS.gguf b/Llama-3.2-3B.i1-IQ2_XS.gguf new file mode 100644 index 0000000..b9e05f0 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ2_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f99f60584446c8719ea53a26638d1aba3fa059890481b0b507ba7c7137a4030b +size 1100545120 diff --git a/Llama-3.2-3B.i1-IQ2_XXS.gguf b/Llama-3.2-3B.i1-IQ2_XXS.gguf new file mode 100644 index 0000000..35cfb04 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ2_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:1d97dc9e30473eb9e967c9fb2094ce40b6d59eddc8ca0b4dfe8abd767a98fa3b +size 1017576544 diff --git a/Llama-3.2-3B.i1-IQ3_M.gguf b/Llama-3.2-3B.i1-IQ3_M.gguf new file mode 100644 index 0000000..5780ab0 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ3_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b776c26465d916db37a97d8d6ae34849eae246a77abdad4cbfd61dcfbaead696 +size 1599665248 diff --git a/Llama-3.2-3B.i1-IQ3_S.gguf b/Llama-3.2-3B.i1-IQ3_S.gguf new file mode 100644 index 0000000..8d5a867 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ3_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b2fc8919606e0792fa396c64a82624bb173e3b0707088218e79db18f989e2f2a +size 1542845536 diff --git a/Llama-3.2-3B.i1-IQ3_XS.gguf b/Llama-3.2-3B.i1-IQ3_XS.gguf new file mode 100644 index 0000000..1e11bc4 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ3_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9fddd8398a828cfdf2f83cb845a07d36319f0c9ad2d17ea5fc361d6cedaa95cd +size 1476785248 diff --git a/Llama-3.2-3B.i1-IQ3_XXS.gguf b/Llama-3.2-3B.i1-IQ3_XXS.gguf new file mode 100644 index 0000000..75b6f0f --- /dev/null +++ b/Llama-3.2-3B.i1-IQ3_XXS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5222bde022b3f563debdfb7933612c20f92c48428c074e883c31b941702837c3 +size 1348762720 diff --git a/Llama-3.2-3B.i1-IQ4_XS.gguf b/Llama-3.2-3B.i1-IQ4_XS.gguf new file mode 100644 index 0000000..4fa7067 --- /dev/null +++ b/Llama-3.2-3B.i1-IQ4_XS.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:31ab15ea6313bffd25f7d617d6bd82a0660e7eea78376131a46724a1bae68f78 +size 1829106784 diff --git a/Llama-3.2-3B.i1-Q2_K.gguf b/Llama-3.2-3B.i1-Q2_K.gguf new file mode 100644 index 0000000..db246a1 --- /dev/null +++ b/Llama-3.2-3B.i1-Q2_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:e44192e5a1c1c7925527222d5e82f470446a63554532322704eccae9d9a23f0c +size 1363932256 diff --git a/Llama-3.2-3B.i1-Q3_K_L.gguf b/Llama-3.2-3B.i1-Q3_K_L.gguf new file mode 100644 index 0000000..6d53c4f --- /dev/null +++ b/Llama-3.2-3B.i1-Q3_K_L.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:f5cdc10efae0be1391487db13e3cbe515e34fffee6123aa1b427826e6aaa4dc2 +size 1815344224 diff --git a/Llama-3.2-3B.i1-Q3_K_M.gguf b/Llama-3.2-3B.i1-Q3_K_M.gguf new file mode 100644 index 0000000..47b3e2a --- /dev/null +++ b/Llama-3.2-3B.i1-Q3_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:bfcb4c094a583473f98462fbbecedbb5d93ea3a68149ec458e1a50fa165c2712 +size 1687155808 diff --git a/Llama-3.2-3B.i1-Q3_K_S.gguf b/Llama-3.2-3B.i1-Q3_K_S.gguf new file mode 100644 index 0000000..aa78ebf --- /dev/null +++ b/Llama-3.2-3B.i1-Q3_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:634649ee145ce2d50624fe19514a20fc06cfd4251c10fba7484152040a01848d +size 1542845536 diff --git a/Llama-3.2-3B.i1-Q4_0.gguf b/Llama-3.2-3B.i1-Q4_0.gguf new file mode 100644 index 0000000..6f7b4f7 --- /dev/null +++ b/Llama-3.2-3B.i1-Q4_0.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:38195937245b1e406976cab47daff8b38a0892ce7ef042a4611bd6e935a2ae5b +size 1921905760 diff --git a/Llama-3.2-3B.i1-Q4_K_M.gguf b/Llama-3.2-3B.i1-Q4_K_M.gguf new file mode 100644 index 0000000..7d8ad02 --- /dev/null +++ b/Llama-3.2-3B.i1-Q4_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:b146275f11611442b9154fb552ccc12aeb9ca068f5c81e68962b710c59d21fb1 +size 2019374176 diff --git a/Llama-3.2-3B.i1-Q4_K_S.gguf b/Llama-3.2-3B.i1-Q4_K_S.gguf new file mode 100644 index 0000000..22fd51b --- /dev/null +++ b/Llama-3.2-3B.i1-Q4_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:ff2caaaefe45e99212113fcf0b7e70a0dec51bf71f185f30867a69b3ddfe845f +size 1928197216 diff --git a/Llama-3.2-3B.i1-Q5_K_M.gguf b/Llama-3.2-3B.i1-Q5_K_M.gguf new file mode 100644 index 0000000..b3b68eb --- /dev/null +++ b/Llama-3.2-3B.i1-Q5_K_M.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:d2feab5a15d8570de6ddabdc1f5a4abe820a7f7c8ef8c94baa4cd358bcc38dd4 +size 2322150496 diff --git a/Llama-3.2-3B.i1-Q5_K_S.gguf b/Llama-3.2-3B.i1-Q5_K_S.gguf new file mode 100644 index 0000000..58b3456 --- /dev/null +++ b/Llama-3.2-3B.i1-Q5_K_S.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:5ed3247a95df5fa8e59e6ec60783dbdbd29104696e0a302c5a8f126bc008c1e5 +size 2269508704 diff --git a/Llama-3.2-3B.i1-Q6_K.gguf b/Llama-3.2-3B.i1-Q6_K.gguf new file mode 100644 index 0000000..9c058a4 --- /dev/null +++ b/Llama-3.2-3B.i1-Q6_K.gguf @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:36d31e487f3a4d6b5a28a9815cda55ea675abbd61160ed91f37268d554adc2eb +size 2643850336 diff --git a/README.md b/README.md new file mode 100644 index 0000000..385ff76 --- /dev/null +++ b/README.md @@ -0,0 +1,267 @@ +--- +base_model: alpindale/Llama-3.2-3B +extra_gated_button_content: Submit +extra_gated_fields: + Affiliation: text + ? By clicking Submit below I accept the terms of the license and acknowledge that + the information I provide will be collected stored processed and shared in accordance + with the Meta Privacy Policy + : checkbox + Country: country + Date of birth: date_picker + First Name: text + Job title: + options: + - Student + - Research Graduate + - AI researcher + - AI developer/engineer + - Reporter + - Other + type: select + Last Name: text + geo: ip_location +extra_gated_prompt: "### LLAMA 3.2 COMMUNITY LICENSE AGREEMENT\n\nLlama 3.2 Version + Release Date: September 25, 2024\n\n“Agreement” means the terms and conditions for + use, reproduction, distribution and modification of the Llama Materials set forth + herein.\n\n“Documentation” means the specifications, manuals and documentation accompanying + Llama 3.2 distributed by Meta at https://llama.meta.com/doc/overview.\n\n“Licensee” + or “you” means you, or your employer or any other person or entity (if you are entering + into this Agreement on such person or entity’s behalf), of the age required under + applicable laws, rules or regulations to provide legal consent and that has legal + authority to bind your employer or such other person or entity if you are entering + in this Agreement on their behalf.\n\n“Llama 3.2” means the foundational large language + models and software and algorithms, including machine-learning model code, trained + model weights, inference-enabling code, training-enabling code, fine-tuning enabling + code and other elements of the foregoing distributed by Meta at https://www.llama.com/llama-downloads.\n\n“Llama + Materials” means, collectively, Meta’s proprietary Llama 3.2 and Documentation (and + \ any portion thereof) made available under this Agreement.\n\n“Meta” or “we” means + Meta Platforms Ireland Limited (if you are located in or, if you are an entity, + your principal place of business is in the EEA or Switzerland) and Meta Platforms, + Inc. (if you are located outside of the EEA or Switzerland). \n\nBy clicking “I + Accept” below or by using or distributing any portion or element of the Llama Materials, + you agree to be bound by this Agreement.\n\n1. License Rights and Redistribution.\na. + Grant of Rights. You are granted a non-exclusive, worldwide, non-transferable and + royalty-free limited license under Meta’s intellectual property or other rights + \ owned by Meta embodied in the Llama Materials to use, reproduce, distribute, copy, + create derivative works of, and make modifications to the Llama Materials. \nb. + Redistribution and Use. \ni. If you distribute or make available the Llama Materials + (or any derivative works thereof), or a product or service (including another AI + model) that contains any of them, you shall (A) provide a copy of this Agreement + with any such Llama Materials; and (B) prominently display “Built with Llama” on + a related website, user interface, blogpost, about page, or product documentation. + If you use the Llama Materials or any outputs or results of the Llama Materials + to create, train, fine tune, or otherwise improve an AI model, which is distributed + or made available, you shall also include “Llama” at the beginning of any such AI + model name.\nii. If you receive Llama Materials, or any derivative works thereof, + from a Licensee as part of an integrated end user product, then Section 2 of this + Agreement will not apply to you. \niii. You must retain in all copies of the Llama + Materials that you distribute the following attribution notice within a “Notice” + text file distributed as a part of such copies: “Llama 3.2 is licensed under the + Llama 3.2 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.”\niv. + Your use of the Llama Materials must comply with applicable laws and regulations + (including trade compliance laws and regulations) and adhere to the Acceptable Use + Policy for the Llama Materials (available at https://www.llama.com/llama3_2/use-policy), + which is hereby incorporated by reference into this Agreement.\n \n2. Additional + Commercial Terms. If, on the Llama 3.2 version release date, the monthly active + users of the products or services made available by or for Licensee, or Licensee’s + affiliates, is greater than 700 million monthly active users in the preceding calendar + month, you must request a license from Meta, which Meta may grant to you in its + sole discretion, and you are not authorized to exercise any of the rights under + this Agreement unless or until Meta otherwise expressly grants you such rights.\n3. + Disclaimer of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE LLAMA MATERIALS AND + ANY OUTPUT AND RESULTS THEREFROM ARE PROVIDED ON AN “AS IS” BASIS, WITHOUT WARRANTIES + OF ANY KIND, AND META DISCLAIMS ALL WARRANTIES OF ANY KIND, BOTH EXPRESS AND IMPLIED, + INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY, + OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR DETERMINING + THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE LLAMA MATERIALS AND ASSUME ANY + RISKS ASSOCIATED WITH YOUR USE OF THE LLAMA MATERIALS AND ANY OUTPUT AND RESULTS.\n4. + Limitation of Liability. IN NO EVENT WILL META OR ITS AFFILIATES BE LIABLE UNDER + ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY, + OR OTHERWISE, ARISING OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT, + SPECIAL, CONSEQUENTIAL, INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF META + OR ITS AFFILIATES HAVE BEEN ADVISED OF THE POSSIBILITY OF ANY OF THE FOREGOING.\n5. + Intellectual Property.\na. No trademark licenses are granted under this Agreement, + and in connection with the Llama Materials, neither Meta nor Licensee may use any + name or mark owned by or associated with the other or any of its affiliates, except + as required for reasonable and customary use in describing and redistributing the + Llama Materials or as set forth in this Section 5(a). Meta hereby grants you a + license to use “Llama” (the “Mark”) solely as required to comply with the last + sentence of Section 1.b.i. You will comply with Meta’s brand guidelines (currently + accessible at https://about.meta.com/brand/resources/meta/company-brand/). All + goodwill arising out of your use of the Mark will inure to the benefit of Meta.\nb. + Subject to Meta’s ownership of Llama Materials and derivatives made by or for Meta, + with respect to any derivative works and modifications of the Llama Materials that + are made by you, as between you and Meta, you are and will be the owner of such + derivative works and modifications.\nc. If you institute litigation or other proceedings + against Meta or any entity (including a cross-claim or counterclaim in a lawsuit) + alleging that the Llama Materials or Llama 3.2 outputs or results, or any portion + of any of the foregoing, constitutes infringement of intellectual property or other + rights owned or licensable by you, then any licenses granted to you under this Agreement + shall terminate as of the date such litigation or claim is filed or instituted. + You will indemnify and hold harmless Meta from and against any claim by any third + party arising out of or related to your use or distribution of the Llama Materials.\n6. + Term and Termination. The term of this Agreement will commence upon your acceptance + of this Agreement or access to the Llama Materials and will continue in full force + and effect until terminated in accordance with the terms and conditions herein. + Meta may terminate this Agreement if you are in breach of any term or condition + of this Agreement. Upon termination of this Agreement, you shall delete and cease + use of the Llama Materials. Sections 3, 4 and 7 shall survive the termination of + this Agreement. \n7. Governing Law and Jurisdiction. This Agreement will be governed + and construed under the laws of the State of California without regard to choice + of law principles, and the UN Convention on Contracts for the International Sale + of Goods does not apply to this Agreement. The courts of California shall have exclusive + jurisdiction of any dispute arising out of this Agreement. \n### Llama 3.2 Acceptable + Use Policy\nMeta is committed to promoting safe and fair use of its tools and features, + including Llama 3.2. If you access or use Llama 3.2, you agree to this Acceptable + Use Policy (“**Policy**”). The most recent copy of this policy can be found at + [https://www.llama.com/llama3_2/use-policy](https://www.llama.com/llama3_2/use-policy).\n#### + Prohibited Uses\nWe want everyone to use Llama 3.2 safely and responsibly. You agree + you will not use, or allow others to use, Llama 3.2 to:\n1. Violate the law or others’ + rights, including to:\n 1. Engage in, promote, generate, contribute to, encourage, + plan, incite, or further illegal or unlawful activity or content, such as:\n 1. + Violence or terrorism\n 2. Exploitation or harm to children, including the + solicitation, creation, acquisition, or dissemination of child exploitative content + or failure to report Child Sexual Abuse Material\n 3. Human trafficking, + exploitation, and sexual violence\n 4. The illegal distribution of information + or materials to minors, including obscene materials, or failure to employ legally + required age-gating in connection with such information or materials.\n 5. + Sexual solicitation\n 6. Any other criminal activity\n 1. Engage in, promote, + incite, or facilitate the harassment, abuse, threatening, or bullying of individuals + or groups of individuals\n 2. Engage in, promote, incite, or facilitate discrimination + or other unlawful or harmful conduct in the provision of employment, employment + benefits, credit, housing, other economic benefits, or other essential goods and + services\n 3. Engage in the unauthorized or unlicensed practice of any profession + including, but not limited to, financial, legal, medical/health, or related professional + practices\n 4. Collect, process, disclose, generate, or infer private or sensitive + information about individuals, including information about individuals’ identity, + health, or demographic information, unless you have obtained the right to do so + in accordance with applicable law\n 5. Engage in or facilitate any action or + generate any content that infringes, misappropriates, or otherwise violates any + third-party rights, including the outputs or results of any products or services + using the Llama Materials\n 6. Create, generate, or facilitate the creation of + malicious code, malware, computer viruses or do anything else that could disable, + overburden, interfere with or impair the proper working, integrity, operation or + appearance of a website or computer system\n 7. Engage in any action, or facilitate + any action, to intentionally circumvent or remove usage restrictions or other safety + measures, or to enable functionality disabled by Meta \n2. Engage in, promote, incite, + facilitate, or assist in the planning or development of activities that present + a risk of death or bodily harm to individuals, including use of Llama 3.2 related + to the following:\n 8. Military, warfare, nuclear industries or applications, + espionage, use for materials or activities that are subject to the International + Traffic Arms Regulations (ITAR) maintained by the United States Department of State + or to the U.S. Biological Weapons Anti-Terrorism Act of 1989 or the Chemical Weapons + Convention Implementation Act of 1997\n 9. Guns and illegal weapons (including + weapon development)\n 10. Illegal drugs and regulated/controlled substances\n + \ 11. Operation of critical infrastructure, transportation technologies, or heavy + machinery\n 12. Self-harm or harm to others, including suicide, cutting, and + eating disorders\n 13. Any content intended to incite or promote violence, abuse, + or any infliction of bodily harm to an individual\n3. Intentionally deceive or mislead + others, including use of Llama 3.2 related to the following:\n 14. Generating, + promoting, or furthering fraud or the creation or promotion of disinformation\n + \ 15. Generating, promoting, or furthering defamatory content, including the creation + of defamatory statements, images, or other content\n 16. Generating, promoting, + or further distributing spam\n 17. Impersonating another individual without consent, + authorization, or legal right\n 18. Representing that the use of Llama 3.2 or + outputs are human-generated\n 19. Generating or facilitating false online engagement, + including fake reviews and other means of fake online engagement \n4. Fail to appropriately + disclose to end users any known dangers of your AI system 5. Interact with third + party tools, models, or software designed to generate unlawful content or engage + in unlawful or harmful conduct and/or represent that the outputs of such tools, + models, or software are associated with Meta or Llama 3.2\n\nWith respect to any + multimodal models included in Llama 3.2, the rights granted under Section 1(a) of + the Llama 3.2 Community License Agreement are not being granted to you if you are + an individual domiciled in, or a company with a principal place of business in, + the European Union. This restriction does not apply to end users of a product or + service that incorporates any such multimodal models.\n\nPlease report any violation + of this Policy, software “bug,” or other problems that could lead to a violation + of this Policy through one of the following means:\n\n* Reporting issues with the + model: [https://github.com/meta-llama/llama-models/issues](https://l.workplace.com/l.php?u=https%3A%2F%2Fgithub.com%2Fmeta-llama%2Fllama-models%2Fissues&h=AT0qV8W9BFT6NwihiOHRuKYQM_UnkzN_NmHMy91OT55gkLpgi4kQupHUl0ssR4dQsIQ8n3tfd0vtkobvsEvt1l4Ic6GXI2EeuHV8N08OG2WnbAmm0FL4ObkazC6G_256vN0lN9DsykCvCqGZ)\n* + Reporting risky content generated by the model: [developers.facebook.com/llama_output_feedback](http://developers.facebook.com/llama_output_feedback)\n* + Reporting bugs and security concerns: [facebook.com/whitehat/info](http://facebook.com/whitehat/info)\n* + Reporting violations of the Acceptable Use Policy or unlicensed uses of Llama 3.2: + LlamaUseReport@meta.com" +language: +- en +- de +- fr +- it +- pt +- hi +- es +- th +library_name: transformers +license: llama3.2 +quantized_by: mradermacher +tags: +- facebook +- meta +- pytorch +- llama +- llama-3 +--- +## About + + + + + + +weighted/imatrix quants of https://huggingface.co/alpindale/Llama-3.2-3B + + +static quants are available at https://huggingface.co/mradermacher/Llama-3.2-3B-GGUF +## Usage + +If you are unsure how to use GGUF files, refer to one of [TheBloke's +READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for +more details, including on how to concatenate multi-part files. + +## Provided Quants + +(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants) + +| Link | Type | Size/GB | Notes | +|:-----|:-----|--------:|:------| +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ1_S.gguf) | i1-IQ1_S | 1.0 | for the desperate | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ1_M.gguf) | i1-IQ1_M | 1.0 | mostly desperate | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 1.1 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ2_XS.gguf) | i1-IQ2_XS | 1.2 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ2_S.gguf) | i1-IQ2_S | 1.3 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ2_M.gguf) | i1-IQ2_M | 1.3 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 1.4 | lower quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q2_K.gguf) | i1-Q2_K | 1.5 | IQ3_XXS probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ3_XS.gguf) | i1-IQ3_XS | 1.6 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ3_S.gguf) | i1-IQ3_S | 1.6 | beats Q3_K* | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q3_K_S.gguf) | i1-Q3_K_S | 1.6 | IQ3_XS probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ3_M.gguf) | i1-IQ3_M | 1.7 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q3_K_M.gguf) | i1-Q3_K_M | 1.8 | IQ3_S probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q3_K_L.gguf) | i1-Q3_K_L | 1.9 | IQ3_M probably better | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-IQ4_XS.gguf) | i1-IQ4_XS | 1.9 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q4_0.gguf) | i1-Q4_0 | 2.0 | fast, low quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q4_K_S.gguf) | i1-Q4_K_S | 2.0 | optimal size/speed/quality | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q4_K_M.gguf) | i1-Q4_K_M | 2.1 | fast, recommended | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q5_K_S.gguf) | i1-Q5_K_S | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q5_K_M.gguf) | i1-Q5_K_M | 2.4 | | +| [GGUF](https://huggingface.co/mradermacher/Llama-3.2-3B-i1-GGUF/resolve/main/Llama-3.2-3B.i1-Q6_K.gguf) | i1-Q6_K | 2.7 | practically like static Q6_K | + +Here is a handy graph by ikawrakow comparing some lower-quality quant +types (lower is better): + +![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png) + +And here are Artefact2's thoughts on the matter: +https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9 + +## FAQ / Model Request + +See https://huggingface.co/mradermacher/model_requests for some answers to +questions you might have and/or if you want some other model quantized. + +## Thanks + +I thank my company, [nethype GmbH](https://www.nethype.de/), for letting +me use its servers and providing upgrades to my workstation to enable +this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to. + + diff --git a/imatrix.dat b/imatrix.dat new file mode 100644 index 0000000..eedc0e9 --- /dev/null +++ b/imatrix.dat @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:9dbf0c851b360f93c5f23522aea6a7fdf63d95f0b4669610fd62ee41cfee17f3 +size 2988377