Initialize the project; model provided by the ModelHub XC community

Model: RichardErkhov/bigscience_-_bloomz-7b1-gguf Source: Original Platform
2026-04-19 08:22:39 +08:00
commit c7f303a6ea
21 changed files with 1032 additions and 0 deletions
--- a/.gitattributes
+++ b/.gitattributes
@@ -0,0 +1,54 @@
 *.7z filter=lfs diff=lfs merge=lfs -text
 *.arrow filter=lfs diff=lfs merge=lfs -text
 *.bin filter=lfs diff=lfs merge=lfs -text
 *.bz2 filter=lfs diff=lfs merge=lfs -text
 *.ckpt filter=lfs diff=lfs merge=lfs -text
 *.ftz filter=lfs diff=lfs merge=lfs -text
 *.gz filter=lfs diff=lfs merge=lfs -text
 *.h5 filter=lfs diff=lfs merge=lfs -text
 *.joblib filter=lfs diff=lfs merge=lfs -text
 *.lfs.* filter=lfs diff=lfs merge=lfs -text
 *.mlmodel filter=lfs diff=lfs merge=lfs -text
 *.model filter=lfs diff=lfs merge=lfs -text
 *.msgpack filter=lfs diff=lfs merge=lfs -text
 *.npy filter=lfs diff=lfs merge=lfs -text
 *.npz filter=lfs diff=lfs merge=lfs -text
 *.onnx filter=lfs diff=lfs merge=lfs -text
 *.ot filter=lfs diff=lfs merge=lfs -text
 *.parquet filter=lfs diff=lfs merge=lfs -text
 *.pb filter=lfs diff=lfs merge=lfs -text
 *.pickle filter=lfs diff=lfs merge=lfs -text
 *.pkl filter=lfs diff=lfs merge=lfs -text
 *.pt filter=lfs diff=lfs merge=lfs -text
 *.pth filter=lfs diff=lfs merge=lfs -text
 *.rar filter=lfs diff=lfs merge=lfs -text
 *.safetensors filter=lfs diff=lfs merge=lfs -text
 saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.tar.* filter=lfs diff=lfs merge=lfs -text
 *.tar filter=lfs diff=lfs merge=lfs -text
 *.tflite filter=lfs diff=lfs merge=lfs -text
 *.tgz filter=lfs diff=lfs merge=lfs -text
 *.wasm filter=lfs diff=lfs merge=lfs -text
 *.xz filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q3_K.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q4_K.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q4_1.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q5_K.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q5_1.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
 bloomz-7b1.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
--- a/README.md
+++ b/README.md
@@ -0,0 +1,921 @@
 Quantization made by Richard Erkhov.
 [Github](https://github.com/RichardErkhov)
 [Discord](https://discord.gg/pvy7H8DZMG)
 [Request more models](https://github.com/RichardErkhov/quant_request)
 bloomz-7b1 - GGUF
 - Model creator: https://huggingface.co/bigscience/
 - Original model: https://huggingface.co/bigscience/bloomz-7b1/
 | Name | Quant method | Size |
 | ---- | ---- | ---- |
 | [bloomz-7b1.Q2_K.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q2_K.gguf) | Q2_K | 3.2GB |
 | [bloomz-7b1.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q3_K_S.gguf) | Q3_K_S | 3.63GB |
 | [bloomz-7b1.Q3_K.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q3_K.gguf) | Q3_K | 4.14GB |
 | [bloomz-7b1.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q3_K_M.gguf) | Q3_K_M | 4.14GB |
 | [bloomz-7b1.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q3_K_L.gguf) | Q3_K_L | 4.42GB |
 | [bloomz-7b1.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.IQ4_XS.gguf) | IQ4_XS | 4.33GB |
 | [bloomz-7b1.Q4_0.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q4_0.gguf) | Q4_0 | 4.51GB |
 | [bloomz-7b1.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.IQ4_NL.gguf) | IQ4_NL | 4.53GB |
 | [bloomz-7b1.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q4_K_S.gguf) | Q4_K_S | 4.53GB |
 | [bloomz-7b1.Q4_K.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q4_K.gguf) | Q4_K | 4.91GB |
 | [bloomz-7b1.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q4_K_M.gguf) | Q4_K_M | 4.91GB |
 | [bloomz-7b1.Q4_1.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q4_1.gguf) | Q4_1 | 4.92GB |
 | [bloomz-7b1.Q5_0.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q5_0.gguf) | Q5_0 | 5.33GB |
 | [bloomz-7b1.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q5_K_S.gguf) | Q5_K_S | 5.33GB |
 | [bloomz-7b1.Q5_K.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q5_K.gguf) | Q5_K | 5.63GB |
 | [bloomz-7b1.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q5_K_M.gguf) | Q5_K_M | 5.63GB |
 | [bloomz-7b1.Q5_1.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q5_1.gguf) | Q5_1 | 5.74GB |
 | [bloomz-7b1.Q6_K.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q6_K.gguf) | Q6_K | 6.2GB |
 | [bloomz-7b1.Q8_0.gguf](https://huggingface.co/RichardErkhov/bigscience_-_bloomz-7b1-gguf/blob/main/bloomz-7b1.Q8_0.gguf) | Q8_0 | 8.03GB |
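The links in the table point at `/blob/main/` pages; on the Hugging Face Hub the direct-download path for the same file uses `/resolve/main/` instead. Judging by the Git LFS pointer sizes in this commit (for example 3436620640 bytes for Q2_K), the "GB" figures appear to be binary gigabytes. The following is a minimal sketch, not part of the original card, of picking the largest file under a size budget and building its download URL. The helper names (`gguf_filename`, `gguf_url`, `pick_quant`) and the 5.0 budget are illustrative assumptions, and the sizes describe the file only, not the memory needed at run time.

```python
from typing import Optional

REPO_ID = "RichardErkhov/bigscience_-_bloomz-7b1-gguf"

# File sizes copied from the table above, in the units the table uses.
QUANT_SIZES_GB = {
    "Q2_K": 3.2, "Q3_K_S": 3.63, "Q3_K": 4.14, "Q3_K_M": 4.14,
    "Q3_K_L": 4.42, "IQ4_XS": 4.33, "Q4_0": 4.51, "IQ4_NL": 4.53,
    "Q4_K_S": 4.53, "Q4_K": 4.91, "Q4_K_M": 4.91, "Q4_1": 4.92,
    "Q5_0": 5.33, "Q5_K_S": 5.33, "Q5_K": 5.63, "Q5_K_M": 5.63,
    "Q5_1": 5.74, "Q6_K": 6.2, "Q8_0": 8.03,
}


def gguf_filename(quant: str) -> str:
    """Return the file name used in this repository for a quant method."""
    if quant not in QUANT_SIZES_GB:
        raise KeyError(f"unknown quant method: {quant}")
    return f"bloomz-7b1.{quant}.gguf"


def gguf_url(quant: str) -> str:
    """Build the direct-download (resolve) URL for a quant method."""
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{gguf_filename(quant)}"


def pick_quant(budget_gb: float) -> Optional[str]:
    """Return the largest listed file that fits the budget, or None."""
    fitting = [q for q, size in QUANT_SIZES_GB.items() if size <= budget_gb]
    if not fitting:
        return None
    return max(fitting, key=QUANT_SIZES_GB.get)


choice = pick_quant(5.0)  # hypothetical budget, in the table's units
if choice is not None:
    print(choice, gguf_url(choice))
```

Larger files generally preserve more of the original weights' precision, so choosing the largest file that fits is a simple heuristic rather than a recommendation from the quantizer.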
 Original model description:
 ---
 datasets:
 - bigscience/xP3
 license: bigscience-bloom-rail-1.0
 language:
 - ak
 - ar
 - as
 - bm
 - bn
 - ca
 - code
 - en
 - es
 - eu
 - fon
 - fr
 - gu
 - hi
 - id
 - ig
 - ki
 - kn
 - lg
 - ln
 - ml
 - mr
 - ne
 - nso
 - ny
 - or
 - pa
 - pt
 - rn
 - rw
 - sn
 - st
 - sw
 - ta
 - te
 - tn
 - ts
 - tum
 - tw
 - ur
 - vi
 - wo
 - xh
 - yo
 - zh
 - zu
 programming_language: 
 - C
 - C++
 - C#
 - Go
 - Java
 - JavaScript
 - Lua
 - PHP
 - Python
 - Ruby
 - Rust
 - Scala
 - TypeScript
 pipeline_tag: text-generation
 widget:
 - text: "一个传奇的开端，一个不灭的神话，这不仅仅是一部电影，而是作为一个走进新时代的标签，永远彪炳史册。Would you rate the previous review as positive, neutral or negative?"
  example_title: "zh-en sentiment"
 - text: "一个传奇的开端，一个不灭的神话，这不仅仅是一部电影，而是作为一个走进新时代的标签，永远彪炳史册。你认为这句话的立场是赞扬、中立还是批评？"
  example_title: "zh-zh sentiment"
 - text: "Suggest at least five related search terms to \"Mạng neural nhân tạo\"."
  example_title: "vi-en query"
 - text: "Proposez au moins cinq mots clés concernant «Réseau de neurones artificiels»."
  example_title: "fr-fr query"
 - text: "Explain in a sentence in Telugu what is backpropagation in neural networks."
  example_title: "te-en qa"
 - text: "Why is the sky blue?"
  example_title: "en-en qa"
 - text: "Write a fairy tale about a troll saving a princess from a dangerous dragon. The fairy tale is a masterpiece that has achieved praise worldwide and its moral is \"Heroes Come in All Shapes and Sizes\". Story (in Spanish):"
  example_title: "es-en fable"
 - text: "Write a fable about wood elves living in a forest that is suddenly invaded by ogres. The fable is a masterpiece that has achieved praise worldwide and its moral is \"Violence is the last refuge of the incompetent\". Fable (in Hindi):"
  example_title: "hi-en fable"
 model-index:
 - name: bloomz-7b1
  results:
  - task:
      type: Coreference resolution
    dataset:
      type: winogrande
      name: Winogrande XL (xl)
      config: xl
      split: validation
      revision: a80f460359d1e9a67c006011c94de42a8759430c
    metrics:
    - type: Accuracy
      value: 55.8
  - task:
      type: Coreference resolution
    dataset:
      type: Muennighoff/xwinograd
      name: XWinograd (en)
      config: en
      split: test
      revision: 9dd5ea5505fad86b7bedad667955577815300cee
    metrics:
    - type: Accuracy
      value: 66.02
  - task:
      type: Coreference resolution
    dataset:
      type: Muennighoff/xwinograd
      name: XWinograd (fr)
      config: fr
      split: test
      revision: 9dd5ea5505fad86b7bedad667955577815300cee
    metrics:
    - type: Accuracy
      value: 57.83
  - task:
      type: Coreference resolution
    dataset:
      type: Muennighoff/xwinograd
      name: XWinograd (jp)
      config: jp
      split: test
      revision: 9dd5ea5505fad86b7bedad667955577815300cee
    metrics:
    - type: Accuracy
      value: 52.87
  - task:
      type: Coreference resolution
    dataset:
      type: Muennighoff/xwinograd
      name: XWinograd (pt)
      config: pt
      split: test
      revision: 9dd5ea5505fad86b7bedad667955577815300cee
    metrics:
    - type: Accuracy
      value: 57.79
  - task:
      type: Coreference resolution
    dataset:
      type: Muennighoff/xwinograd
      name: XWinograd (ru)
      config: ru
      split: test
      revision: 9dd5ea5505fad86b7bedad667955577815300cee
    metrics:
    - type: Accuracy
      value: 54.92
  - task:
      type: Coreference resolution
    dataset:
      type: Muennighoff/xwinograd
      name: XWinograd (zh)
      config: zh
      split: test
      revision: 9dd5ea5505fad86b7bedad667955577815300cee
    metrics:
    - type: Accuracy
      value: 63.69
  - task:
      type: Natural language inference
    dataset:
      type: anli
      name: ANLI (r1)
      config: r1
      split: validation
      revision: 9dbd830a06fea8b1c49d6e5ef2004a08d9f45094
    metrics:
    - type: Accuracy
      value: 42.1
  - task:
      type: Natural language inference
    dataset:
      type: anli
      name: ANLI (r2)
      config: r2
      split: validation
      revision: 9dbd830a06fea8b1c49d6e5ef2004a08d9f45094
    metrics:
    - type: Accuracy
      value: 39.5
  - task:
      type: Natural language inference
    dataset:
      type: anli
      name: ANLI (r3)
      config: r3
      split: validation
      revision: 9dbd830a06fea8b1c49d6e5ef2004a08d9f45094
    metrics:
    - type: Accuracy
      value: 41.0
  - task:
      type: Natural language inference
    dataset:
      type: super_glue
      name: SuperGLUE (cb)
      config: cb
      split: validation
      revision: 9e12063561e7e6c79099feb6d5a493142584e9e2
    metrics:
    - type: Accuracy
      value: 80.36
  - task:
      type: Natural language inference
    dataset:
      type: super_glue
      name: SuperGLUE (rte)
      config: rte
      split: validation
      revision: 9e12063561e7e6c79099feb6d5a493142584e9e2
    metrics:
    - type: Accuracy
      value: 84.12
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (ar)
      config: ar
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 53.25
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (bg)
      config: bg
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 43.61
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (de)
      config: de
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 46.83
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (el)
      config: el
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 41.53
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (en)
      config: en
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 59.68
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (es)
      config: es
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 55.1
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (fr)
      config: fr
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 55.26
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (hi)
      config: hi
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 50.88
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (ru)
      config: ru
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 47.75
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (sw)
      config: sw
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 46.63
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (th)
      config: th
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 40.12
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (tr)
      config: tr
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 37.55
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (ur)
      config: ur
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 46.51
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (vi)
      config: vi
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 52.93
  - task:
      type: Natural language inference
    dataset:
      type: xnli
      name: XNLI (zh)
      config: zh
      split: validation
      revision: a5a45e4ff92d5d3f34de70aaf4b72c3bdf9f7f16
    metrics:
    - type: Accuracy
      value: 53.61
  - task:
      type: Program synthesis
    dataset:
      type: openai_humaneval
      name: HumanEval
      config: None
      split: test
      revision: e8dc562f5de170c54b5481011dd9f4fa04845771
    metrics:
    - type: Pass@1
      value: 8.06
    - type: Pass@10
      value: 15.03
    - type: Pass@100
      value: 27.49
  - task:
      type: Sentence completion
    dataset:
      type: story_cloze
      name: StoryCloze (2016)
      config: "2016"
      split: validation
      revision: e724c6f8cdf7c7a2fb229d862226e15b023ee4db
    metrics:
    - type: Accuracy
      value: 90.43
  - task:
      type: Sentence completion
    dataset:
      type: super_glue
      name: SuperGLUE (copa)
      config: copa
      split: validation
      revision: 9e12063561e7e6c79099feb6d5a493142584e9e2
    metrics:
    - type: Accuracy
      value: 86.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (et)
      config: et
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 50.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (ht)
      config: ht
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 54.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (id)
      config: id
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 76.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (it)
      config: it
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 61.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (qu)
      config: qu
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 60.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (sw)
      config: sw
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 63.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (ta)
      config: ta
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 64.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (th)
      config: th
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 57.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (tr)
      config: tr
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 53.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (vi)
      config: vi
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 79.0
  - task:
      type: Sentence completion
    dataset:
      type: xcopa
      name: XCOPA (zh)
      config: zh
      split: validation
      revision: 37f73c60fb123111fa5af5f9b705d0b3747fd187
    metrics:
    - type: Accuracy
      value: 81.0
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (ar)
      config: ar
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 83.26
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (es)
      config: es
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 88.95
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (eu)
      config: eu
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 73.33
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (hi)
      config: hi
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 80.61
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (id)
      config: id
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 84.25
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (my)
      config: my
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 52.55
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (ru)
      config: ru
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 65.32
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (sw)
      config: sw
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 71.67
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (te)
      config: te
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 74.72
  - task:
      type: Sentence completion
    dataset:
      type: Muennighoff/xstory_cloze
      name: XStoryCloze (zh)
      config: zh
      split: validation
      revision: 8bb76e594b68147f1a430e86829d07189622b90d
    metrics:
    - type: Accuracy
      value: 85.37
 ---
 ![xmtf](https://github.com/bigscience-workshop/xmtf/blob/master/xmtf_banner.png?raw=true)
 #  Table of Contents
 1. [Model Summary](#model-summary)
 2. [Use](#use)
 3. [Limitations](#limitations)
 4. [Training](#training)
 5. [Evaluation](#evaluation)
 6. [Citation](#citation)
 # Model Summary
 > We present BLOOMZ & mT0, a family of models capable of following human instructions in dozens of languages zero-shot. We finetune BLOOM & mT5 pretrained multilingual language models on our crosslingual task mixture (xP3) and find the resulting models capable of crosslingual generalization to unseen tasks & languages.
 - **Repository:** [bigscience-workshop/xmtf](https://github.com/bigscience-workshop/xmtf)
 - **Paper:** [Crosslingual Generalization through Multitask Finetuning](https://arxiv.org/abs/2211.01786)
 - **Point of Contact:** [Niklas Muennighoff](mailto:niklas@hf.co)
 - **Languages:** Refer to [bloom](https://huggingface.co/bigscience/bloom) for pretraining & [xP3](https://huggingface.co/datasets/bigscience/xP3) for finetuning language proportions. It understands both pretraining & finetuning languages.
 - **BLOOMZ & mT0 Model Family:**
 <div class="max-w-full overflow-auto">
 <table>
  <tr>
 <th colspan="12">Multitask finetuned on <a style="font-weight:bold" href=https://huggingface.co/datasets/bigscience/xP3>xP3</a>. Recommended for prompting in English.</th>
 </tr>
 <tr>
 <td>Parameters</td>
 <td>300M</td>
 <td>580M</td>
 <td>1.2B</td>
 <td>3.7B</td>
 <td>13B</td>
 <td>560M</td>
 <td>1.1B</td>
 <td>1.7B</td>
 <td>3B</td>
 <td>7.1B</td>
 <td>176B</td>
 </tr>
 <tr>
 <td>Finetuned Model</td>
 <td><a href=https://huggingface.co/bigscience/mt0-small>mt0-small</a></td>  
 <td><a href=https://huggingface.co/bigscience/mt0-base>mt0-base</a></td>
 <td><a href=https://huggingface.co/bigscience/mt0-large>mt0-large</a></td>
 <td><a href=https://huggingface.co/bigscience/mt0-xl>mt0-xl</a></td>
 <td><a href=https://huggingface.co/bigscience/mt0-xxl>mt0-xxl</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-560m>bloomz-560m</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-1b1>bloomz-1b1</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-1b7>bloomz-1b7</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-3b>bloomz-3b</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-7b1>bloomz-7b1</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz>bloomz</a></td>
 </tr>
  <tr>
 <th colspan="12">Multitask finetuned on <a style="font-weight:bold" href=https://huggingface.co/datasets/bigscience/xP3mt>xP3mt</a>. Recommended for prompting in non-English.</th>
 </tr>
 <tr>
 <td>Finetuned Model</td>
 <td></td>
 <td></td>
 <td></td>
 <td></td>
 <td><a href=https://huggingface.co/bigscience/mt0-xxl-mt>mt0-xxl-mt</a></td>
 <td></td>
 <td></td>
 <td></td>
 <td></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-7b1-mt>bloomz-7b1-mt</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-mt>bloomz-mt</a></td>
 </tr>
 <tr>
 <th colspan="12">Multitask finetuned on <a style="font-weight:bold" href=https://huggingface.co/datasets/Muennighoff/P3>P3</a>. Released for research purposes only. Strictly inferior to above models!</th>
 </tr>
 <tr>
 <td>Finetuned Model</td>
 <td></td>
 <td></td>
 <td></td>
 <td></td>
 <td><a href=https://huggingface.co/bigscience/mt0-xxl-p3>mt0-xxl-p3</a></td>
 <td></td>
 <td></td>
 <td></td>
 <td></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-7b1-p3>bloomz-7b1-p3</a></td>
 <td><a href=https://huggingface.co/bigscience/bloomz-p3>bloomz-p3</a></td>
 </tr>
 <tr>
 <th colspan="12">Original pretrained checkpoints. Not recommended.</th>
 </tr>
 <tr>
 <td>Pretrained Model</td>
 <td><a href=https://huggingface.co/google/mt5-small>mt5-small</a></td>  
 <td><a href=https://huggingface.co/google/mt5-base>mt5-base</a></td>
 <td><a href=https://huggingface.co/google/mt5-large>mt5-large</a></td>
 <td><a href=https://huggingface.co/google/mt5-xl>mt5-xl</a></td>
 <td><a href=https://huggingface.co/google/mt5-xxl>mt5-xxl</a></td>
 <td><a href=https://huggingface.co/bigscience/bloom-560m>bloom-560m</a></td>
 <td><a href=https://huggingface.co/bigscience/bloom-1b1>bloom-1b1</a></td>
 <td><a href=https://huggingface.co/bigscience/bloom-1b7>bloom-1b7</a></td>
 <td><a href=https://huggingface.co/bigscience/bloom-3b>bloom-3b</a></td>
 <td><a href=https://huggingface.co/bigscience/bloom-7b1>bloom-7b1</a></td>
 <td><a href=https://huggingface.co/bigscience/bloom>bloom</a></td>
 </tr>
 </table>
 </div>
 # Use
 ## Intended use
 We recommend using the model to perform tasks expressed in natural language. For example, given the prompt "*Translate to English: Je t’aime.*", the model will most likely answer "*I love you.*". Some prompt ideas from our paper: 
 - 一个传奇的开端，一个不灭的神话，这不仅仅是一部电影，而是作为一个走进新时代的标签，永远彪炳史册。你认为这句话的立场是赞扬、中立还是批评?
 - Suggest at least five related search terms to "Mạng neural nhân tạo".
 - Write a fairy tale about a troll saving a princess from a dangerous dragon. The fairy tale is a masterpiece that has achieved praise worldwide and its moral is "Heroes Come in All Shapes and Sizes". Story (in Spanish):
 - Explain in a sentence in Telugu what is backpropagation in neural networks.
 **Feel free to share your generations in the Community tab!**
 ## How to use
 ### CPU
 <details>
 <summary> Click to expand </summary>
 ```python
 # pip install -q transformers
 from transformers import AutoModelForCausalLM, AutoTokenizer
 checkpoint = "bigscience/bloomz-7b1"
 tokenizer = AutoTokenizer.from_pretrained(checkpoint)
 model = AutoModelForCausalLM.from_pretrained(checkpoint)
 inputs = tokenizer.encode("Translate to English: Je t’aime.", return_tensors="pt")
 outputs = model.generate(inputs)
 print(tokenizer.decode(outputs[0]))
 ```
 </details>
 ### GPU
 <details>
 <summary> Click to expand </summary>
 ```python
 # pip install -q transformers accelerate
 from transformers import AutoModelForCausalLM, AutoTokenizer
 checkpoint = "bigscience/bloomz-7b1"
 tokenizer = AutoTokenizer.from_pretrained(checkpoint)
 model = AutoModelForCausalLM.from_pretrained(checkpoint, torch_dtype="auto", device_map="auto")
 inputs = tokenizer.encode("Translate to English: Je t’aime.", return_tensors="pt").to("cuda")
 outputs = model.generate(inputs)
 print(tokenizer.decode(outputs[0]))
 ```
 </details>
 ### GPU in 8bit
 <details>
 <summary> Click to expand </summary>
 ```python
 # pip install -q transformers accelerate bitsandbytes
 from transformers import AutoModelForCausalLM, AutoTokenizer
 checkpoint = "bigscience/bloomz-7b1"
 tokenizer = AutoTokenizer.from_pretrained(checkpoint)
 model = AutoModelForCausalLM.from_pretrained(checkpoint, device_map="auto", load_in_8bit=True)
 inputs = tokenizer.encode("Translate to English: Je t’aime.", return_tensors="pt").to("cuda")
 outputs = model.generate(inputs)
 print(tokenizer.decode(outputs[0]))
 ```
 </details>
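### GGUF files from this repository

The three snippets above load the original `bigscience/bloomz-7b1` checkpoint through `transformers`. The GGUF files in this repository are instead meant for llama.cpp-compatible runtimes. As a rough sketch, and assuming the third-party `llama-cpp-python` and `huggingface_hub` packages are installed, one of the quantized files could be fetched and run as follows. The choice of `Q4_K_M`, the context size, and the helper names `completion_text` and `run` are illustrative assumptions, not recommendations from the model authors or the quantizer.

```python
# pip install -q llama-cpp-python huggingface_hub
REPO_ID = "RichardErkhov/bigscience_-_bloomz-7b1-gguf"
FILENAME = "bloomz-7b1.Q4_K_M.gguf"  # assumption: any file from the table could be used


def completion_text(completion: dict) -> str:
    """Extract the generated text from a completion dict."""
    return completion["choices"][0]["text"].strip()


def run(prompt: str, max_tokens: int = 64) -> str:
    """Download the GGUF file (cached after the first call) and complete the prompt."""
    # Imported lazily so that the helper above works without the heavy dependencies.
    from huggingface_hub import hf_hub_download
    from llama_cpp import Llama

    model_path = hf_hub_download(repo_id=REPO_ID, filename=FILENAME)
    llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)
    return completion_text(llm(prompt, max_tokens=max_tokens))


# Example call (downloads several gigabytes on first use):
# print(run("Translate to English: Je t'aime."))
```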
 <!-- Necessary for whitespace -->
 ###
 # Limitations
 **Prompt Engineering:** Performance may vary depending on the prompt. For BLOOMZ models, we recommend making it very clear where the input stops, so that the model does not try to continue it. For example, the prompt "*Translate to English: Je t'aime*", without the full stop (.) at the end, may result in the model trying to continue the French sentence. Better prompts are, e.g., "*Translate to English: Je t'aime.*", "*Translate to English: Je t'aime. Translation:*", or "*What is "Je t'aime." in English?*", where it is clear to the model when it should answer. Further, we recommend giving the model as much context as possible. For example, if you want it to answer in Telugu, then tell the model so, e.g. "*Explain in a sentence in Telugu what is backpropagation in neural networks.*".
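The advice about ending the input clearly can be applied mechanically before a prompt is sent to the model. The sketch below is one possible guard, not something from the paper: the helper name `close_prompt` and the set of characters treated as terminators are assumptions chosen for illustration.

```python
# Characters treated as a clear end of input (an illustrative, incomplete set).
TERMINATORS = (".", "?", "!", ":", "。", "？", "！", "：")


def close_prompt(prompt: str) -> str:
    """Append a full stop when the prompt does not already end in a terminator."""
    stripped = prompt.rstrip()
    if not stripped or stripped.endswith(TERMINATORS):
        return stripped
    return stripped + "."


print(close_prompt("Translate to English: Je t'aime"))
print(close_prompt("Translate to English: Je t'aime. Translation:"))
```

A prompt that already ends in a full stop, question mark, or colon is returned unchanged apart from trailing whitespace.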
 # Training
 ## Model
 - **Architecture:** Same as [bloom-7b1](https://huggingface.co/bigscience/bloom-7b1), also refer to the `config.json` file
 - **Finetuning steps:** 1000
 - **Finetuning tokens:** 4.19 billion
 - **Finetuning layout:** 1x pipeline parallel, 1x tensor parallel, 64x data parallel
 - **Precision:** float16
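The figures in the list above imply a per-step token budget, which the following back-of-the-envelope sketch works out. The sequence length of 2048 tokens is an assumption (it is not stated in this card), so the sequences-per-step figure is only indicative.

```python
FINETUNING_TOKENS = 4_190_000_000  # "4.19 billion" from the list above
FINETUNING_STEPS = 1000            # from the list above
DATA_PARALLEL = 64                 # "64x data parallel" from the list above
ASSUMED_SEQ_LEN = 2048             # assumption: not stated in this card

# Tokens consumed by one optimizer step across all replicas.
tokens_per_step = FINETUNING_TOKENS / FINETUNING_STEPS

# Share of that step handled by each data-parallel replica.
tokens_per_replica_step = tokens_per_step / DATA_PARALLEL

# Approximate number of sequences per step under the assumed length.
sequences_per_step = tokens_per_step / ASSUMED_SEQ_LEN

print(f"tokens per step:             {tokens_per_step:,.0f}")
print(f"tokens per replica per step: {tokens_per_replica_step:,.2f}")
print(f"sequences per step (approx): {sequences_per_step:,.1f}")
```

Because "4.19 billion" is a rounded figure, the derived values are approximate as well.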
 ## Hardware
 - **CPUs:** AMD CPUs with 512GB memory per node
 - **GPUs:** 64 A100 80GB GPUs with 8 GPUs per node (8 nodes) using NVLink 4 inter-gpu connects, 4 OmniPath links
 - **Communication:** NCCL-communications network with a fully dedicated subnet
 ## Software
 - **Orchestration:** [Megatron-DeepSpeed](https://github.com/bigscience-workshop/Megatron-DeepSpeed)
 - **Optimizer & parallelism:** [DeepSpeed](https://github.com/microsoft/DeepSpeed)
 - **Neural networks:** [PyTorch](https://github.com/pytorch/pytorch) (pytorch-1.11 w/ CUDA-11.5)
 - **FP16 if applicable:** [apex](https://github.com/NVIDIA/apex)
 # Evaluation
 We refer to Table 7 from our [paper](https://arxiv.org/abs/2211.01786) & [bigscience/evaluation-results](https://huggingface.co/datasets/bigscience/evaluation-results) for zero-shot results on unseen tasks. The sidebar reports zero-shot performance of the best prompt per dataset config.
 # Citation
 ```bibtex
@article{muennighoff2022crosslingual,
  title={Crosslingual generalization through multitask finetuning},
  author={Muennighoff, Niklas and Wang, Thomas and Sutawika, Lintang and Roberts, Adam and Biderman, Stella and Scao, Teven Le and Bari, M Saiful and Shen, Sheng and Yong, Zheng-Xin and Schoelkopf, Hailey and others},
  journal={arXiv preprint arXiv:2211.01786},
  year={2022}
 }
 ```
--- a/bloomz-7b1.IQ4_NL.gguf
+++ b/bloomz-7b1.IQ4_NL.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:74a9708d150a159c8e3992c9ecfa5b0dc0da0ae8079a234735ff3ec38c840cca
 size 4862618464
--- a/bloomz-7b1.IQ4_XS.gguf
+++ b/bloomz-7b1.IQ4_XS.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:849f94dfc495e38def28df977653dd0538fe1d5956ffe8e3eb41e481e17438a9
 size 4648053600
--- a/bloomz-7b1.Q2_K.gguf
+++ b/bloomz-7b1.Q2_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:f2340bd535ef84823157dbd94258d9a2c101fc73bae937adf8148c259c999760
 size 3436620640
--- a/bloomz-7b1.Q3_K.gguf
+++ b/bloomz-7b1.Q3_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:638e25acccf2f111be20ed2e816a1d589bad7b34399811243449729ad8851ec2
 size 4441975648
--- a/bloomz-7b1.Q3_K_L.gguf
+++ b/bloomz-7b1.Q3_K_L.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:5a57879aa0948340a614e70ff7afa3cbc1a7e8b48666bdea4d56ee40c29d829f
 size 4748159840
--- a/bloomz-7b1.Q3_K_M.gguf
+++ b/bloomz-7b1.Q3_K_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:638e25acccf2f111be20ed2e816a1d589bad7b34399811243449729ad8851ec2
 size 4441975648
--- a/bloomz-7b1.Q3_K_S.gguf
+++ b/bloomz-7b1.Q3_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:a195b87981c47c907959e2303e9345c3406e9fd2faa841f8f99390f6b082c847
 size 3898813280
--- a/bloomz-7b1.Q4_0.gguf
+++ b/bloomz-7b1.Q4_0.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:480aa11dc37478811e81f6f04247b826350e603a9d6348c5655b0ab2a5d1f7f8
 size 4837452640
--- a/bloomz-7b1.Q4_1.gguf
+++ b/bloomz-7b1.Q4_1.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:8368287793a973e69bc26a8d1ff802b0bac7e68d408e80becad2e41bfbc038f2
 size 5279165280
--- a/bloomz-7b1.Q4_K.gguf
+++ b/bloomz-7b1.Q4_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:82038a0b3bd7cd21bcfef8f2e2fa4f18bb13f325045c136788808127503fe8bf
 size 5268417376
--- a/bloomz-7b1.Q4_K_M.gguf
+++ b/bloomz-7b1.Q4_K_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:82038a0b3bd7cd21bcfef8f2e2fa4f18bb13f325045c136788808127503fe8bf
 size 5268417376
--- a/bloomz-7b1.Q4_K_S.gguf
+++ b/bloomz-7b1.Q4_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:d6ef8a7ca382b01bb3f39efc3cfbe300c00b732652c207043e864aa5b9972e76
 size 4862618464
--- a/bloomz-7b1.Q5_0.gguf
+++ b/bloomz-7b1.Q5_0.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:19a837b21f5a9b50e15688b5731d8c3d2556c6f210f7c83b492cab70bfe7f582
 size 5720877920
--- a/bloomz-7b1.Q5_1.gguf
+++ b/bloomz-7b1.Q5_1.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:d7449ff8f736b27aa15a0bd67c55913fb53cff8178ea3f30a13b4a461a68f3e4
 size 6162590560
--- a/bloomz-7b1.Q5_K.gguf
+++ b/bloomz-7b1.Q5_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:6c0d7bfac14d51f105d38f22b79ed0a3b31fe8e872b766ada0b17356847f7ba1
 size 6046198624
--- a/bloomz-7b1.Q5_K_M.gguf
+++ b/bloomz-7b1.Q5_K_M.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:6c0d7bfac14d51f105d38f22b79ed0a3b31fe8e872b766ada0b17356847f7ba1
 size 6046198624
--- a/bloomz-7b1.Q5_K_S.gguf
+++ b/bloomz-7b1.Q5_K_S.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:aabbfcdb63a357cdd4ba7a70951dd8c44b5f85f49c6d710c9d480cb9768c78bc
 size 5720877920
--- a/bloomz-7b1.Q6_K.gguf
+++ b/bloomz-7b1.Q6_K.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:fa83f50f8a1a54fd1038345c8ff32520be8a49e7542f80f83deeb371c148717f
 size 6659517280
--- a/bloomz-7b1.Q8_0.gguf
+++ b/bloomz-7b1.Q8_0.gguf
@@ -0,0 +1,3 @@
 version https://git-lfs.github.com/spec/v1
 oid sha256:d40ad2adf88c6355adb4ee61325eb44c897db10a72db7db87d2a271da402bc39
 size 8620026720