From a5f40ffea95b29eebe220d74e0df951aa786d71b Mon Sep 17 00:00:00 2001
From: ai-modelscope
Date: Wed, 12 Mar 2025 01:34:31 +0800
Subject: [PATCH] Improve model card with pipeline tag and code link (#1)

- Improve model card with pipeline tag and code link (6312a4727d4ea69941f8a6ac5a89a59b44f68977)

Co-authored-by: Niels Rogge
---
 README.md | 11 ++++++-----
 1 file changed, 6 insertions(+), 5 deletions(-)

diff --git a/README.md b/README.md
index 9edf69e..31a300a 100644
--- a/README.md
+++ b/README.md
@@ -1,19 +1,17 @@
 ---
-library_name: transformers
-license: mit
 datasets:
 - HuggingFaceH4/ultrafeedback_binarized
 language:
 - en
+library_name: transformers
+license: mit
+pipeline_tag: text-generation
 ---
-
 # Llama-3-Base-8B-DICE-Iter2
 
 This model was developed using [Bootstrapping Language Models with DPO Implicit Rewards](https://arxiv.org/abs/2406.09760) (DICE) at iteration 2, based on the [princeton-nlp/Llama-3-Base-8B-SFT-DPO](https://huggingface.co/princeton-nlp/Llama-3-Base-8B-SFT-DPO) architecture as the starting point.
 
 
-
-
 ## Links to Other Models
 - [Llama-3-Base-8B-DICE-Iter1](https://huggingface.co/sail/Llama-3-Base-8B-DICE-Iter1)
 - [Llama-3-Base-8B-DICE-Iter2](https://huggingface.co/sail/Llama-3-Base-8B-DICE-Iter2)
@@ -33,6 +31,9 @@ This model was developed using [Bootstrapping Language Models with DPO Implicit
 |[Llama-3-Base-8B-DICE-Iter1](https://huggingface.co/sail/Llama-3-Base-8B-DICE-Iter1) |25.08 |25.77
 |[Llama-3-Base-8B-DICE-Iter2](https://huggingface.co/sail/Llama-3-Base-8B-DICE-Iter2) |**27.55** |**30.99**
 
+## Code
+https://github.com/sail-sg/dice
+
 ## Citation
 
 ```bibtex