Files

146 lines
5.7 KiB
Markdown
Raw Permalink Normal View History

---
library_name: transformers
base_model: google/gemma-2-2b
tags:
- rankalign
- fine-tuned
---
# rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx
Fine-tuned checkpoint from the [rankalign](https://github.com/juand-r/rankalign) project.
## Training Details
| Field | Value |
|-------|-------|
| Base model | `google/gemma-2-2b` |
| Version | v6 |
| Task | `hypernym-concat-bananas-to-dogs-double-all` |
| Epoch | 2 |
| Delta | 0.15 |
| Typicality correction | none |
| Length normalization | True |
| Preference loss weight | 1 |
| NLL validator weight | 1 |
| NLL generator weight | 1 |
| Validator log-odds | False |
| Force same-x | True |
| Semi-supervised ratio | None |
| Labeled-only ratio | None |
## Reproducibility
**Original checkpoint name:** `v6-google--gemma-2-2b-delta0.15-epoch2--hypernym-concat-bananas-to-dogs-double-all--d2g--random--alpha1.0--lenorm--full-completion--nllv1.0--nllg1.0--force-same-x`
To evaluate:
```bash
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-bananas \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-bazookas \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-cabinets \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-cars \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-chairs \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-crows \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-diapers \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-dogs \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-dolls \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-ducklings \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-elephants \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-guns \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-hammers \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-helmets \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-jackets \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-kayaks \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-kites \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
python scripts/eval_by_claude.py \
--model TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-ln-nv1-ng1-fsx \
--task hypernym-mirrors \
--split_type random --gen-shots zero --disc-shots few --validator-log-odds --save-scores-csv \
--self-typicality
```