Initialize project; model provided by the ModelHub XC community
Model: subrit/subrit-legal-gpt2-quecto-v1 Source: Original Platform
36  .gitattributes (vendored, new file)
@@ -0,0 +1,36 @@
*.7z filter=lfs diff=lfs merge=lfs -text
*.arrow filter=lfs diff=lfs merge=lfs -text
*.bin filter=lfs diff=lfs merge=lfs -text
*.bz2 filter=lfs diff=lfs merge=lfs -text
*.ckpt filter=lfs diff=lfs merge=lfs -text
*.ftz filter=lfs diff=lfs merge=lfs -text
*.gz filter=lfs diff=lfs merge=lfs -text
*.h5 filter=lfs diff=lfs merge=lfs -text
*.joblib filter=lfs diff=lfs merge=lfs -text
*.lfs.* filter=lfs diff=lfs merge=lfs -text
*.mlmodel filter=lfs diff=lfs merge=lfs -text
*.model filter=lfs diff=lfs merge=lfs -text
*.msgpack filter=lfs diff=lfs merge=lfs -text
*.npy filter=lfs diff=lfs merge=lfs -text
*.npz filter=lfs diff=lfs merge=lfs -text
*.onnx filter=lfs diff=lfs merge=lfs -text
*.ot filter=lfs diff=lfs merge=lfs -text
*.parquet filter=lfs diff=lfs merge=lfs -text
*.pb filter=lfs diff=lfs merge=lfs -text
*.pickle filter=lfs diff=lfs merge=lfs -text
*.pkl filter=lfs diff=lfs merge=lfs -text
*.pt filter=lfs diff=lfs merge=lfs -text
*.pth filter=lfs diff=lfs merge=lfs -text
*.rar filter=lfs diff=lfs merge=lfs -text
*.safetensors filter=lfs diff=lfs merge=lfs -text
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.tar.* filter=lfs diff=lfs merge=lfs -text
*.tar filter=lfs diff=lfs merge=lfs -text
*.tflite filter=lfs diff=lfs merge=lfs -text
*.tgz filter=lfs diff=lfs merge=lfs -text
*.wasm filter=lfs diff=lfs merge=lfs -text
*.xz filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
subrit_legal_gpt2_q8.gguf filter=lfs diff=lfs merge=lfs -text
124  README.md (new file)
@@ -0,0 +1,124 @@
---
license: cc-by-sa-4.0
language:
- en
library_name: transformers
tags:
- legal
- india
- law
- gpt2
- gguf
- quecto
---
# ⚖️ Subrit's Legal AI (Quecto V1)

**Model:** `subrit-legal-gpt2-quecto-v1`
**Author:** Subrit Dikshit
**License:** CC BY-SA 4.0

This is a specialized miniature **Legal AI**, trained from scratch and fine-tuned on the **Indian Penal Code (IPC)**, **CrPC**, and the **Constitution of India**. It runs efficiently on consumer hardware (CPUs) using GGUF quantization.
## ⚠️ Limitations & Disclaimer

* **Model Architecture:** This model uses a custom GPT-2 configuration defined from scratch. It is not a fine-tune of the `gpt2-small` checkpoint, but it reuses the standard `GPT2LMHeadModel` structure for compatibility. It performs best on simple definition and punishment questions.
* **Reasoning Limits:** Due to its small size, it is **not** capable of complex reasoning, multi-turn logic, or lawyer-level argumentation.
* **Hallucinations:** Like all small language models (SLMs), this model can hallucinate: it may generate plausible-sounding but incorrect information. **Always verify specific section numbers and punishments against official legal texts.**
* **Usage:** This is a research prototype for educational purposes. It is **not** a substitute for professional legal advice.
## 📦 Model Details

* **Architecture:** Custom GPT-2 configuration, trained from scratch (a construction sketch follows this list).
* **Training Data:** Indian legal texts (IPC, CrPC, Constitution).
* **Formats Included:**
  * **PyTorch:** Standard non-quantized weights for GPU inference or further fine-tuning (~500 MB).
  * **GGUF (Q8_0):** 8-bit quantized weights for fast CPU/edge inference (~130 MB).
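A minimal sketch of that from-scratch setup, assuming only the values visible in this commit's `config.json`; this is not the author's actual training code:

```python
# Reconstruction sketch: define the custom GPT-2 config from scratch
# (values taken from the config.json shipped in this commit)
from transformers import GPT2Config, GPT2LMHeadModel

config = GPT2Config(
    vocab_size=50258,  # 50257 stock GPT-2 BPE tokens + <|padding|>
    n_positions=1024,
    n_embd=768,
    n_layer=12,
    n_head=12,
)
model = GPT2LMHeadModel(config)  # randomly initialized, i.e. "from scratch"
print(model.num_parameters())    # roughly 124M parameters
```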
## 👨‍💻 Credits & Attribution

This model was trained by **Subrit Dikshit**.

* **Training Data:** [Techmaestro369/indian-legal-texts-finetuning](https://huggingface.co/datasets/Techmaestro369/indian-legal-texts-finetuning) (CC BY-SA 4.0).
* **Base Model:** Custom GPT-2 configuration (trained from scratch).
## 🚀 How to Run (Demo Script)

This repository contains **two versions** of the model. Choose the one that fits your needs.

### 🔧 Prerequisites

* **Python:** 3.10 or 3.11 is recommended.
* **OS:** Windows, macOS, or Linux.
### Option 1: Run the PyTorch Version (Standard HF)

*Use this if you are using the standard `transformers` library or have a GPU.*

**Requires:** `pip install transformers torch`

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# 1. Load the model and tokenizer from Hugging Face
model_name = "subrit/subrit-legal-gpt2-quecto-v1"
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPT2LMHeadModel.from_pretrained(model_name)

# 2. Ask a question using the "Question: ... Answer:" training format
input_text = "Question: What is Article 14 of the Constitution?\nAnswer:"
inputs = tokenizer(input_text, return_tensors="pt")

# 3. Generate and decode the answer
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
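The snippet above decodes greedily. The shipped `config.json` enables sampling for the text-generation pipeline by default; to sample explicitly, parameters can be passed to `generate()`. The temperature and top_p values below are illustrative assumptions, not tuned recommendations for this model:

```python
# Optional: sampled decoding (temperature/top_p are illustrative assumptions)
outputs = model.generate(
    **inputs,
    max_new_tokens=50,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```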
### Option 2: Run the GGUF Version (Recommended for Speed/CPU)

*Use this if you want to run the model on a laptop CPU without a GPU.*

**Requires:** `pip install llama-cpp-python huggingface_hub`

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# 1. Download the GGUF file
model_path = hf_hub_download(
    repo_id="subrit/subrit-legal-gpt2-quecto-v1",
    filename="subrit_legal_gpt2_q8.gguf"
)

# 2. Load the engine
llm = Llama(model_path=model_path, n_ctx=512, verbose=False)

# 3. Ask a question
question = "What is the punishment for murder under Section 302?"
output = llm(f"Question: {question}\nAnswer:", max_tokens=60, stop=["Question:", "\n"])

print(output['choices'][0]['text'])
```
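Building on the `llm` object above, a minimal interactive loop might look like the sketch below; it keeps the `Question: ... Answer:` prompt format used throughout this card:

```python
# Minimal CPU chat loop around the GGUF model (illustrative sketch)
while True:
    q = input("Legal question (or 'quit'): ").strip()
    if q.lower() == "quit":
        break
    out = llm(f"Question: {q}\nAnswer:", max_tokens=60, stop=["Question:", "\n"])
    print(out["choices"][0]["text"].strip())
```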
If you use this model, please cite it as follows:

```bibtex
@misc{dikshit2025legalgpt2,
  author       = {Dikshit, Subrit},
  title        = {Subrit's Legal AI (Quecto V1): A Quantized GPT-2 Fine-Tune on Indian Law},
  year         = {2025},
  publisher    = {Hugging Face},
  journal      = {Hugging Face Model Hub},
  howpublished = {\url{https://huggingface.co/subrit/subrit-legal-gpt2-quecto-v1}}
}
```
Acknowledgements:

```bibtex
@dataset{indian_legal_texts,
  author    = {Gupta, Akshat (Techmaestro369)},
  title     = {Indian Legal Texts Finetuning Dataset},
  year      = {2024},
  publisher = {Hugging Face},
  url       = {https://huggingface.co/datasets/Techmaestro369/indian-legal-texts-finetuning}
}

@article{radford2019language,
  title  = {Language Models are Unsupervised Multitask Learners},
  author = {Radford, Alec and Wu, Jeffrey and Child, Rewon and Luan, David and Amodei, Dario and Sutskever, Ilya},
  year   = {2019}
}
```
3  added_tokens.json (new file)
@@ -0,0 +1,3 @@
{
  "<|padding|>": 50257
}
38  config.json (new file)
@@ -0,0 +1,38 @@
{
  "activation_function": "gelu_new",
  "architectures": [
    "GPT2LMHeadModel"
  ],
  "attn_pdrop": 0.1,
  "bos_token_id": 50256,
  "dtype": "float32",
  "embd_pdrop": 0.1,
  "eos_token_id": 50256,
  "initializer_range": 0.02,
  "layer_norm_epsilon": 1e-05,
  "model_type": "gpt2",
  "n_ctx": 1024,
  "n_embd": 768,
  "n_head": 12,
  "n_inner": null,
  "n_layer": 12,
  "n_positions": 1024,
  "reorder_and_upcast_attn": false,
  "resid_pdrop": 0.1,
  "scale_attn_by_inverse_layer_idx": false,
  "scale_attn_weights": true,
  "summary_activation": null,
  "summary_first_dropout": 0.1,
  "summary_proj_to_labels": true,
  "summary_type": "cls_index",
  "summary_use_proj": true,
  "task_specific_params": {
    "text-generation": {
      "do_sample": true,
      "max_length": 50
    }
  },
  "transformers_version": "4.57.6",
  "use_cache": true,
  "vocab_size": 50258
}
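As a sanity check on the configuration above, a back-of-envelope parameter count (an illustrative calculation using the standard GPT-2 layer layout, not an official figure) lands at roughly 124M parameters, consistent with the ~498 MB float32 `model.safetensors` pointer below:

```python
# Back-of-envelope parameter count for the config above (illustrative).
# Per block: 12*d^2 weights (attention + MLP) plus ~13*d biases/layer norms;
# embeddings: vocab*d + n_positions*d; the LM head is tied to the embeddings.
vocab, n_pos, d, n_layer = 50258, 1024, 768, 12
per_block = 12 * d * d + 13 * d
total = vocab * d + n_pos * d + n_layer * per_block + 2 * d  # + final layer norm
print(f"{total:,} params")     # 124,440,576
print(f"{total * 4:,} bytes")  # ~497.8 MB in float32
```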
6  generation_config.json (new file)
@@ -0,0 +1,6 @@
{
  "_from_model_config": true,
  "bos_token_id": 50256,
  "eos_token_id": 50256,
  "transformers_version": "4.57.6"
}
50001  merges.txt (new file)
File diff suppressed because it is too large
3  model.safetensors (new file, Git LFS pointer)
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:d9b637fd8b7a915a5d466efc1a01ec44196bf2cdb78ef81025b0e7e820e3d7bd
size 497777280
24  special_tokens_map.json (new file)
@@ -0,0 +1,24 @@
{
  "bos_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": "<|padding|>",
  "unk_token": {
    "content": "<|endoftext|>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  }
}
3  subrit_legal_gpt2_q8.gguf (new file, Git LFS pointer)
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:edcba6e62992ac41c834469d05452bfe08883db10e404b061d2ee1c68825db31
size 136659936
250315  tokenizer.json (new file)
File diff suppressed because it is too large
29  tokenizer_config.json (new file)
@@ -0,0 +1,29 @@
{
  "add_prefix_space": false,
  "added_tokens_decoder": {
    "50256": {
      "content": "<|endoftext|>",
      "lstrip": false,
      "normalized": true,
      "rstrip": false,
      "single_word": false,
      "special": true
    },
    "50257": {
      "content": "<|padding|>",
      "lstrip": false,
      "normalized": false,
      "rstrip": false,
      "single_word": false,
      "special": true
    }
  },
  "bos_token": "<|endoftext|>",
  "clean_up_tokenization_spaces": false,
  "eos_token": "<|endoftext|>",
  "extra_special_tokens": {},
  "model_max_length": 1024,
  "pad_token": "<|padding|>",
  "tokenizer_class": "GPT2Tokenizer",
  "unk_token": "<|endoftext|>"
}
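The tokenizer files above show how the padding token extends the stock GPT-2 vocabulary: `added_tokens.json` maps `<|padding|>` to id 50257, and `config.json` reports a `vocab_size` of 50258. A hedged reconstruction of that wiring (it mirrors the shipped tokenizer files; it is not the author's original preprocessing script):

```python
# Reconstruction: add <|padding|> to the stock GPT-2 tokenizer (assumption:
# this mirrors the shipped tokenizer files, not the author's actual code)
from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
tokenizer.add_special_tokens({"pad_token": "<|padding|>"})  # assigned id 50257
print(len(tokenizer))  # 50258, matching vocab_size in config.json

# The model's embedding matrix must then grow to match:
# model.resize_token_embeddings(len(tokenizer))
```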
3  training_args.bin (new file, Git LFS pointer)
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:a61963e0d34d671fd9469e8f492e86c19ef26f40bd74c69a3e9e2d59e0e8d8b5
size 5905
1  vocab.json (new file)
File diff suppressed because one or more lines are too long