DocWain-14B-v2-unified/README.md

---
license: apache-2.0
language:
  - en
tags:
  - document-intelligence
  - rag
  - extraction
  - enterprise
  - docwain
pipeline_tag: text-generation
base_model: muthugsubramanian/DocWain-14B-v2
---

# DocWain-14B-v2-unified (FP16)

DocWain is an **enterprise document intelligence agent** built for extraction,
analysis, comparison, and grounded response generation over user-uploaded
document profiles. This **unified** variant has identity, capability awareness,
and behavioural discipline (verbatim quoting, refusal on missing data, currency
preservation, anti-tailoring) baked into the weights via a focused LoRA SFT
finetune on synthetic data.

## What's in this release

- **Format:** FP16
- **Base model:** muthugsubramanian/DocWain-14B-v2 (vision-grafted Qwen3-14B)
- **Identity:** baked-in — model self-identifies as DocWain regardless of
  system prompt
- **Behaviour:** trained to quote verbatim from evidence, say "not specified
  in the documents" rather than fabricate, preserve currency symbols (₹/£/$),
  and refuse to add skills/education/experience that aren't in the source

## Capabilities

- Accurate extraction from invoices, contracts, resumes, policies, research
  papers, and other enterprise document types
- Document intelligence — summaries, key findings, cross-document
  relationships, anomaly surfacing
- Layout and context understanding — tables, charts, multi-page references
- Grounded response generation with verbatim quoting and explicit
  "not specified" handling
- Document generation — structured reports, comparison tables, executive
  briefs derived from the user's documents

## Training data

Synthetic-only per project policy. The training corpus contains:
- Identity / persona examples (no customer data)
- Capability awareness Q&A
- Synthetic invoices / contracts / resumes / research-paper snippets paired
  with ideal grounded responses
- Domain-mismatch refusal examples
- General-instruction mix-in to preserve breadth

No customer documents, no scraped private data.

## Recommended runtime

| Variant | Runtime | GPU floor |
|---------|---------|-----------|
| FP16 | vLLM, transformers | A100 80GB |
| AWQ INT4 | vLLM `--quantization compressed-tensors` | 16GB+ |
| GGUF Q5_K_M | Ollama / llama.cpp | 16GB GPU or CPU |
| GGUF Q4_K_M | Ollama / llama.cpp | 12GB GPU or CPU |

## Prompting

A short system prompt is enough at runtime — identity is in the weights:

```
You are DocWain — an enterprise document intelligence agent.
```

For full behaviour (RAG-aware, currency-preserving, anti-tailoring),
provide your standard DocWain system prompt; the model will respect both
its baked-in identity and the prompt-specified rules.