ModelHub XC 2821cd81cd 初始化项目,由ModelHub XC社区提供模型
Model: AS-SiliconMind/SiliconMind-V1-Qwen3-4B-T-2507-76k
Source: Original Platform
2026-06-09 05:04:28 +08:00

license, license_link, language, base_model, pipeline_tag, tags
license license_link language base_model pipeline_tag tags
apache-2.0 https://huggingface.co/AS-SiliconMind/SiliconMind-V1-Qwen3-4B-T-2507-76k/blob/main/LICENSE
en
Qwen/Qwen3-4B-Thinking-2507
text-generation
verilog
reasoning
multi-agent

SiliconMind Logo

SiliconMind-V1-Qwen3-4B-T-2507-76k: Multi-Agent Distillation and Debug-Reasoning Workflows for Verilog Code Generation

Model Overview

SiliconMind-V1 is a family of open-source Large Language Models (LLMs) specialized for Verilog code generation, testing, and debugging. Unlike previous approaches that rely heavily on commercial models or external EDA tools, SiliconMind-V1 is locally fine-tuned to iteratively generate, test, and debug RTL designs through test-time scaling.

The SiliconMind-V1 models are enabled by a unified multi-agent framework for reasoning-oriented training data generation with integrated testbench-driven verification to achieve state-of-the-art functional correctness on major benchmarks.

This releaseSiliconMind-V1-Qwen3-4B-T-2507-76k — is fine-tuned from Qwen3-4B-Thinking-2507 on an expanded 76k-sample training mix (vs. the 36k mix used for the original SiliconMind-V1-Qwen3-4B-T-2507 release). Benchmark numbers for this variant will be added once evaluation completes.

Key Features:

  • Reasoning-Oriented: Trained to "think" before coding, producing reasoning traces that guide functional correctness.
  • Self-Testing & Debugging: Capable of generating its own test report to fix bugs without tool-calling.
  • Tool-Free Verification: Reduces reliance on expensive, proprietary EDA software during the generation loop.
  • Multi-Strategy Inference: Supports Regular, Deep Thinking, and Agentic inference modes for scalable performance.

Model Variants

We provide SiliconMind-V1 variants fine-tuned from the following base models:

Model Name Base Model Size
SiliconMind-V1-Qwen2.5-C-7B-I Qwen2.5-Coder-7B-Instruct 7B
SiliconMind-V1-Qwen3-4B-T-2507 Qwen3-4B-Thinking-2507 4B
SiliconMind-V1-Qwen3-4B-T-2507-76k (this model — 76k training mix) Qwen3-4B-Thinking-2507 4B
SiliconMind-V1-Qwen3-8B Qwen3-8B 8B
SiliconMind-V1-Olmo-3-7B-Think Olmo-3-7B-Think 7B

Model Sources

Usage & Inference Strategies

SiliconMind-V1 is designed to work with three distinct inference strategies, allowing users to trade off between latency/cost and accuracy. Please refer to our inference engine for more details on how to get started with SiliconMind-V1.

1. Regular Strategy

The model acts as a standard code generator but is prompted to produce a reasoning trace before the final code.

  • Best for: Quick prototyping and simple modules.

2. Deep Thinking Strategy

Explicit instructions are given to the model to solve the problem by:

  1. Drafting an initial solution.
  2. Mentally "testing" it against scenarios.
  3. Self-debugging within the reasoning trace.
  • Best for: Complex logic where single-pass generation often fails.

A multi-turn workflow where the model plays different "Agent" roles sequentially:

  1. Solution Agent: Generates initial code + reasoning.
  2. Test Agent: Generates a test report for the code.
  3. Debug Agent: Reviews the test report and fixes errors.
  • Performance: Achieves the highest pass rates (Pass@1) by allowing iterative refinement (up to 3 interactions recommended).

Training

This variant was trained on an expanded Multi-Faceted Dataset constructed via a custom two-phase pipeline:

  • Code Generation Phase: A multi-agent system (Revision, Solution, Testbench, Verification Agents) synthesized 76k functionally verified (problem, reasoning, code, testbench) tuples from public sources.

  • Self-Correction Phase: The model was stress-tested against these problems. Hard samples (where the model failed) were augmented with "Test" and "Debug" curriculum, teaching the model how to write test reports and fix its own errors.

Evaluation

Benchmark results for this 76k variant on RTLLM-v2, VerilogEval-v2, VerilogEval-v2-NTU, and CVDP-cid02&03 are pending and will be added to this card once evaluation completes. For the original 36k variant's numbers, see the SiliconMind-V1-Qwen3-4B-T-2507 model card.

License

SiliconMind-V1 is licensed under Apache 2.0.
The base model's license: Qwen3-4B-Thinking-2507.

Acknowledgements

We acknowledge the financial support from Academia Sinica's SiliconMind Project (AS-IAIA-114-M11). We also thank the National Center for High-Performance Computing (NCHC) for providing computational and storage resources, and Taipei-1 for providing H100 computing resources. In addition, we acknowledge financial support from the National Science and Technology Council.

Citation

BibTeX:

@misc{Chen2026SiliconMindV1,
  title  = {{SiliconMind-V1}: Multi-Agent Distillation and Debug-Reasoning Workflows for Verilog Code Generation},
  author = {Mu-Chi Chen and Yu-Hung Kao and Po-Hsuan Huang and Shao-Chun Ho 
            and Hsiang-Yu Tsou and I-Ting Wu and En-Ming Huang 
            and Yu-Kai Hung and Wei-Po Hsin and Cheng Liang 
            and Chia-Heng Tu and Shih-Hao Hung and H.T. Kung},
  year   = {2026},
  url    = {https://AS-SiliconMind.github.io/SiliconMind-V1}
}
Description
Model synced from source: AS-SiliconMind/SiliconMind-V1-Qwen3-4B-T-2507-76k
Readme 13 MiB
Languages
Jinja 100%