[Bugs] Fix Docs Build Problem (#97)

* [Bugs] Docs fixed

* Update contributing.md

* Update index.md

* fix lua to text

* fix title size
Xinyu Dong
2026-01-10 05:55:40 +08:00
committed by GitHub
parent 8c9cabd760
commit 7be26ca617
17 changed files with 721 additions and 151 deletions


@@ -1,10 +1,10 @@
-## Operator accuracy test
+# Operator accuracy test
-### torch_xray
+## torch_xray
torch_xray is an operator precision analysis tool that can dump module-level input-output precision comparisons and automatically construct operator unit tests.
-#### 1.Download and install
+### 1.Download and install
**python3.10:**
@@ -20,9 +20,9 @@ bos:/klx-sdk-release-public/xpytorch/dev_kl3/torch_xray/latest/torch_xray-999.9.
Note that the same installation package must be used when using it in different environments.
-#### 2.Use
+### 2.Use
-##### Dump module-level inputs and outputs and compare their precision.
+#### Dump module-level inputs and outputs and compare their precision.
Below is a sample code snippet used to dump the input and output of the vision module and compare the errors in the vllm framework.
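The dump step described here can be sketched in plain Python; the wrapper below is a hypothetical illustration of recording a module's inputs and output per call, not the actual torch_xray hook API:

```python
def make_dump_hook(records, name):
    """Wrap a callable so every call appends its inputs and output to records."""
    def wrapper(fn):
        def inner(*args, **kwargs):
            out = fn(*args, **kwargs)
            records.append({"module": name, "inputs": args, "output": out})
            return out
        return inner
    return wrapper

records = []

@make_dump_hook(records, "vision.proj")
def proj(x, scale):
    # Stand-in for a vision submodule's forward pass.
    return [v * scale for v in x]

proj([1.0, 2.0], 0.5)
```

A real dump hook would serialize the recorded tensors (e.g. into the h5 file mentioned below) instead of keeping them in memory.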
@@ -50,7 +50,7 @@ The results directory will generate an h5 file and a csv file.
-rw-r--r-- 1 root root 71 Oct 31 13:11 globalrank-0_localrank-0_summary.csv
```
-##### Data processing
+#### Data processing
```bash
summary xxx.h5 sum.txt
@@ -91,7 +91,7 @@ The generated h5 file is processed using the summary command to generate a txt f
+-------+------+------+-----------------------------------------------------------+-------------+-------------+--------------+-------------+
```
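Conceptually, the summary step reduces each dumped record to a few statistics and renders them as a text table; a minimal sketch (the real `summary` tool's columns and layout may differ):

```python
def summarize(records):
    """Render per-module min/max/mean statistics as an ASCII table."""
    header = f"| {'module':<12} | {'min':>8} | {'max':>8} | {'mean':>8} |"
    sep = "+" + "-" * (len(header) - 2) + "+"
    lines = [sep, header, sep]
    for name, values in records.items():
        lines.append(
            f"| {name:<12} | {min(values):>8.4f} | {max(values):>8.4f}"
            f" | {sum(values) / len(values):>8.4f} |"
        )
    lines.append(sep)
    return "\n".join(lines)

table = summarize({"vision.proj": [0.1, 0.2, 0.3]})
```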
-##### Accuracy Comparison
+#### Accuracy Comparison
```bash
# The results are stored in result.csv
@@ -103,7 +103,7 @@ The `compare` command is used to process the H5 files generated on the GPU and X
If you encounter a "no matched keys" problem, please refer to the instructions at the end of this article for a solution.
-##### Example of results
+#### Example of results
```bash
+-------+--------+-----------------------------------------------------------+--------+-----------+-------------+-------------+--------+
@@ -141,11 +141,11 @@ If you encounter a "no matched keys" problem, please refer to the instructions a
Generally, the main focus is on Min Err/Max Err.
-##### Indicator Explanation
+#### Indicator Explanation
To be improved...
-#### The dump operator is tested and run.
+### The dump operator is tested and run.
```bash
X_DEBUG=0x102 # trace operator name, arguments shape, dtype, data_range
@@ -199,13 +199,13 @@ This is the file directory.
│ ├── dump.json # Information needed to generate unit tests, such as input/output size and dtype.
```
-##### Generate unit test
+#### Generate unit test
jprof --cpu_init --blacklist --factory=load dump.json
Create a pytests directory in the current directory to store unit tests.
-##### Run unit test
+#### Run unit test
The GPU only needs to copy the XPU's pytests directory and execute it.
@@ -216,14 +216,14 @@ Since the unit test program defaults to finding the actual dumped tensors using
pytest --detail_compare_path=./xxx.csv proc_xxx/pytests/ --seed 42
```
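A generated unit test of this kind boils down to loading a dumped reference and asserting closeness; a hypothetical sketch, not actual jprof output:

```python
def allclose(a, b, atol=1e-3):
    """Element-wise closeness check, mirroring what a generated test asserts."""
    return len(a) == len(b) and all(abs(x - y) <= atol for x, y in zip(a, b))

# Hypothetical dumped reference and freshly computed result for one operator.
reference = [0.1, 0.2, 0.3]
actual = [0.1001, 0.2, 0.2999]

def test_operator_matches_dump():
    assert allclose(actual, reference)

test_operator_matches_dump()
```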
-##### Results Comparison
+#### Results Comparison
```bash
# After obtaining two result CSV files, compare them and generate result.csv.
summary_diff_check ./xpu.csv ./gpu.csv ./result.csv
```
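The comparison amounts to joining the two CSVs on operator name and computing per-operator deltas; a rough stdlib sketch (the `op` and `max_err` column names are assumptions, not the tool's schema):

```python
import csv
import io

def diff_check(xpu_csv, gpu_csv):
    """Join two result CSVs on the 'op' column and compute the error delta."""
    xpu = {r["op"]: float(r["max_err"]) for r in csv.DictReader(io.StringIO(xpu_csv))}
    gpu = {r["op"]: float(r["max_err"]) for r in csv.DictReader(io.StringIO(gpu_csv))}
    return [
        {"op": op, "xpu": xpu[op], "gpu": gpu[op], "delta": abs(xpu[op] - gpu[op])}
        for op in sorted(xpu.keys() & gpu.keys())  # only ops present in both files
    ]

xpu_csv = "op,max_err\nmatmul,0.012\nsoftmax,0.001\n"
gpu_csv = "op,max_err\nmatmul,0.010\nsoftmax,0.001\n"
result = diff_check(xpu_csv, gpu_csv)
```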
-##### Example of results
+#### Example of results
```bash
+------------+-----------------------+-------------+-------------+-----------+----------+---------+---------+----------+
@@ -242,9 +242,9 @@ summary_diff_check ./xpu.csv ./gpu.csv ./result.csv
The main focus is on the values of gpu_1e-1, xpu_1e-1, etc., which represent the number of elements whose error between the gpu/xpu result and the cpu result exceeds the order of 1e-n. This serves as the primary basis for determining whether there is a problem with the operator's precision.
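The gpu_1e-n / xpu_1e-n columns can be reproduced with a simple count; a minimal sketch, assuming the error is the absolute difference against the CPU reference:

```python
def exceed_counts(device_out, cpu_out, exponents=(1, 2, 3)):
    """Count elements whose |device - cpu| error exceeds 1e-n for each n."""
    errs = [abs(d - c) for d, c in zip(device_out, cpu_out)]
    return {f"1e-{n}": sum(e > 10.0 ** -n for e in errs) for n in exponents}

cpu = [1.0, 2.0, 3.0, 4.0]
gpu = [1.0, 2.2, 3.05, 4.0]
counts = exceed_counts(gpu, cpu)
```

Here one element (error 0.2) exceeds 1e-1, and two elements (errors 0.2 and 0.05) exceed 1e-2 and 1e-3.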
-#### Replenish
+### Replenish
-##### Bypassing the issue of differing naming conventions between Kunlun Card and GPU modules, which prevents diff calculation.
+#### Bypassing the issue of differing naming conventions between Kunlun Card and GPU modules, which prevents diff calculation.
```bash
#


@@ -1,8 +1,8 @@
-## Overall accuracy test
+# Overall accuracy test
-### EvalScope
+## EvalScope
-#### 1.Download and install
+### 1.Download and install
EvalScope supports use in Python environments. Users can install EvalScope via pip or from source code. Here are examples of both installation methods:
@@ -15,7 +15,7 @@ cd evalscope
pip install -e '.[perf]'
```
-#### 2.Dataset preparation script
+### 2.Dataset preparation script
```python
from evalscope.collections import CollectionSchema, DatasetInfo, WeightedSampler
@@ -88,20 +88,24 @@ if not os.path.exists(output_dir): # Step 4: Check if the directory exists
# dump the mixed data to a jsonl file
dump_jsonl_data(mixed_data, output_path) # Step 6: Securely write to the file
```
Dataset composition visualization:
```
┌───────────────────────────────────────┐
│ VL-Test (1000 samples) │
├─────────────────┬─────────────────────┤
│ PureText │ Vision │
-│ (333 样本) │ (667 样本)
+│  (333 samples)  │    (667 samples)    │
├─────────────────┼─────────────────────┤
│ • mmlu_pro │ • math_vista │
│ • ifeval │ • mmmu_pro │
│ • gsm8k │ │
└─────────────────┴─────────────────────┘
```
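The weighted mix shown above (roughly 1:2 PureText to Vision) can be sketched without the EvalScope helpers; the source list, weights, and field names below are illustrative assumptions:

```python
import json
import random

def mix_datasets(sources, total, seed=0):
    """Draw `total` samples across sources in proportion to their weights."""
    rng = random.Random(seed)
    weight_sum = sum(w for _, w, _ in sources)
    mixed = []
    for name, weight, datasets in sources:
        k = round(total * weight / weight_sum)  # this source's share of the mix
        mixed.extend({"source": name, "dataset": rng.choice(datasets)} for _ in range(k))
    rng.shuffle(mixed)
    return mixed

sources = [
    ("PureText", 1, ["mmlu_pro", "ifeval", "gsm8k"]),
    ("Vision", 2, ["math_vista", "mmmu_pro"]),
]
mixed = mix_datasets(sources, total=1000)
jsonl = "\n".join(json.dumps(m) for m in mixed)  # payload for the .jsonl file
```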
-#### 3.Test
+### 3.Test
```python
from dotenv import dotenv_values
@@ -134,13 +138,14 @@ task_cfg = TaskConfig(
run_task(task_cfg=task_cfg)
```
Parameter Tuning Guide:
-| Parameter | Current value | Effect | Adjustment suggestions |
-| ----------------- | ------ | --------------- | ----------------------- |
-| `temperature` | 0.6 | Control output diversity | Math problems ↓ 0.3 / Creative writing ↑ 0.9 |
-| `top_p` | 0.95 | Filtering low-probability tokens | Reduce "nonsense" |
-| `eval_batch_size` | 5 | Number of requests processed in parallel | With sufficient video memory, it can be increased to 10. |
+| Parameter | Current value | Effect | Adjustment suggestions |
+| ----------------- | ------------- | ---------------------------------------- | -------------------------------------------------------- |
+| `temperature` | 0.6 | Control output diversity | Math problems ↓ 0.3 / Creative writing ↑ 0.9 |
+| `top_p` | 0.95 | Filtering low-probability tokens | Reduce "nonsense" |
+| `eval_batch_size` | 5 | Number of requests processed in parallel | With sufficient video memory, it can be increased to 10. |
Run the test:
@@ -167,20 +172,22 @@ python accuracy.py 2>&1 | tee "$LOG_FILE"
# ========================================
EXIT_CODE=${PIPESTATUS[0]}
if [ $EXIT_CODE -eq 0 ]; then
-echo "✅ 评测完成! 日志已保存到: $LOG_FILE"
+echo "✅ Evaluation completed! Log saved to: $LOG_FILE"
else
-echo "❌ 评测失败! 退出码: $EXIT_CODE 请查看日志: $LOG_FILE"
+echo "❌ Evaluation failed! Exit code: $EXIT_CODE Please check the log: $LOG_FILE"
fi
```
-#### 4.Common problem fixes
-##### 4.1 NLTK resource missing fix
+### 4.Common problem fixes
+#### 4.1 NLTK resource missing fix
```bash
Resource punkt_tab not found.
```
Solution
```python
import nltk
import os
@@ -193,13 +200,13 @@ os.makedirs(download_dir, exist_ok=True)
nltk.data.path.append(download_dir)
# Step 3: Download necessary resources
-print("🔽 开始下载punkt_tab资源...")
+print("🔽 Start downloading punkt_tab resource...")
try:
nltk.download("punkt_tab", download_dir=download_dir)
-print("✅ 下载成功!")
+print("✅ Download successful!")
except Exception as e:
-print(f"❌ 下载失败: {e}")
-print("💡 备选方案:手动从GitHub下载")
+print(f"❌ Download failed: {e}")
+print("💡 Alternative: Download manually from GitHub")
print(
" URL: https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/tokenizers/punkt_tab.zip"
)
@@ -218,7 +225,7 @@ python fix_nltk.py
bash run_accuracy_test.sh
```
-#### 5.Results Display
+### 5.Results Display
```bash
+-------------+---------------------+--------------+---------------+-------+