remove cutex dependency (#3422)
This commit is contained in:
@@ -10,10 +10,6 @@
|
|||||||
"\n",
|
"\n",
|
||||||
"**Note:** Currently, Speculative Decoding in SGLang does not support radix cache.\n",
|
"**Note:** Currently, Speculative Decoding in SGLang does not support radix cache.\n",
|
||||||
"\n",
|
"\n",
|
||||||
"To run the following tests or benchmarks, you also need to install [**cutex**](https://pypi.org/project/cutex/): \n",
|
|
||||||
"\n",
|
|
||||||
"`pip install cutex`\n",
|
|
||||||
"\n",
|
|
||||||
"### Performance Highlights\n",
|
"### Performance Highlights\n",
|
||||||
"\n",
|
"\n",
|
||||||
"- Official EAGLE code ([SafeAILab/EAGLE](https://github.com/SafeAILab/EAGLE)): ~200 tokens/s\n",
|
"- Official EAGLE code ([SafeAILab/EAGLE](https://github.com/SafeAILab/EAGLE)): ~200 tokens/s\n",
|
||||||
|
|||||||
@@ -19,9 +19,6 @@ pip install torch_memory_saver --force-reinstall
|
|||||||
|
|
||||||
pip install transformers==4.45.2 sentence_transformers accelerate peft
|
pip install transformers==4.45.2 sentence_transformers accelerate peft
|
||||||
|
|
||||||
# For compling eagle kernels
|
|
||||||
pip install cutex
|
|
||||||
|
|
||||||
# For compling xgrammar kernels
|
# For compling xgrammar kernels
|
||||||
pip install cuda-python nvidia-cuda-nvrtc-cu12
|
pip install cuda-python nvidia-cuda-nvrtc-cu12
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user