Commit Graph

  • 562fa673e5 [Bugfix] Exclude collect_env.py from CODESPELL check in format.sh (#240) Shanshan Shen 2025-03-04 17:14:00 +08:00
  • 503f5045ff [ModelRunner] Remove redundant profile_run() in model runner (#224) Shanshan Shen 2025-03-04 16:58:33 +08:00
  • ae49bfd13a [Core] Support pooling (#229) wangxiyuan 2025-03-04 15:59:34 +08:00
  • 8fda31cafe [Doc] Update Feature Support doc (#234) Shanshan Shen 2025-03-04 14:18:32 +08:00
  • b9f0e25c16 [Misc] Add collect_env.py scripts for bug reporting (#175) Shanshan Shen 2025-03-04 14:14:37 +08:00
  • 839dac8d60 Install wget to fix image build (#231) Yikun Jiang 2025-03-04 09:01:23 +08:00
  • b64ee7d346 [Dist] Set device as rank (#202) Mengqing Cao 2025-03-03 09:23:13 +08:00
  • ebe14f20cf Recover vllm-ascend dev image (#209) Yikun Jiang 2025-03-03 09:08:41 +08:00
  • 6e358c4bef Add Document Branch Policy (#217) Yikun Jiang 2025-03-03 09:07:39 +08:00
  • 46740958f2 Add ray to docker image (#197) Yikun Jiang 2025-02-28 15:23:18 +08:00
  • 81dfaae88b Bump docker/setup-buildx-action from 2 to 3 (#191) dependabot[bot] 2025-02-28 09:06:46 +08:00
  • a710a7563a Bump docker/setup-qemu-action from 2 to 3 (#192) dependabot[bot] 2025-02-28 09:06:13 +08:00
  • a5564ed5d8 Bump actions/setup-python from 5.3.0 to 5.4.0 (#193) dependabot[bot] 2025-02-27 20:05:15 +08:00
  • 14bca9911a [CI] Fix unsolved bugs caused by pta api change. (#190) whx 2025-02-27 19:52:28 +08:00
  • 6aed83335c [CI] Add dependabot support and labeler workflow (#162) Yuanhao Ji 2025-02-27 19:46:31 +08:00
  • 03dc5c01fd [Doc] update multinode doc (#181) Mengqing Cao 2025-02-27 19:29:49 +08:00
  • 1715230867 [CI] Upgrade to newest pta.(MLA and FusedMoE) (#189) HongtaoYang 2025-02-27 18:50:52 +08:00
  • c131e43e7d [Worker]Lazy import torch_npu (#184) Li Wang 2025-02-27 16:52:11 +08:00
  • 6042c210bc [CI] upgrade to newest pta (#187) wangxiyuan 2025-02-27 16:40:23 +08:00
  • fd18ae6494 [MOE] fix #176 (#179) Mengqing Cao 2025-02-27 14:21:08 +08:00
  • ee43179767 [ModelRunner] Fix cuda hard code in model runner (#155) Shanshan Shen 2025-02-27 14:16:46 +08:00
  • 94cd66bba7 [CI][UT]enable multimodal ut (#158) zouyida2002 2025-02-27 14:14:43 +08:00
  • 94483775e1 [CI] fix hf_token (#180) Mengqing Cao 2025-02-26 17:29:31 +08:00
  • 1c238b930d [worker] remove unused assertion (#161) Mengqing Cao 2025-02-26 16:11:36 +08:00
  • 78530c0667 [CI/Build] add HF_TOKEN for model downloading (#173) Mengqing Cao 2025-02-26 15:35:03 +08:00
  • 7776f2e6a4 [ModelRunner] remove padding for vlm inputs (#150) Mengqing Cao 2025-02-26 10:26:39 +08:00
  • 79fbb20b4d [ModelRunner] remove unused args (follow vllm changes) (#159) Mengqing Cao 2025-02-25 17:51:09 +08:00
  • 51ae37b22a [Doc] update readme (#147) wangxiyuan 2025-02-25 11:00:58 +08:00
  • 3a7882208f [CI] enable test if pytest.ini changes (#151) Mengqing Cao 2025-02-24 16:47:05 +08:00
  • d0b3cb4fa7 modify:Eliminate redundant operations in the code to improve performance (#137) Yaphets24 2025-02-22 17:43:42 +08:00
  • 202b39a38c Ray Worker Ops Optimization (#136) Chenguang Li 2025-02-21 22:45:15 +08:00
  • 386817b4d1 [Model Runner][Performance] Cache the jugement result of is_encoder_decoder to decrease framework overhead (#138) whx 2025-02-21 22:43:11 +08:00
  • d21b3be685 Mark v0.7.1 as unmaintained and v0.7.3 as maintained (#139) Yikun Jiang 2025-02-21 22:41:44 +08:00
  • 72a43a61d8 [Docs] Add issue template (#113) Yikun Jiang 2025-02-21 17:20:21 +08:00
  • dd425d68f8 [Platform] add dispatch key (#17) Mengqing Cao 2025-02-21 17:10:30 +08:00
  • 5f465010de [Core] Cherry pick from 0.7.1 to keep the main code newest (#127) wangxiyuan 2025-02-21 17:07:37 +08:00
  • 36991b2052 [CI] enable CI on all branch (#124) Mengqing Cao 2025-02-21 16:16:48 +08:00
  • fd2cc1b883 [Docs] Add Tutorials for Online Serving on Multi Machine (#120) HongtaoYang 2025-02-21 11:03:00 +08:00
  • 3a4ce2aa15 [Docs] Fix vllm and vllm-ascend version (#107) Yikun Jiang 2025-02-20 11:05:35 +08:00
  • cff03a4913 [CI] change to quay.io (#102) wangxiyuan 2025-02-19 17:04:46 +08:00
  • fafd70e91c [Doc] Update doc to work with release (#85) wangxiyuan 2025-02-19 09:51:43 +08:00
  • 17de078d83 [Docs] Add dynamic version in docs (#90) Yikun Jiang 2025-02-19 08:57:27 +08:00
  • c18fb09b55 [MISC] set default model to qwen in example (#87) Mengqing Cao 2025-02-18 17:09:59 +08:00
  • 8ea8523744 reset default block_size from 16 to 128 (#84) Huazhong Ji 2025-02-18 14:19:38 +08:00
  • 7606977739 [Doc] Add release note (#59) wangxiyuan 2025-02-18 11:20:06 +08:00
  • 7cc024a2d3 [Docs] Refeactor installation doc (#78) Yikun Jiang 2025-02-17 22:12:07 +08:00
  • 7c8bdc3a18 [Doc] Update tutorials (#79) Shanshan Shen 2025-02-17 22:11:04 +08:00
  • 2a678141d4 [Doc] Add vllm-ascend usage doc & fix doc format (#53) Shanshan Shen 2025-02-17 18:37:29 +08:00
  • c935b7006c [doc] fix feature support (#70) Mengqing Cao 2025-02-17 15:43:37 +08:00
  • 36ea38fde5 [CI]add file to pytest.ini (#61) Niuya 2025-02-17 14:26:04 +08:00
  • a6f91f70b7 [Doc] Add versioning_policy doc (#62) Yikun Jiang 2025-02-17 14:13:28 +08:00
  • 4544e99d88 [dist] revert communicator patch (#66) Mengqing Cao 2025-02-17 11:42:33 +08:00
  • bfbfbce184 [CI] Add container image build ci (#64) Yikun Jiang 2025-02-17 09:07:35 +08:00
  • c1ac822642 [CI] Switch to cann latest version (#63) Yikun Jiang 2025-02-16 13:38:01 +08:00
  • b88443b6c6 [dist] fix communicator patch (#58) Mengqing Cao 2025-02-14 10:45:49 +08:00
  • e264987af2 [Doc] Add install doc (#49) wangxiyuan 2025-02-14 10:22:15 +08:00
  • 46977f9f06 [Doc] Add sphinx build for vllm-ascend (#55) Yikun Jiang 2025-02-13 18:44:17 +08:00
  • 63b11ec7e9 [Doc] Add Quickstart doc (#44) Yikun Jiang 2025-02-13 16:29:36 +08:00
  • c8b57d10b2 [Misc] update the dependency version of torch-npu (#50) Huazhong Ji 2025-02-12 15:50:38 +08:00
  • 28d7691361 [FOLLOWUP][Misc] Remove unused mypy config for base_communicator (#45) Yikun Jiang 2025-02-12 09:17:05 +08:00
  • 86796cf2dd [Misc][Build] Fix packages for finding submodule (#42) Mengqing Cao 2025-02-11 20:17:43 +08:00
  • f762ee89cc [Communicator] Add monkey patch (#30) wangxiyuan 2025-02-11 19:15:35 +08:00
  • eb189aac81 Followup fix on official doc update (#34) Yikun Jiang 2025-02-11 14:28:26 +08:00
  • 51eadc68b9 [Docs] Add official doc index (#29) wangxiyuan 2025-02-11 12:00:27 +08:00
  • 7006835977 [attn] fix device of tensors in attention (#25) Mengqing Cao 2025-02-10 19:20:29 +08:00
  • c59375caff [Misc] version control by setuptools_scm (#21) wangxiyuan 2025-02-10 09:36:09 +08:00
  • 88714969d4 [Doc] Replace logo official link and update contrib doc (#22) Yikun Jiang 2025-02-08 15:01:03 +08:00
  • 8fc5dc966a [Worker] Register mindie_turbo while initializing NPUWorker (#13) whx 2025-02-07 16:47:17 +08:00
  • 4495fc6838 bugfix for mrope (#14) zouyida2002 2025-02-07 16:46:24 +08:00
  • 7d9ae22ecb [CI] use pytest.ini to manage vllm native tests (#5) Mengqing Cao 2025-02-06 23:57:51 +08:00
  • 8cb5615fb0 [Doc]Add chinese doc (#10) Li Wang 2025-02-06 14:49:43 +08:00
  • a48b9addef [Doc] Update Readme (#11) wangxiyuan 2025-02-06 14:08:44 +08:00
  • bfccf739e2 [ModelRunner] Refactor model_runner for NPU (#6) Shanshan Shen 2025-02-06 09:04:18 +08:00
  • d5e7756028 [Core] Init vllm-ascend (#3) Yikun Jiang 2025-02-05 10:53:12 +08:00
  • eb283428dd Initial commit Simon Mo 2025-01-29 02:44:13 -08:00