Skip to content

ENH: support vllm mtp & rope scaling#4454

Merged
qinxuye merged 8 commits intoxorbitsai:mainfrom
ZhikaiGuo960110:fix/vllm_mtp_ropeScaling
Jan 10, 2026
Merged

ENH: support vllm mtp & rope scaling#4454
qinxuye merged 8 commits intoxorbitsai:mainfrom
ZhikaiGuo960110:fix/vllm_mtp_ropeScaling

Conversation

@ZhikaiGuo960110
Copy link
Copy Markdown
Collaborator

@ZhikaiGuo960110 ZhikaiGuo960110 commented Jan 8, 2026

fix #4453
add two configs and parse logic for extracting dict from webui input json,
now user can add speculative_config or rope_scaling if needed from webui directly, launch more easily.

@XprobeBot XprobeBot added the bug Something isn't working label Jan 8, 2026
@XprobeBot XprobeBot added this to the v1.x milestone Jan 8, 2026
tonyguo added 2 commits January 8, 2026 09:04
@qinxuye qinxuye changed the title fix:vllm_mtp_ropeScaling ENH: support vllm mtp & rope scaling Jan 9, 2026
@XprobeBot XprobeBot added enhancement New feature or request and removed bug Something isn't working labels Jan 9, 2026
Comment thread xinference/model/llm/vllm/core.py Outdated
@ZhikaiGuo960110
Copy link
Copy Markdown
Collaborator Author

fix rope_scaling issue

@ZhikaiGuo960110
Copy link
Copy Markdown
Collaborator Author

also fix an error in read gpu count

Copy link
Copy Markdown
Contributor

@qinxuye qinxuye left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@qinxuye qinxuye merged commit 6fef085 into xorbitsai:main Jan 10, 2026
12 of 14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

用webui 方式启动vllm 模型,vllm 不支持传入rope scaling 和 mtp 模式参数

4 participants