Implemented Qwen3.5 #61

Draft
alcoftTAO wants to merge 1 commit into JamePeng:main from TAO71-AI:main

Conversation

@alcoftTAO

This PR adds support for Qwen3.5 models. It has not been tested yet because I lack the compute power to run them.

The PR will stay a draft until I can test it with smaller models and check and fix the chat template and the parameters of the Qwen35ChatHandler class.

I still need to decide which parameters are useful.

@JamePeng JamePeng force-pushed the main branch 5 times, most recently from 76d8272 to 68eacae Compare February 19, 2026 14:03
@JamePeng
Owner

Detailed adaptation work can be done after Qwen3.5-9B-Instruct and Qwen3.5-35B-A3B-Instruct are released.
Indeed, the currently open-sourced Qwen3.5 model is too large.


2 participants