feat: support auto-resolved template model IDs by xichengpro · Pull Request #216 · modelscope/twinkle

xichengpro · 2026-06-03T06:57:12Z

PR type

Bug Fix
New Feature
Document Updates
More Models or Datasets Support

PR information

Add template_init_model_id to SupportedModel so that the target model ID used for template initialization can be configured.
Add the template_model_id helper module, which prefers an explicit model ID and otherwise resolves it from server capabilities.
Update MultiLoraTransformersModel.set_template and vLLMSampler.set_template to use the new resolution flow instead of passing the model ID directly.
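
The resolution order described above can be sketched roughly as follows. This is a minimal illustration, not the PR's code: the `SupportedModel` dataclass, the `resolve_template_model_id` signature, and the in-memory `capabilities` dict (standing in for the server query) are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class SupportedModel:
    # Hypothetical stand-in for the server-side SupportedModel entry.
    model_name: str
    template_init_model_id: Optional[str] = None


def resolve_template_model_id(
    model_name: str,
    explicit_model_id: Optional[str],
    capabilities: Dict[str, SupportedModel],
) -> str:
    """Pick the model ID used to initialize the template."""
    # 1. An explicit model ID from the caller always wins.
    if explicit_model_id:
        return explicit_model_id
    # 2. Otherwise use the ID the server declares for this model name.
    supported = capabilities.get(model_name)
    if supported and supported.template_init_model_id:
        return supported.template_init_model_id
    # 3. Fall back to the model name itself.
    return model_name
```

With a capability table such as `{'my-route': SupportedModel('my-route', 'org/base-model')}`, a call without an explicit ID would resolve `'my-route'` to `'org/base-model'`, while an unknown name would be returned unchanged.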

gemini-code-assist

Code Review

This pull request introduces a mechanism to resolve template model IDs dynamically by querying server capabilities. It adds a utility function resolve_template_model_id and integrates it into client template setup methods, while also updating the server launcher to merge configured supported models with those derived from application configs. Feedback highlights a bug where self.model_id retains the twinkle:// prefix in MultiLoraTransformersModel, and suggests improvements such as using model_dump() or dict() for Pydantic model normalization, calling raise_for_status() before parsing JSON responses, and adding exception handling to gracefully handle unreachable servers.

gemini-code-assist · 2026-06-03T06:58:49Z

+    if hasattr(item, 'model_name'):
+        model_name = getattr(item, 'model_name', None)
+        if not model_name:
+            return None
+        template_init_model_id = getattr(item, 'template_init_model_id', None)
+        return {
+            'model_name': model_name,
+            'template_init_model_id': template_init_model_id,
+        }


When normalizing a supported model item, if the item is a Pydantic model (which has the model_name attribute), extracting only model_name and template_init_model_id will silently drop any other fields that might be present on the model. To ensure extensibility and preserve all fields, we should check if the item has a model_dump or dict method and use that to serialize it to a dictionary.

Suggested change

-    if hasattr(item, 'model_name'):
-        model_name = getattr(item, 'model_name', None)
-        if not model_name:
-            return None
-        template_init_model_id = getattr(item, 'template_init_model_id', None)
-        return {
-            'model_name': model_name,
-            'template_init_model_id': template_init_model_id,
-        }
+    if hasattr(item, 'model_name'):
+        if hasattr(item, 'model_dump'):
+            return item.model_dump()
+        if hasattr(item, 'dict'):
+            return item.dict()
+        model_name = getattr(item, 'model_name', None)
+        if not model_name:
+            return None
+        template_init_model_id = getattr(item, 'template_init_model_id', None)
+        return {
+            'model_name': model_name,
+            'template_init_model_id': template_init_model_id,
+        }

Fixed, thanks.
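
The normalization idea in this thread could look roughly like the sketch below. It is a variant rather than the exact suggested change: it checks `model_name` before serializing, and the function name `normalize_supported_model` is hypothetical. The objects are duck-typed, so no Pydantic import is needed to follow the logic.

```python
from typing import Any, Dict, Optional


def normalize_supported_model(item: Any) -> Optional[Dict[str, Any]]:
    """Convert a supported-model entry into a plain dict, keeping all fields."""
    # Entries without a usable model_name are skipped.
    if not getattr(item, 'model_name', None):
        return None
    # Pydantic v2 models expose model_dump(); v1 models expose dict().
    for method_name in ('model_dump', 'dict'):
        method = getattr(item, method_name, None)
        if callable(method):
            return method()
    # Plain objects: fall back to the two fields known from the PR.
    return {
        'model_name': item.model_name,
        'template_init_model_id': getattr(item, 'template_init_model_id', None),
    }
```

Trying `model_dump` first avoids the deprecation warning that Pydantic v2 emits for `dict()`, while still working on v1 models.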

gemini-code-assist · 2026-06-03T06:58:49Z

+        response = http_get(f'{base_url}/twinkle/get_server_capabilities')
+        _SERVER_CAPABILITIES_CACHE[base_url] = GetServerCapabilitiesResponse(**response.json())


The HTTP response from http_get is parsed directly via response.json() without calling response.raise_for_status(). If the request fails (e.g., due to a 404 or 500 error), this can lead to unhelpful or cryptic errors (like JSONDecodeError or ValidationError). Calling raise_for_status() first ensures that HTTP errors are surfaced clearly.

Suggested change

-        response = http_get(f'{base_url}/twinkle/get_server_capabilities')
-        _SERVER_CAPABILITIES_CACHE[base_url] = GetServerCapabilitiesResponse(**response.json())
+        response = http_get(f'{base_url}/twinkle/get_server_capabilities')
+        response.raise_for_status()
+        _SERVER_CAPABILITIES_CACHE[base_url] = GetServerCapabilitiesResponse(**response.json())

Fixed, thanks.

gemini-code-assist · 2026-06-03T06:58:49Z

+    supported_model = get_supported_model_by_name(model_name)
+    if supported_model and supported_model.template_init_model_id:
+        return supported_model.template_init_model_id


If the server is temporarily unreachable or running an older version that does not support the /twinkle/get_server_capabilities endpoint, get_supported_model_by_name will raise an exception and crash the client's set_template call. Wrapping this in a try-except block ensures defensive programming and allows the client to gracefully fall back to using model_name directly.

Suggested change

-    supported_model = get_supported_model_by_name(model_name)
-    if supported_model and supported_model.template_init_model_id:
-        return supported_model.template_init_model_id
+    try:
+        supported_model = get_supported_model_by_name(model_name)
+        if supported_model and supported_model.template_init_model_id:
+            return supported_model.template_init_model_id
+    except Exception:
+        pass

Fixed, thanks.
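
One possible variation on this fallback logs the failure instead of silently passing, which can make an unreachable or outdated server easier to diagnose. The sketch below is an assumption-laden illustration, not what was merged: `resolve_with_fallback` and the `lookup` callable are hypothetical names standing in for the real capability query.

```python
import logging
from typing import Callable, Optional

logger = logging.getLogger(__name__)


def resolve_with_fallback(
    model_name: str,
    lookup: Callable[[str], Optional[str]],
) -> str:
    """Return the server-declared template model ID, or model_name on failure."""
    try:
        template_id = lookup(model_name)
    except Exception as exc:  # Broad on purpose: any lookup failure falls back.
        logger.warning('Capability lookup failed for %s: %s', model_name, exc)
        return model_name
    # A missing or empty declaration also falls back to the model name.
    return template_id or model_name
```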

Yunnglin · 2026-06-03T08:39:40Z

Hi @xichengpro,

Thanks — the capability-declaration flow is a nice cleanup, and the self.model_id ordering fix in __init__ is a real latent bug catch.

Heads up: #210 is a parallel server-config refactor that reshapes the launcher (_deploy_application now takes a typed ApplicationSpec, self.config becomes ServerConfig). The refactor already predeclared ServerArgs.supported_models, so the data shape lines up.

Since the launcher helpers here would need to be rewritten against the typed config, we'll fold the intent into #210 directly with Co-authored-by: attribution, and close this PR once #210 merges. Flagging here first so it's not a surprise.

Sampler's set_template now auto-injects self.model_id when the caller omits it, consistent with how the model backend already overrides model_id with self.tokenizer_id. This eliminates the need for clients to manually resolve the HF model ID when the route name differs from the underlying model. Also fixes two client-generator bugs: model client assigned self.model_id before stripping the twinkle:// prefix, and sampler client was missing self.model_id entirely.

Co-authored-by: xichengpro <188454548+xichengpro@users.noreply.github.com>
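
The two behaviours mentioned in this commit message (stripping the twinkle:// prefix before assigning self.model_id, and injecting self.model_id when the caller omits it) can be illustrated with a small sketch. The `SamplerClient` class, the `template_cls` parameter, and the returned payload shape are hypothetical and only model the ID handling.

```python
from typing import Optional

TWINKLE_PREFIX = 'twinkle://'


class SamplerClient:
    """Hypothetical client; only the model_id handling is modelled."""

    def __init__(self, model_id: str):
        # Strip the prefix first, then assign, so self.model_id never carries it.
        if model_id.startswith(TWINKLE_PREFIX):
            model_id = model_id[len(TWINKLE_PREFIX):]
        self.model_id = model_id

    def set_template(self, template_cls: str, model_id: Optional[str] = None) -> dict:
        # Inject self.model_id when the caller does not pass one explicitly.
        return {'template_cls': template_cls, 'model_id': model_id or self.model_id}
```

Assigning after stripping is the ordering point raised in the review: if the assignment came first, every later use of self.model_id would still include the prefix.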

feat: support auto-resolved template model IDs

1a08b4a

gemini-code-assist Bot reviewed Jun 3, 2026

fix

6611a07

xichengpro force-pushed the main branch from 877ec3e to 6611a07 Compare June 3, 2026 08:56

xichengpro wants to merge 2 commits into modelscope:main from xichengpro:main

2 participants
