Pull requests: modelscope/ms-swift
[megatron] fix: make bridge exported cloned weights store on CPU
#6714
opened Nov 21, 2025 by HollowMan6
1 of 4 tasks
Implement NPU_ENV handling in init.py to support megatron in NPU.
#6661
opened Nov 19, 2025 by vx120
1 task done
Add npu fused operators supported in modeling_qwen2
#6610
opened Nov 15, 2025 by tongtong0613
4 tasks
Add conditional distillation support for GKD trainer
#6542
opened Nov 11, 2025 by woshixiaobai2019
3 tasks
Add Tensor Input Support: Enable .pt file processing with <tensor> tags for latent representations
#6504
opened Nov 9, 2025 by Marshall-mk
1 of 4 tasks
[Fix Bug] Enhance ProgressCallbackNew to initialize training bar with current step
#6415
opened Nov 3, 2025 by YushunXiang
1 of 4 tasks
feat: Enable for exporting unmerged HF Lora Adapter
#6225
opened Oct 20, 2025 by jason9693
1 of 4 tasks
bug fix: RuntimeError when training GRPO with LoRA and PtEngine
#5645
opened Sep 3, 2025 by chenjianhuii
1 of 4 tasks
Bug fix: eval OOM due to deepcopy of torch model
#5607
opened Aug 29, 2025 by hellopahe
1 task done