This issue has been automatically marked as inactive due to lack of recent activity. Should you believe it remains unresolved and warrants attention, kindly leave a comment on this thread.
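What is the dimension order of the tensors in the Qwen large models?
Is the shape [s, b, h] or [b, s, h]?
Why was it designed this way?
hidden_states: the hidden-state tensor passed into this layer, with shape [s, b, h], where s is the sequence length, b is the batch size, and h is the hidden dimension.
Thanks.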
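
To make the two layouts in the question concrete, here is a minimal sketch using NumPy. It does not say which layout Qwen uses internally. The sizes `s`, `b`, `h` and the helper name `to_batch_first` are hypothetical and chosen only for illustration. It shows that [s, b, h] and [b, s, h] hold the same values and differ only in the order of the first two axes.

```python
import numpy as np

# Hypothetical sizes, for illustration only.
s, b, h = 4, 2, 3  # sequence length, batch size, hidden dimension

# A sequence-first tensor with shape [s, b, h].
hidden_sbh = np.arange(s * b * h, dtype=np.float32).reshape(s, b, h)


def to_batch_first(x: np.ndarray) -> np.ndarray:
    """Swap the first two axes: [s, b, h] -> [b, s, h].

    transpose() returns a view with permuted strides; ascontiguousarray()
    copies it so the result is laid out contiguously in memory.
    """
    return np.ascontiguousarray(x.transpose(1, 0, 2))


hidden_bsh = to_batch_first(hidden_sbh)

# Token t of sample n is the same vector in both layouts.
t, n = 2, 1
same = np.array_equal(hidden_sbh[t, n], hidden_bsh[n, t])
print(hidden_sbh.shape, hidden_bsh.shape, same)
```

In the sequence-first layout, indexing the first axis selects one time step across the whole batch. In the batch-first layout, indexing the first axis selects one complete sample. Which one a given codebase expects has to be checked against that code's own docstrings.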