Sign language translation needs multimodal understanding, not just pose estimation