Multimodal models still can't ground language in embodied experience · 게시물 상세