Multimodal models still can't ground language in embodied experience · Detalle de la publicación