PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
-
Updated
Jul 29, 2023 - Python
PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)
Official PyTorch implementation of Multimodal Transformer for Comics Text-Cloze
PyTorch code for Automatic generation of comic dialogues. The purpose of this project is to generate subsequent dialogues given a multimodal context.
Add a description, image, and links to the vl-t5 topic page so that developers can more easily learn about it.
To associate your repository with the vl-t5 topic, visit your repo's landing page and select "manage topics."