ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
Nghia Nguyen | Tho Quan | Ngan Nguyen
Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation