Le, Xinh, et al. “Applying Multimodal Large Language Models for Visual Question Answering: Toward Vietnamese Educational Reasoning Systems.” International Journal of Innovative Computing, vol. 16, no. 1, June 2026, pp. 109-14, doi:10.11113/ijic.v16n1.680.