Citation:
Raisi, Z. , Naiel, M. A. , Younes, G. , Wardell, S. , & Zelek, J. . (2021). 2lspe: 2d learnable sinusoidal positional encoding using transformer for scene text recognition. In 2021 18th Conference on Robots and Vision (CRV) (pp. 119–126). IEEE.