Diewald, Kupietz & Lüngen (2022)
- Diewald, Nils/Kupietz, Marc/Lüngen, Harald (2022): Tokenizing on Scale - Preprocessing large text corpora on the lexical and sentence level
- In: Klosa-Kückelhaus, Annette/Engelberg, Stefan/Möhrs, Christine/Storjohann, Petra (Eds.), Dictionaries and Society. Proceedings of the XX EURALEX International Congress, July 12–16, 2022, Mannheim. IDS-Verlag, pp. 208–221.