Automatic text analysis ; Sampling; Language corpus; Sprachkorpus; Datensammlung; Data collection; Textkorpus (method.) ; Text corpus (method.) ; Corpus linguistics ; Automatische Sprachanalyse (synt.) ; Sentence length (language statistics) ; Zweitspracherwerb (Textproduktion) ; Second language acquisition (text production) ; Text production (second language acquisition) ; Complexity (synt.) ; Clause connection; Sentence connection ; English ; Mandarin; Kantonesisch; Cantonese; Putonghua; Sinitic; Hakka; Yue; Chinese