Journal article

Bibliographic description
Author Baroni, Marco; Bernardini, Silvia; Ferraresi, Adriano; Zanchetta, Eros
Title The WaCky wide web: a collection of very large linguistically processed web-crawled corpora
Written in English
Source Journal Language resources and evaluation. - Dordrecht [et al.] : Springer
Volume 43
Year 2009
Issue 3
Page 209-226
Classification
Domains / Computational linguistics / Individual aspects (computational linguistics) / Automatic annotation
Domains / Computational linguistics / Linguistic data processing / Corpus linguistics
Domains / Computational linguistics / Linguistic data processing / Corpus linguistics / Individual corpora / Language corpus (German) / deWaC
Domains / Computational linguistics / Linguistic data processing / Corpus linguistics / Individual corpora / Language corpus (English) / BNC
Domains / Computational linguistics / Linguistic data processing / Corpus linguistics / Individual corpora / Language corpus (English) / ukWaC Corpus
Domains / Computational linguistics / Linguistic data processing / Corpus linguistics / Individual corpora / Language corpus (Italian) / itWaC
Domains / Methodology / Evaluation criterion (method.)
Domains / Pragmalinguistics / Communication research / Internet
Indo-European languages / Germanic / German
Indo-European languages / Germanic / English
Indo-European languages / Romance / Italian
Redirected from
Automatic annotation; Sampling; Language corpus; Sprachkorpus; Datensammlung; Data collection; Textkorpus (method.); Text corpus (method.); Corpus linguistics; deWaC; BNC; itWaC; Validity (method.); Evaluation criterion (method.); Netzsprache; Netspeak; German; English; Italian
Subject terms
La Repubblica (Italian newspaper)