Cronfa Electroneg o Gymraeg - Electronic Corpus of Welsh
I found the Cronfa Electroneg o Gymraeg online while looking for Welsh corpora and thought it would be good to have it available in CWB/CQP format for use on the University of Georgia's corpus server. So I wrote a couple of scripts that convert the corpus from its original format to CWB/CQP format, with annotations for sentences and text metadata.
The scripts, as well as the compiled CWB/CQP version of the corpus, are available on my GitLab: CEG-CWB @ GitLab
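For anyone unfamiliar with the target format, here is a minimal sketch of what "CWB/CQP format with annotations for sentences and text metadata" roughly looks like. This is not the code from the repository. It is an illustration that assumes the corpus has already been tokenized into Python lists, and the function name `to_vrt`, the metadata key `id`, and the sample text are all made up for the example. It writes the verticalized layout that CWB takes as input: one token per line, with `<text>` and `<s>` tags marking text and sentence boundaries.

```python
from xml.sax.saxutils import quoteattr


def to_vrt(texts):
    """Render texts as CWB-style verticalized text (one token per line).

    `texts` is an iterable of (metadata, sentences) pairs, where metadata
    is a dict of attribute names to values and sentences is a list of
    token lists. This input shape is an assumption made for the sketch,
    not the original CEG file format.
    """
    lines = []
    for metadata, sentences in texts:
        # Text-level metadata becomes attributes on the <text> tag.
        attrs = "".join(
            f" {key}={quoteattr(str(value))}" for key, value in metadata.items()
        )
        lines.append(f"<text{attrs}>")
        for sentence in sentences:
            # Each sentence is wrapped in <s>...</s>.
            lines.append("<s>")
            # Tokens are assumed not to contain markup characters.
            lines.extend(sentence)
            lines.append("</s>")
        lines.append("</text>")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical sample: one text containing one short sentence.
    sample = [({"id": "demo1"}, [["Bore", "da", "."]])]
    print(to_vrt(sample), end="")
```

A file in this layout is the kind of input that CWB's `cwb-encode` tool indexes, after which the corpus can be queried in CQP. The exact attributes declared for the real corpus are defined by the scripts in the repository, not by this sketch.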