Annotation Tools for Historical German
Tools for Early New High German
ENHGTagger (Download, 797M)
- Annotations: part of speech, lemma
- trained on: Potsdam ENHG treebank ReF.RUB
- uses: pretrained BERT model dbmdz/convbert-base-german-europeana-cased
- also uses: lemmatizer component of the RNNTagger
- The POS tags of the training data have been semi-automatically extended with
morphological features and lemmas using ReF.RUB and ReF.MLU.
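For readers who want to inspect the pretrained encoder named above on its own, the following is a minimal sketch. It assumes the Hugging Face `transformers` library and PyTorch are installed and that the model is fetched from the Hugging Face hub under the identifier given above. It is not the ENHGTagger's own interface: the helper names `load_encoder` and `embed` are hypothetical, and the tagger's classification layers and lemmatizer are not included.

```python
# Sketch only: loads the base encoder, not the ENHGTagger itself.
MODEL_NAME = "dbmdz/convbert-base-german-europeana-cased"


def load_encoder(model_name: str = MODEL_NAME):
    """Return (tokenizer, model) for the pretrained encoder (hypothetical helper)."""
    # Imported lazily so the module can be imported without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    return tokenizer, model


def embed(words, tokenizer, model):
    """Encode one pre-tokenized sentence and return its contextual subword vectors."""
    import torch

    # is_split_into_words tells the tokenizer the input is already a word list.
    inputs = tokenizer(words, is_split_into_words=True, return_tensors="pt")
    with torch.no_grad():
        return model(**inputs).last_hidden_state
```

Calling `load_encoder()` downloads the model weights on first use; `embed(["Das", "ist", "ein", "Satz"], tokenizer, model)` then returns one vector per subword token, which a tagger would feed into its own classification layer.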
ENHGParser (Download, 1043M)
Tools for Middle High German
MHGTagger (Download, 169M)
MHGParser (Download, 1008M)
Please send questions, comments, suggestions and bug reports to Helmut
Schmid at LastName@cis.lmu.de.