MorphGNT.org

CCAT MorphGNT

This file is derived from the morphologically parsed GNT provided by UPenn's CCAT. James Tauber reformatted it for easier text processing, converted it to UTF-8 and corrected many errors he found over the last ten years while performing a number of linguistic analyses.

This file is made available under a Creative Commons by-nc-sa license. For attribution purposes, please credit CCAT and James Tauber.

Downloads

Each is about a megabyte.

Explanation of Format

First column is the book/chapter/verse. (Note that shorter ending of Mark appears after longer)

Second column is the part of speech:

Third column has eight slots for parse codes:

Fourth column is the form that appears in the UBS3/NA26 text.

Fifth column is the lemma or dictionary form.

This page last modified Saturday 19 August, 2006 by James Tauber