Ngram Search

Query:

Level:

Score:

Subject:

Format


In the web corpus, written samples are digitized and achieved in CHAT format, as illustrated below.

Figure 1. A written sample in CHAT format(Click the picture to enlarge)

The headers of each file are metadata, followed by the writing sample. CLAWS POS is tagged in a tier right below.