Format
In the web corpus, written samples are digitized and archived in CHAT format, as illustrated below.
Figure 1. A written sample in CHAT format
The headers of each file contain metadata and are followed by the writing sample. CLAWS part-of-speech (POS) tags are given in a tier directly below it.
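To make the layout concrete, here is a minimal sketch of how such a file could be read programmatically. It relies only on the general CHAT convention that header lines begin with `@`, main lines with `*`, and dependent tiers with `%`. The function name `parse_chat`, the speaker code `TXT`, the tier name `%pos`, and the sample text are all hypothetical, chosen for illustration; the corpus's actual headers and tier names may differ. The sketch also assumes each record fits on one line and that words and tags align one to one.

```python
def parse_chat(text):
    """Split CHAT-style text into header pairs and main lines with their tiers.

    Hypothetical helper: assumes one record per line and skips
    tab-indented continuation lines.
    """
    headers = []      # list of (name, value) pairs from "@" lines
    utterances = []   # one dict per "*" main line
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line:
            continue
        if line.startswith("@"):
            # Header, e.g. "@Languages:\teng" or a bare "@Begin"
            name, _, value = line[1:].partition(":")
            headers.append((name.strip(), value.strip()))
        elif line.startswith("*"):
            # Main line holding the writing sample itself
            speaker, _, content = line[1:].partition(":")
            utterances.append(
                {"speaker": speaker.strip(), "text": content.strip(), "tiers": {}}
            )
        elif line.startswith("%") and utterances:
            # Dependent tier attached to the most recent main line
            tier, _, content = line[1:].partition(":")
            utterances[-1]["tiers"][tier.strip()] = content.strip()
    return headers, utterances


# Invented sample; the tier name "pos" and the tags are illustrative only.
SAMPLE = "\n".join([
    "@Begin",
    "@Languages:\teng",
    "*TXT:\tI like reading books .",
    "%pos:\tPPIS1 VV0 VVG NN2 .",
    "@End",
])

headers, utterances = parse_chat(SAMPLE)
first = utterances[0]
# Pair each token with its tag, assuming one-to-one alignment.
tagged = list(zip(first["text"].split(), first["tiers"]["pos"].split()))
```

Keeping the tags in a separate tier, rather than inline with the words, leaves the original writing untouched and lets the annotation be read or ignored independently.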