The tables below were constructed as follows. Some details are included for convenient reference, and I didn't expect you to include them in your assignments.
|Toy text||Types||Tokens||Type/token ratio|
|a a a||1||3||0.33|
|a a b||2||3||0.67|
|a b c||3||3||1.00|
Instead of ranking the texts absolutely, as I did, some of you ranked the texts with respect to the maximum value for the relevant property. For instance, given a maximum mean word length of 6.06, a text with a mean word length of 3.03 would be associated with 0.50 (3.03/6.06). Similarly, given a maximum type/token ratio of 0.91, the text with the minimum type/token ratio would be associated with 0.27 (0.24/0.91). The simple ranking procedure seems to me more intuitive as the lowest value is close to 0, and the highest close to 1. With the other procedure, the maximum is necessarily 1 (max/max), but the minimum doesn't have to be particularly close to 0. However, either procedure yields a number between 0 and 1, which is both sensible and easy to interpret.
Incidentally, the means for the two time periods, calculated on the basis of the percentages for the individual texts, are 0.20 (early) and 0.59 (late). The means for the two time periods, calculated on the basis of the aggregate data, are 0.19 (early) and 0.61 (late). So as it turns out, it wouldn't matter much which type of average we use. However, following Warner, I used the median, as it is less sensitive to extreme values.
|1500-1575, low lexical complexity||rows 2 to 39|
|1500-1575, high lexical complexity||rows 40 to 86|
|1600-1719, low lexical complexity||rows 108 to 172|
|1600-1719, high lexical complexity||rows 173 to 230|
|Table 1: Occurrence of do in texts of low versus high lexical complexity 1500-1575|
|High DO %||15||10||25|
|Low DO %||15||10||25|
|no negative sentences||8||27||35|
|Mean DO %||0.17 |
|Table 2: Occurrence of do in texts of low versus high lexical complexity 1600-1719|
|High DO %||30||13||43|
|Low DO %||29||16||45|
|no negative sentences||6||29||35|
|Mean DO %||0.65 |
|Table 3: Median date of composition for texts of low versus high lexical complexity 1500-1719|
|Median date of composition||Lexical complexity|