Experts in vocabulary acquisition in Latin and Greek agree that a desirable goal for the end of the second year of college study is mastery of the vocabulary at about the 80% level, that is, the set of lemmas that generates 80% of the word forms in the corpus of available texts (Muccigrosso, 2004, p. 416; Major, 2008, p. 7).
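To make the "80% level" concrete, here is a minimal sketch of how such a threshold could be computed from a table of lemma counts. The function name `lemmas_needed` and the toy counts are hypothetical illustrations, not data or methods from the cited studies.

```python
from itertools import accumulate


def lemmas_needed(counts, target=0.80):
    """Return the smallest number of top-ranked lemmas whose combined
    occurrences reach `target`, a fraction of all word tokens."""
    ranked = sorted(counts.values(), reverse=True)  # most frequent first
    total = sum(ranked)
    for n, running in enumerate(accumulate(ranked), start=1):
        if running >= target * total:
            return n
    return len(ranked)  # only reached when `counts` is empty


# Invented toy counts (lemma -> occurrences); NOT real corpus figures.
toy_counts = {"et": 40, "sum": 25, "qui": 20, "in": 8, "non": 4, "rex": 3}

print(lemmas_needed(toy_counts, 0.80))
```

In this toy table the three most frequent lemmas account for 85 of 100 tokens, so three lemmas suffice for the 80% level. In a real corpus the curve flattens quickly, which is why each additional percentage point of coverage costs many more lemmas.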

ψυχρός is an example of a word that makes the top 500 in TLG thanks to its prominence in medical texts, and it was thus omitted here; γωνία owes its statistical prominence to its appearance in very repetitive mathematical texts, and was thus also rejected.

Diederich's list of 1,500 words, which represents about 85% of the words in a typical Latin text, has been very usefully edited and presented by Carolus Raeticus on his site, Hiberna Caroli Raetici. For a Greek 50% list, see Major, 2008, p. 4.

The lemmas were then ranked by frequency (ὁ, αὐτός, καί, δέ, and τίς coming in as the top 5, for example, with ὁ, i.e. [...]). These are the lemmas or dictionary headwords that generate approximately 65% of the word forms in a typical Greek text. All such lists tend to agree for the most part on the top 500 to 600 lemmas, but beyond that the vagaries of individual samples dictate which lemmas make it into the top thousand and which lie just outside that cut-off.

These are the lemmas or dictionary headwords that generate approximately 70% of the word forms in a typical Latin text. The frequency rankings are derived from LASLA, and do not take Diederich's counts into consideration.
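Ranking lemmas by frequency and asking what share of a text the top entries generate can be sketched as follows. This is an illustration only: it assumes the text has already been lemmatized, the helper names `rank_lemmas` and `coverage_of_top` are hypothetical, and the token list is invented rather than drawn from TLG or LASLA.

```python
from collections import Counter


def rank_lemmas(lemma_tokens):
    """Return (lemma, count) pairs sorted by descending count.
    Ties keep the order in which the lemmas were first seen."""
    return Counter(lemma_tokens).most_common()


def coverage_of_top(ranked, n):
    """Fraction of all tokens generated by the top `n` ranked lemmas."""
    total = sum(count for _, count in ranked)
    if total == 0:
        return 0.0
    return sum(count for _, count in ranked[:n]) / total


# Invented, already-lemmatized toy tokens; NOT a real text sample.
toy_tokens = ["ὁ", "καί", "ὁ", "δέ", "ὁ", "αὐτός", "καί", "ὁ", "τίς", "λόγος"]

ranked = rank_lemmas(toy_tokens)
print(ranked[0])                   # the most frequent lemma and its count
print(coverage_of_top(ranked, 2))  # share of tokens from the top two lemmas
```

With a sample this small the ranking below the first few entries is decided by ties, which mirrors on a tiny scale why independent lists diverge once they move past the most common lemmas.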

