The text corpora from the field of journalism contain, in electronic format, about 5,000,000 word tokens published in the newspapers Makedonia (3,000,000) and Ta Nea (2,000,000). The material is grouped into thematic units and classified by genre (short news, social reporting, etc.).
The text corpora from the field of educational writing contain, in electronic format, textbooks (for both students and teachers) from the Lower Secondary School (gymnasion) and the Upper Secondary School (lykeion). This material is classified by genre (narrative, description, instructions, process analysis, argumentation). The supporting texts or excerpts from the textbooks Expression and Composition and Modern Greek Language are also classified by genre (correspondence, poem, application, etc.).
In the Greek version, you can search for any Greek word in any one of these corpora or in all of them at once.