The Corpus Electronic Platform of Historical Documents of the Sertão is an electronic corpus that gathers Portuguese texts, produced by individuals born between 1450 and 1950 , organized in: Corpus 1 - made up of different materials (manuscripts, printouts and speech samples), produced between 1823 and 2000 by individuals born after 1724; and Corpus 2 (prospecting)- composed of manuscripts produced between 1500 and 1822 by individuals born from1450.
Currently, part of the documents corresponding to Corpus 1 , 1,553 texts ( 1,119,447 words ), mostly manuscripts, edited in XML language and in annotation process morphosyntactic, is available in different editions, accompanied with facsimiles of documents: semi-diplomatic, modernized and original.