Ancient text corpora comprise the entire body of texts from the period of ancient history, defined in this article as the period from the beginning of writing up to 300 AD. These corpora are important for the study of literature, history, linguistics, and other fields, and are a fundamental component of the world's cultural heritage.
Chinese, Latin, and Greek are examples of ancient languages with significant text corpora, although much of these corpora is known to us via transmission (frequently via medieval manuscript copies) rather than in its original form. These texts – both transmitted and original – provide valuable insights into the history and culture of different regions of the world, and have been studied for centuries by scholars and researchers. Other ancient texts – particularly stone inscriptions and papyrus scrolls – have been published following archaeological research, notably the cuneiform corpus of c. 10 million words and the c. 5 million words in ancient Egyptian.
Through advances in technology and digitization, ancient text corpora are more accessible than ever before. Tools such as the Perseus Digital Library and the Digital Corpus of Sanskrit[1] have made it easier for researchers to access and analyze these texts.
[1] "Digital Corpus of Sanskrit (DCS) - Online Sanskrit dictionary and annotated corpus". www.sanskrit-linguistics.org. Archived from the original on 2023-06-03. Retrieved 2023-06-03.