This article has multiple issues. Please help improve it or discuss these issues on the talk page. (Learn how and when to remove these template messages)
This article uses bare URLs, which are uninformative and vulnerable to link rot. Please consider converting them to full citations to ensure the article remains verifiable and maintains a consistent citation style. Several templates and tools are available to assist in formatting, such as reFill (documentation) and Citation bot (documentation).(September 2022) (Learn how and when to remove this message)
This article relies excessively on references to primary sources. Please improve this article by adding secondary or tertiary sources. Find sources: "General Internet Corpus of Russian" – news · newspapers · books · scholar · JSTOR(June 2016) (Learn how and when to remove this message)
This article may lack focus or may be about more than one topic. Please help improve this article, possibly by splitting the article and/or by introducing a disambiguation page, or discuss this issue on the talk page.(June 2016)
(Learn how and when to remove this message)
General Internet Corpus of Russian
Type of site
educational/scientific project
Available in
Russian language
Created by
Vladimir Selegey, Vladimir Belikov, Serge Sharoff
URL
www.webcorpora.ru/en
Commercial
no
Registration
needed; given by request
Launched
2012
Current status
Beta-testing
General Internet Corpus of Russian (GICR) is a corpus of Russian internet texts that has been accessible on request through an online query interface since 2013. The corpus includes rich text materials from the blogosphere, social networks, major news sources and literary magazines.
and 25 Related for: General Internet Corpus of Russian information
The Russian National Corpus (Russian: Национальный корпус русского языка, lit. 'National Corpusof the Russian language') is a corpusof the Russian language...
an annotated corpusof Brazilian Portuguese text Belarusian N-korpus Russian National CorpusGeneralInternetCorpusofRussianGeneral regionally annotated...
Charles (July 14, 1955). "Nita Talbot Tabbed as New Star". The Corpus Christi Caller-Times. Corpus Christi, TX. Associated Press. p. 38. Retrieved June 9, 2017...
as the fourth most widely used language on the Internet. Russian is written using the Russian alphabet of the Cyrillic script; it distinguishes between...
much wider field of natural language processing. In order to be able to meticulously study the English language, an annotated text corpus was much needed...
consistently ranked as one of the ten most popular websites in the world, and as of 2024 is ranked the fifth most visited website on the Internet by Semrush, and...
Arabic Corpus (Arabic: المدونة القرآنية العربية, romanized: al-modwana al-Qurʾāni al-ʿArabiyya) is an annotated linguistic resource consisting of 77,430...
century. It took its start when Charles Bally's notion of locutions phraseologiques entered Russian lexicology and lexicography in the 1930s and 1940s and...
active member of the Russian Formalists and the Prague School, before emigrating to America in the 1940s. He brought together Russian Formalism and American...
with the text within the selected corpus, optionally using case-sensitive spelling (which compares the exact use of uppercase letters), and, if found...
(Romanian web corpus) ruTenTen (Russian web corpus) skTenTen (Slovak web corpus) slTenTen (Slovenian web corpus) svTenTen (Swedish web corpus) thTenTen (Thai...
This list ofRussian IT developers includes the hardware engineers, computer scientists and programmers from the Russian Empire, the Soviet Union and the...
a list of second languages most frequently taught in American schools and colleges. They reflect the popularity of these languages in terms of the total...
centuries, from which the broader philosophy of nihilism originated. In Russian, the word nigilizm (Russian: нигилизм; meaning 'nihilism', from Latin nihil 'nothing')...
other editors. From June 2021, a version of JMdict includes example sentences selected from the Tatoeba Corpus. EDICT has inspired other projects, including...
campaign of fraudulent information. According to the Oxford Internet Institute, eight of the top 10 "junk news" sources during the 2018 Swedish general election...
corpus, which provides a solid foundation for the model to perform well on downstream tasks with limited amounts of task-specific data. An example of...
around the world, is still in use although it is now only available in the internet-based format (now called the TOEFL iBT). Many tests from other companies...
Peace. Routledge. p. 119: "UN General Assembly Resolution 181 recommended the creation of an international zone, or corpus separatum, in Jerusalem to be...
British English writing as reflected in the Corpusof Contemporary American English data, but is falling out of favor in some style guides. E-mail is sometimes...
Frequently used and widely available on the internet, is the version by Isidore Dyen (1992, 200 meanings of 95 language variants). Since 2010, a team around...
angst means 'fear' in a general sense (as well as 'anxiety') in German, but when it was borrowed into English in the context of psychology, its meaning...
the use of multiple languages in a single conversation. Methods from sociolinguistics (the study of language use in society), from corpus linguistics...