Corpus Information for Tajiki [tgk] Tajikistan

Language
Tajiki
ISO Code
tgk   Wikipedia , Ethnologue , Glottolog , MultiTree , ScriptSource
Country
Tajikistan
Corpus Name
tgk_community_2017   LCC Portal
Tokens
14,147,320
Types
514,746
Sentences
707,117
Sources (URLs)
78,474
Build date
2017-06-01
Corpus Name
tgk_community_2021
Tokens
19,280,738
Types
588,452
Sentences
939,144
Sources (URLs)
93,216
Build date
2021-06-07
Corpus Name
tgk_community_2022
Tokens
19,341,776
Types
596,826
Sentences
941,793
Sources (URLs)
93,504
Build date
2022-02-08
URLs
List of URLs download
List of Domains download
Download
tgk_community_2017 2017-06-01
tgk_community_2021 2021-06-07
tgk_community_2022 2022-02-08
Contact
No contact person for this language.
Use this Contact    to add contact details.