Corpus Information for Tajiki [tgk] Tajikistan
- Language
- Tajiki
- ISO Code
- tgk Wikipedia , Ethnologue , Glottolog , MultiTree , ScriptSource
- Country
- Tajikistan
- Corpus Name
- tgk_community_2017 LCC Portal
- Tokens
- 14,147,320
- Types
- 514,746
- Sentences
- 707,117
- Sources (URLs)
- 78,474
- Build date
- 2017-06-01
- Corpus Name
- tgk_community_2021
- Tokens
- 19,280,738
- Types
- 588,452
- Sentences
- 939,144
- Sources (URLs)
- 93,216
- Build date
- 2021-06-07
- Corpus Name
- tgk_community_2022
- Tokens
- 19,341,776
- Types
- 596,826
- Sentences
- 941,793
- Sources (URLs)
- 93,504
- Build date
- 2022-02-08
- URLs
- List of URLs download
- List of Domains download
- Download
- tgk_community_2017 2017-06-01
- tgk_community_2021 2021-06-07
- tgk_community_2022 2022-02-08
- Contact
- No contact person for this language.
- Use this Contact to add contact details.