Corpus Information for Thai [tha] Thailand

Language
Thai
ISO Code
tha (Wikipedia, Ethnologue, Glottolog, MultiTree, ScriptSource)
Country
Thailand
Corpus Name                      Tokens    Types    Sentences  Sources (URLs)  Build date
tha_community_2017 (LCC Portal)  793,101   196,199  57,013     21,460          2017-06-01
tha_community_2021               793,384   196,452  57,017     21,462          2021-03-05
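
The two builds differ only slightly, and the listed counts are enough to quantify that. The following is a minimal sketch, not part of the corpus tooling: the class `CorpusStats` and the helper functions are hypothetical names introduced for illustration, and the numbers are copied from the figures listed above. How tokens were segmented is not stated here, which matters for Thai because words are not separated by spaces, so the derived ratios should be read with that caveat.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CorpusStats:
    """Counts for one corpus build (hypothetical container, not an LCC API)."""
    name: str
    tokens: int
    types: int
    sentences: int
    sources: int


# Figures copied from the corpus listing above.
BUILD_2017 = CorpusStats("tha_community_2017", 793_101, 196_199, 57_013, 21_460)
BUILD_2021 = CorpusStats("tha_community_2021", 793_384, 196_452, 57_017, 21_462)

COUNT_FIELDS = ("tokens", "types", "sentences", "sources")


def type_token_ratio(stats: CorpusStats) -> float:
    """Distinct word forms (types) divided by running words (tokens)."""
    return stats.types / stats.tokens


def tokens_per_sentence(stats: CorpusStats) -> float:
    """Average sentence length in tokens."""
    return stats.tokens / stats.sentences


def growth(old: CorpusStats, new: CorpusStats) -> dict:
    """Absolute change in each count from the old build to the new one."""
    return {f: getattr(new, f) - getattr(old, f) for f in COUNT_FIELDS}


if __name__ == "__main__":
    print(growth(BUILD_2017, BUILD_2021))
    for build in (BUILD_2017, BUILD_2021):
        print(
            f"{build.name}: "
            f"type/token ratio {type_token_ratio(build):.4f}, "
            f"{tokens_per_sentence(build):.2f} tokens per sentence"
        )
```

Run as a script, this prints the per-field difference between the 2017 and 2021 builds, followed by the type/token ratio and average sentence length for each.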
URLs
List of URLs (download)
List of Domains (download)
Download
tha_community_2017 2017-06-01
tha_community_2021 2021-03-05
Contact
No contact person for this language.
Use this Contact to add contact details.