Corpus Information for Malayalam [mal] India

Language
Malayalam
ISO Code
mal   Wikipedia , Ethnologue , Glottolog , MultiTree , ScriptSource
Country
India
Corpus Name
mal_community_2017   LCC Portal
Tokens
7,046,155
Types
1,004,168
Sentences
602,268
Sources (URLs)
62,070
Build date
2017-06-28
Corpus Name
mal_community_2021
Tokens
7,066,023
Types
1,001,257
Sentences
602,385
Sources (URLs)
62,153
Build date
2021-03-04
URLs
List of URLs download
List of Domains download
Download
mal_community_2017 2017-06-28
mal_community_2021 2021-03-04
Contact
No contact person for this language.
Use this Contact    to add contact details.