Corpus Information for Bengali [ben] Bangladesh

Language
Bengali
ISO Code
ben   Wikipedia , Ethnologue , Glottolog , MultiTree , ScriptSource
Country
Bangladesh
Corpus Name
ben_community_2017   LCC Portal
Tokens
17,489,212
Types
645,503
Sentences
1,200,255
Sources (URLs)
215,390
Build date
2017-08-04
Corpus Name
ben_community_2019
Tokens
108,625
Types
21,050
Sentences
7,030
Sources (URLs)
506
Build date
2019-03-26
Corpus Name
ben_community_2021
Tokens
5,994,699
Types
260,505
Sentences
469,414
Sources (URLs)
21,270
Build date
2021-03-05
Corpus Name
ben_community_2022
Tokens
5,997,574
Types
266,808
Sentences
469,711
Sources (URLs)
21,299
Build date
2022-02-03
URLs
List of URLs download
List of Domains download
Download
ben_community_2017 2017-08-04
ben_community_2019 2019-03-26
ben_community_2021 2021-03-05
ben_community_2022 2022-02-03
Contact
No contact person for this language.
Use this Contact    to add contact details.