Package nltk_lite :: Package corpora :: Module indian

Module indian

Indian Language POS-Tagged Corpus
Collected by A Kumaran, Microsoft Research, India
Distributed with permission

Contents:
- Bangla: IIT Kharagpur
- Hindi: Microsoft Research India
- Marathi: IIT Bombay
- Telugu: IIIT Hyderabad

Functions

_read(files, conversion_function)

xreadlines(files=['bangla', 'hindi', 'marathi', 'telugu'])

raw(files=['bangla', 'hindi', 'marathi', 'telugu'])

tagged(files=['bangla', 'hindi', 'marathi', 'telugu'])

sample(language)

demo()

Variables

items = ['bangla', 'hindi', 'marathi', 'telugu']