Package nltk_lite :: Package corpora :: Module genesis

Module genesis

The Genesis Corpus.

This corpus has been prepared from several web sources; formatting, markup and verse numbers have been stripped.

  • english-kjv - Genesis, King James version (Project Gutenberg)
  • english-web - Genesis, World English Bible (Project Gutenberg)
  • french - Genesis, Louis Segond 1910
  • german - Genesis, Luther Translation
  • swedish - Genesis, Gamla och Nya Testamentet, 1917 (Project Runeberg)
  • finnish - Genesis, Suomen evankelis-luterilaisen kirkon kirkolliskokouksen vuonna 1992 kayttoon ottama suomennos

Functions
  • raw(files='english-kjv') - returns an iterator over tree
  • demo()
Variables
  items = ['english-kjv', 'english-web', 'french', 'german', 'sw...
  item_name = {'english-kjv': 'Genesis, King James version (Proj...
Function Details

raw(files='english-kjv')

Parameters:
  • files (string or tuple(string)) - One or more Genesis corpus files to be processed, named as in items (default 'english-kjv')
Returns: iterator over tree
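
The files parameter accepts either a single name or a tuple of names. The following is a minimal sketch of how such an argument might be normalized and checked against the items list. The helper normalize_files and the constant ITEMS are hypothetical names introduced for illustration; they are not part of nltk_lite, and the module itself may handle the argument differently.

```python
# Hypothetical helper, not part of nltk_lite: it only illustrates the
# "string or tuple(string)" convention documented for the files parameter.

# Copied from the items variable documented for this module.
ITEMS = ['english-kjv', 'english-web', 'french',
         'german', 'swedish', 'finnish']


def normalize_files(files='english-kjv'):
    """Return files as a tuple of names, rejecting unknown items."""
    # A single name is wrapped in a tuple so callers can always iterate.
    if isinstance(files, str):
        files = (files,)
    files = tuple(files)

    # Assumption: only names listed in ITEMS are valid corpus files.
    unknown = [name for name in files if name not in ITEMS]
    if unknown:
        raise ValueError('unknown Genesis items: %s' % ', '.join(unknown))
    return files
```

With this sketch, normalize_files() and normalize_files('english-kjv') both give a one-element tuple, so code that consumes the result needs only one loop for both calling styles.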

Variables Details

items

Value:
['english-kjv',
 'english-web',
 'french',
 'german',
 'swedish',
 'finnish']

item_name

Value:
{'english-kjv': 'Genesis, King James version (Project Gutenberg)',
 'english-web': 'Genesis, World English Bible (Project Gutenberg)',
 'finnish': 'Genesis, Suomen evankelis-luterilaisen kirkon kirkolliskokouksen vuonna 1992 kayttoon ottama suomennos',
 'french': 'Genesis, Louis Segond 1910',
 'german': 'Genesis, Luther Translation',
 'swedish': 'Genesis, Gamla och Nya Testamentet, 1917 (Project Runeberg)'}
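
Because items fixes an order and item_name maps each name to a description, the two can be combined to list the available translations. The sketch below works on copies of the values shown above rather than importing the module; the function name describe_items is an assumption made for illustration and does not exist in nltk_lite.

```python
# Copies of the documented module variables, so the sketch runs on its own.
items = ['english-kjv', 'english-web', 'french',
         'german', 'swedish', 'finnish']

item_name = {
    'english-kjv': 'Genesis, King James version (Project Gutenberg)',
    'english-web': 'Genesis, World English Bible (Project Gutenberg)',
    'finnish': ('Genesis, Suomen evankelis-luterilaisen kirkon '
                'kirkolliskokouksen vuonna 1992 kayttoon ottama suomennos'),
    'french': 'Genesis, Louis Segond 1910',
    'german': 'Genesis, Luther Translation',
    'swedish': 'Genesis, Gamla och Nya Testamentet, 1917 (Project Runeberg)',
}


def describe_items(names=items, descriptions=item_name):
    """Return 'name: description' lines in the order given by names."""
    # Hypothetical helper: pairs each item with its human-readable title.
    return ['%s: %s' % (name, descriptions[name]) for name in names]


for line in describe_items():
    print(line)
```

Iterating over items rather than over the dictionary keeps the output in the documented order, with the two English versions first.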