Package nltk_lite :: Package corpora :: Module ieer
[hide private]
[frames] | no frames]

Module ieer

source code

Corpus reader for the Information Extraction and Entity Recognition Corpus.

NIST 1999 Information Extraction: Entity Recognition Evaluation http://www.itl.nist.gov/iad/894.01/tests/ie-er/er_99/er_99.htm

This corpus contains the NEWSWIRE development test data for the NIST 1999 IE-ER Evaluation. The files were taken from the subdirectory: /ie_er_99/english/devtest/newswire/*.ref.nwt and filenames were shortened.

The corpus contains the following files: APW_19980314, APW_19980424, APW_19980429, NYT_19980315, NYT_19980403, and NYT_19980407.

Functions [hide private]
 
raw(files=['APW_19980314', 'APW_19980424', 'APW_19980429', 'NYT_19980315...) source code
 
dictionary(files=['APW_19980314', 'APW_19980424', 'APW_19980429', 'NYT_19980315...) source code
 
demo() source code
Variables [hide private]
  items = ['APW_19980314', 'APW_19980424', 'APW_19980429', 'NYT_...
  item_name = {'APW_19980314': 'Associated Press Weekly, 14 Marc...
Variables Details [hide private]

items

Value:
['APW_19980314',
 'APW_19980424',
 'APW_19980429',
 'NYT_19980315',
 'NYT_19980403',
 'NYT_19980407']

item_name

Value:
{'APW_19980314': 'Associated Press Weekly, 14 March 1998',
 'APW_19980424': 'Associated Press Weekly, 24 April 1998',
 'APW_19980429': 'Associated Press Weekly, 29 April 1998',
 'NYT_19980315': 'New York Times, 15 March 1998',
 'NYT_19980403': 'New York Times, 3 April 1998',
 'NYT_19980407': 'New York Times, 7 April 1998'}