org.apache.lucene.analysis.br
Class BrazilianAnalyzer
public final class BrazilianAnalyzer
Analyzer for the Brazilian Portuguese language. Supports an external list of stopwords
(words that will not be indexed at all) and an external list of exclusions (words that
will be indexed, but not stemmed).
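The difference between the two lists can be illustrated with a minimal plain-Java sketch that does not use Lucene. The class name `StopAndExclusionSketch`, the method `analyze`, and the toy suffix-stripping rule are all hypothetical and exist only for illustration; the stemming performed by this analyzer is considerably more elaborate.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class StopAndExclusionSketch {

    // Toy stand-in for a stemmer: strips one trailing "s" from longer terms.
    // This is NOT the Brazilian stemming algorithm, only a placeholder.
    static String toyStem(String term) {
        return term.length() > 3 && term.endsWith("s")
                ? term.substring(0, term.length() - 1)
                : term;
    }

    // Returns the terms that would be indexed under the two lists.
    static List<String> analyze(List<String> tokens,
                                Set<String> stopWords,
                                Set<String> stemExclusions) {
        List<String> indexed = new ArrayList<>();
        for (String token : tokens) {
            if (stopWords.contains(token)) {
                continue; // stop word: not indexed at all
            }
            if (stemExclusions.contains(token)) {
                indexed.add(token); // exclusion: indexed, but left unstemmed
            } else {
                indexed.add(toyStem(token)); // everything else: stemmed, then indexed
            }
        }
        return indexed;
    }

    public static void main(String[] args) {
        Set<String> stopWords = new HashSet<>(Arrays.asList("de", "os"));
        Set<String> exclusions = new HashSet<>(Arrays.asList("atlas"));
        List<String> out = analyze(
                Arrays.asList("os", "livros", "de", "atlas"), stopWords, exclusions);
        System.out.println(out); // prints [livro, atlas]
    }
}
```

Here "os" and "de" disappear because they are stop words, "livros" is reduced by the toy stemmer, and "atlas" survives unchanged because it is on the exclusion list.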
void | setStemExclusionTable(File exclusionlist) - Builds an exclusionlist from the words contained in the given file.
void | setStemExclusionTable(Hashtable exclusionlist) - Builds an exclusionlist from a Hashtable.
void | setStemExclusionTable(String[] exclusionlist) - Builds an exclusionlist from an array of Strings.
TokenStream | tokenStream(String fieldName, Reader reader) - Creates a TokenStream which tokenizes all the text in the provided Reader.
BRAZILIAN_STOP_WORDS
public static final String[] BRAZILIAN_STOP_WORDS
List of typical Brazilian stopwords.
BrazilianAnalyzer
public BrazilianAnalyzer()
BrazilianAnalyzer
public BrazilianAnalyzer(File stopwords)
throws IOException
Builds an analyzer with the given stop words.
BrazilianAnalyzer
public BrazilianAnalyzer(Hashtable stopwords)
Builds an analyzer with the given stop words.
BrazilianAnalyzer
public BrazilianAnalyzer(String[] stopwords)
Builds an analyzer with the given stop words.
setStemExclusionTable
public void setStemExclusionTable(File exclusionlist)
throws IOException
Builds an exclusionlist from the words contained in the given file.
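As a rough illustration of what loading such a file could involve, the plain-Java sketch below reads a word list into a set. The one-word-per-line layout, the UTF-8 encoding, the trimming of whitespace, and the names `WordListFileSketch` and `loadWords` are assumptions made for this example; this page does not specify the file format the method expects.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

public class WordListFileSketch {

    // Assumed format: one word per line. Blank lines are skipped and
    // surrounding whitespace is trimmed.
    static Set<String> loadWords(Path file) throws IOException {
        Set<String> words = new HashSet<>();
        for (String line : Files.readAllLines(file, StandardCharsets.UTF_8)) {
            String word = line.trim();
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        // Write a small temporary word list so the example is self-contained.
        Path file = Files.createTempFile("exclusions", ".txt");
        Files.write(file, Arrays.asList("atlas", "", "  brasil  "), StandardCharsets.UTF_8);

        Set<String> words = loadWords(file);
        System.out.println(words.contains("atlas") + " " + words.contains("brasil")
                + " " + words.size()); // prints true true 2

        Files.delete(file);
    }
}
```

A set loaded this way could be converted with `words.toArray(new String[0])` and handed to the `setStemExclusionTable(String[] exclusionlist)` overload instead of passing the file directly.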
setStemExclusionTable
public void setStemExclusionTable(Hashtable exclusionlist)
Builds an exclusionlist from a Hashtable.
setStemExclusionTable
public void setStemExclusionTable(String[] exclusionlist)
Builds an exclusionlist from an array of Strings.
tokenStream
public final TokenStream tokenStream(String fieldName, Reader reader)
Creates a TokenStream which tokenizes all the text in the provided Reader.
- Specified by: tokenStream in class Analyzer
- Returns: A TokenStream built from a StandardTokenizer filtered with
StandardFilter, StopFilter, BrazilianStemFilter and LowerCaseFilter.
Copyright © 2000-2007 Apache Software Foundation. All Rights Reserved.