edu.washington.cs.knowitall.extractor
Class SentenceExtractor
java.lang.Object
edu.washington.cs.knowitall.extractor.Extractor<String,String>
edu.washington.cs.knowitall.extractor.SentenceExtractor
- Direct Known Subclasses:
- HtmlSentenceExtractor
public class SentenceExtractor
- extends Extractor<String,String>
An Extractor that extracts sentences, as String objects, from a
String. It is backed by an OpenNLP SentenceDetector object.
- Author:
- afader
Constructor Summary
SentenceExtractor()
Constructs a new SentenceExtractor object using the default OpenNLP
SentenceDetector object, as returned by DefaultObjects.getDefaultSentenceDetector().
SentenceExtractor(opennlp.tools.sentdetect.SentenceDetector detector)
Constructs a new SentenceExtractor object using the given OpenNLP SentenceDetector
object.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
SentenceExtractor
public SentenceExtractor()
throws IOException
- Constructs a new
SentenceExtractor object using the default OpenNLP
SentenceDetector object, as returned by DefaultObjects.getDefaultSentenceDetector().
- Throws:
IOException
SentenceExtractor
public SentenceExtractor(opennlp.tools.sentdetect.SentenceDetector detector)
- Constructs a new
SentenceExtractor object using the given OpenNLP SentenceDetector
object.
- Parameters:
detector - the OpenNLP SentenceDetector object to use for sentence detection.
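The two constructors differ only in where the detector comes from: a default one, or one supplied by the caller. The following is a minimal sketch of the caller-supplied case, not the library's code. It uses only the JDK, so `SentenceDetector` here is a local stand-in that mirrors just the `sentDetect` method of the OpenNLP interface, and `SketchSentenceExtractor` and `naiveDetector` are hypothetical names introduced for illustration.

```java
import java.util.Objects;

public class DetectorInjectionSketch {

    /** Local stand-in for opennlp.tools.sentdetect.SentenceDetector (assumption: only sentDetect is mirrored). */
    public interface SentenceDetector {
        String[] sentDetect(String text);
    }

    /** Hypothetical, simplified counterpart of SentenceExtractor that holds the detector it is given. */
    public static class SketchSentenceExtractor {
        private final SentenceDetector detector;

        public SketchSentenceExtractor(SentenceDetector detector) {
            // The null check is a choice made for this sketch, not documented behavior.
            this.detector = Objects.requireNonNull(detector, "detector");
        }

        public SentenceDetector getSentenceDetector() {
            return detector;
        }
    }

    /** Naive detector for demonstration: splits after '.', '!' or '?' when followed by whitespace. */
    public static SentenceDetector naiveDetector() {
        return text -> text.trim().split("(?<=[.!?])\\s+");
    }

    public static void main(String[] args) {
        SentenceDetector detector = naiveDetector();
        SketchSentenceExtractor extractor = new SketchSentenceExtractor(detector);
        // The accessor hands back the same instance that was passed in.
        System.out.println(extractor.getSentenceDetector() == detector);
    }
}
```

Passing the detector in, rather than always building the default, lets a caller reuse one detector across several extractors or swap in a different model.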
getSentenceDetector
public opennlp.tools.sentdetect.SentenceDetector getSentenceDetector()
- Returns:
- the OpenNLP
SentenceDetector object.
extractCandidates
protected Collection<String> extractCandidates(String source)
- Runs the OpenNLP
SentenceDetector object on the given String source,
and returns a Collection of the detected sentences.
- Specified by:
extractCandidates in class Extractor<String,String>
- Parameters:
source - the source to extract from.
- Returns:
- a Collection of the candidate extractions.
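The shape of extractCandidates, a protected hook that a subclass fills in and the base class calls, can be sketched as follows. This is an illustration under stated assumptions rather than the library's implementation: `SketchExtractor` and its `extract` method are hypothetical stand-ins for the Extractor base class, and the JDK's `java.text.BreakIterator` is swapped in for the OpenNLP SentenceDetector so that the example runs without extra dependencies.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.Locale;

public class ExtractCandidatesSketch {

    /** Hypothetical, pared-down stand-in for Extractor<S,T>. */
    public abstract static class SketchExtractor<S, T> {
        /** Subclasses produce the raw candidates for a source. */
        protected abstract Collection<T> extractCandidates(S source);

        /** Public entry point; in this sketch it simply forwards to the hook. */
        public Collection<T> extract(S source) {
            return extractCandidates(source);
        }
    }

    /** Sentence extractor whose candidates are the sentences found in the source. */
    public static class SketchSentenceExtractor extends SketchExtractor<String, String> {
        @Override
        protected Collection<String> extractCandidates(String source) {
            List<String> sentences = new ArrayList<>();
            // BreakIterator replaces the OpenNLP detector here (assumption for this sketch).
            BreakIterator boundaries = BreakIterator.getSentenceInstance(Locale.US);
            boundaries.setText(source);
            int start = boundaries.first();
            for (int end = boundaries.next(); end != BreakIterator.DONE;
                    start = end, end = boundaries.next()) {
                String sentence = source.substring(start, end).trim();
                if (!sentence.isEmpty()) {
                    sentences.add(sentence);
                }
            }
            return sentences;
        }
    }

    public static void main(String[] args) {
        SketchSentenceExtractor extractor = new SketchSentenceExtractor();
        for (String sentence : extractor.extract("It rained. We stayed home.")) {
            System.out.println(sentence);
        }
    }
}
```

Because the hook is protected, callers go through the public entry point, which is where a base class could apply any further processing to the candidates.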
Copyright © 2010-2012 University of Washington CSE. All Rights Reserved.