Package rocks.imsofa.ai.puppychatter.rag
Class PuppyChatterHtmlContentExtractor
java.lang.Object
rocks.imsofa.ai.puppychatter.rag.PuppyChatterTextContentExtractor
rocks.imsofa.ai.puppychatter.rag.PuppyChatterHtmlContentExtractor
- All Implemented Interfaces:
TextContentExtractor
This implementation expects the given URL to point to an HTML page.
-
Field Summary
Fields inherited from class rocks.imsofa.ai.puppychatter.rag.PuppyChatterTextContentExtractor
puppyChatter
Constructor Summary
Constructors
Method Summary
Method and Description
extractTextContent(List<Conversation> messages, URL url)
    Extract text content from a URL using a default charset.
extractTextContent(List<Conversation> messages, URL url, String charset)
    Extract text content from a URL.
Methods inherited from class rocks.imsofa.ai.puppychatter.rag.PuppyChatterTextContentExtractor
extractTextContent
-
Constructor Details
-
PuppyChatterHtmlContentExtractor
-
-
Method Details
-
extractTextContent
public String extractTextContent(List<Conversation> messages, URL url, String charset) throws Exception
Description copied from interface: TextContentExtractor
Extract text content from a URL.
- Specified by:
extractTextContent in interface TextContentExtractor
- Overrides:
extractTextContent in class PuppyChatterTextContentExtractor
- Parameters:
messages -
url -
charset -
- Returns:
- Throws:
Exception
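The method above takes a charset name alongside the URL. As a rough illustration of why the charset matters, the sketch below decodes raw page bytes with a named charset and then strips markup. It is not the implementation used by PuppyChatterHtmlContentExtractor; the class HtmlTextSketch and its methods toPlainText and stripTags are hypothetical names, only JDK classes are used, and the regex-based tag removal is an assumption chosen for brevity (a real extractor would more likely use an HTML parser).

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of puppychatter: shows one way HTML bytes
// could be turned into plain text once a charset name is known.
class HtmlTextSketch {

    // Decode the raw page bytes with the named charset, then drop markup.
    static String toPlainText(byte[] htmlBytes, String charsetName) {
        String html = new String(htmlBytes, Charset.forName(charsetName));
        return stripTags(html);
    }

    // Crude, regex-based tag removal (an assumption made for this sketch).
    static String stripTags(String html) {
        // Remove script and style elements together with their contents.
        String noScripts = html.replaceAll("(?is)<(script|style)[^>]*>.*?</\\1>", " ");
        // Replace every remaining tag with a space.
        String noTags = noScripts.replaceAll("(?s)<[^>]+>", " ");
        // Collapse runs of whitespace and trim the ends.
        return noTags.replaceAll("\\s+", " ").trim();
    }

    public static void main(String[] args) {
        // The page is encoded as ISO-8859-1, so it must be decoded the same way.
        byte[] page = "<html><body><h1>Hello</h1><p>caf\u00e9</p></body></html>"
                .getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(toPlainText(page, "ISO-8859-1"));
    }
}
```

Passing a charset name that does not match the page's actual encoding would yield garbled non-ASCII characters, which is presumably why the charset is exposed as a parameter.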
-
extractTextContent
Description copied from interface: TextContentExtractor
Extract text content from a URL using a default charset.
- Specified by:
extractTextContent in interface TextContentExtractor
- Overrides:
extractTextContent in class PuppyChatterTextContentExtractor
- Parameters:
messages -
url -
- Returns:
- Throws:
Exception
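A default-charset overload like the one above is commonly written as a thin wrapper that delegates to the charset-taking variant. The sketch below shows that pattern with simplified signatures. The class DefaultCharsetSketch, the decode methods, and the choice of UTF-8 as the default are all assumptions: the documentation does not state which charset this class actually uses by default or how the overloads are related internally.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration of overload delegation; not puppychatter code.
class DefaultCharsetSketch {

    // Assumption: UTF-8. The documentation does not name the real default.
    static final String DEFAULT_CHARSET = StandardCharsets.UTF_8.name();

    // Convenience overload: falls back to the assumed default charset.
    static String decode(byte[] data) {
        return decode(data, DEFAULT_CHARSET);
    }

    // Full overload: the caller picks the charset by name.
    static String decode(byte[] data, String charsetName) {
        return new String(data, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        byte[] utf8Bytes = "h\u00e9llo".getBytes(StandardCharsets.UTF_8);
        // Both calls decode the same bytes; only the second names the charset.
        System.out.println(decode(utf8Bytes));
        System.out.println(decode(utf8Bytes, "UTF-8"));
    }
}
```

Keeping the decoding logic in one method means the two overloads cannot drift apart; callers who know the page encoding can always use the explicit variant.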
-