Class PdfTextSplitter

java.lang.Object
org.nasdanika.rag.core.PdfTextSplitter

public class PdfTextSplitter extends Object
Extracts text from a PDF and splits it into chunks. This class tries to keep each paragraph together in a single chunk and splits a paragraph into sentences only when it cannot be kept together.
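
The paragraph-first strategy described above can be illustrated with a minimal sketch. This is not the PdfTextSplitter implementation: the class name ParagraphChunker, the split method, and the maxChars limit are all hypothetical, PDF text extraction is left out, and the sketch assumes that paragraphs are separated by blank lines and that "not possible" means a paragraph exceeds a character limit.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; not part of org.nasdanika.rag.core.
public class ParagraphChunker {

    /** Splits already-extracted text into chunks of at most maxChars characters where possible. */
    public static List<String> split(String text, int maxChars) {
        List<String> chunks = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        // Assumption: paragraphs are separated by one or more blank lines.
        for (String rawParagraph : text.split("\\R\\s*\\R")) {
            String paragraph = rawParagraph.trim();
            if (paragraph.isEmpty()) {
                continue;
            }
            if (paragraph.length() <= maxChars) {
                // The paragraph fits in a chunk, so it is kept whole.
                append(chunks, current, paragraph, "\n\n", maxChars);
            } else {
                // The paragraph is too long, so fall back to sentence granularity.
                for (String sentence : sentences(paragraph)) {
                    append(chunks, current, sentence, " ", maxChars);
                }
            }
        }
        if (current.length() > 0) {
            chunks.add(current.toString());
        }
        return chunks;
    }

    // Adds a piece to the current chunk, starting a new chunk first if the piece would not fit.
    // A single piece longer than maxChars still becomes its own, oversized chunk.
    private static void append(List<String> chunks, StringBuilder current,
                               String piece, String separator, int maxChars) {
        if (current.length() > 0
                && current.length() + separator.length() + piece.length() > maxChars) {
            chunks.add(current.toString());
            current.setLength(0);
        }
        if (current.length() > 0) {
            current.append(separator);
        }
        current.append(piece);
    }

    // Splits a paragraph into sentences using the JDK's locale-aware sentence iterator.
    private static List<String> sentences(String paragraph) {
        BreakIterator iterator = BreakIterator.getSentenceInstance(Locale.ENGLISH);
        iterator.setText(paragraph);
        List<String> result = new ArrayList<>();
        int start = iterator.first();
        for (int end = iterator.next(); end != BreakIterator.DONE; start = end, end = iterator.next()) {
            String sentence = paragraph.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                result.add(sentence);
            }
        }
        return result;
    }
}
```

In this sketch, short paragraphs are packed into a chunk as whole units, and sentence splitting happens only for a paragraph that exceeds the limit on its own. The actual class may measure chunk size differently, for example in tokens rather than characters.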