Class RecursiveCharacterTextSplitter

java.lang.Object
top.aoyudi.rag.impl.RecursiveCharacterTextSplitter
All Implemented Interfaces:
TextSplitter

public class RecursiveCharacterTextSplitter extends Object implements TextSplitter
A recursive character text splitter that splits text along natural boundaries into chunks of a specified size.
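The recursive strategy described above can be illustrated with a minimal sketch. It is not this class's actual implementation: the class name `RecursiveSplitSketch`, the `split` method, and the separator values are all hypothetical, since this page does not list the contents of DEFAULT_SEPARATORS or any method signatures. The sketch tries the coarsest separator first, packs pieces greedily up to the chunk size, and recurses with finer separators only for pieces that are still too long. Overlap is deliberately left out to keep the example short.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

class RecursiveSplitSketch {

    // Hypothetical separators, coarsest first. The real DEFAULT_SEPARATORS
    // values are not shown on this page.
    public static final List<String> SEPARATORS = List.of("\n\n", "\n", " ");

    public static List<String> split(String text, int chunkSize, List<String> separators) {
        List<String> chunks = new ArrayList<>();

        // Base case: the text already fits into one chunk.
        if (text.length() <= chunkSize) {
            if (!text.isEmpty()) {
                chunks.add(text);
            }
            return chunks;
        }

        // No natural boundary left: fall back to a hard cut every chunkSize characters.
        if (separators.isEmpty()) {
            for (int i = 0; i < text.length(); i += chunkSize) {
                chunks.add(text.substring(i, Math.min(text.length(), i + chunkSize)));
            }
            return chunks;
        }

        String sep = separators.get(0);
        List<String> finer = separators.subList(1, separators.size());
        StringBuilder current = new StringBuilder();

        for (String piece : text.split(Pattern.quote(sep))) {
            if (piece.isEmpty()) {
                continue;
            }
            if (piece.length() > chunkSize) {
                // Flush what has been collected, then recurse with finer separators.
                if (current.length() > 0) {
                    chunks.add(current.toString());
                    current.setLength(0);
                }
                chunks.addAll(split(piece, chunkSize, finer));
                continue;
            }
            int needed = current.length() == 0
                    ? piece.length()
                    : current.length() + sep.length() + piece.length();
            if (needed > chunkSize) {
                // Adding this piece would exceed the limit, so start a new chunk.
                chunks.add(current.toString());
                current.setLength(0);
            }
            if (current.length() > 0) {
                current.append(sep);
            }
            current.append(piece);
        }
        if (current.length() > 0) {
            chunks.add(current.toString());
        }
        return chunks;
    }
}
```

Under these assumptions, `RecursiveSplitSketch.split("aaa bbb\n\nccc ddd eee", 7, RecursiveSplitSketch.SEPARATORS)` keeps the first paragraph whole and breaks the second one at spaces, because only the second paragraph exceeds seven characters.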
  • Field Details

    • DEFAULT_CHUNK_SIZE

      public static final int DEFAULT_CHUNK_SIZE
      Default chunk size, in characters.
      See Also:
    • DEFAULT_CHUNK_OVERLAP

      public static final int DEFAULT_CHUNK_OVERLAP
      Default chunk overlap, in characters.
      See Also:
    • DEFAULT_SEPARATORS

      public static final List<String> DEFAULT_SEPARATORS
      Default list of separators, ordered by priority.
    • chunkSize

      private final int chunkSize
    • chunkOverlap

      private final int chunkOverlap
    • separators

      private final List<String> separators
  • Constructor Details

    • RecursiveCharacterTextSplitter

      public RecursiveCharacterTextSplitter()
      Creates a text splitter with the default parameters.
    • RecursiveCharacterTextSplitter

      public RecursiveCharacterTextSplitter(int chunkSize, int chunkOverlap, List<String> separators)
      Creates a text splitter with custom parameters.
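To show how chunkSize and chunkOverlap might relate, here is a small sketch of overlapping fixed-size windows. It rests on two assumptions that this page does not confirm: that the overlap is the number of characters a chunk shares with the end of the previous chunk, and that chunkOverlap must be smaller than chunkSize. The class `OverlapSketch` and its `window` method are hypothetical and ignore separators entirely.

```java
import java.util.ArrayList;
import java.util.List;

class OverlapSketch {

    // Cuts text into windows of at most chunkSize characters, where each
    // window starts chunkOverlap characters before the previous one ended.
    public static List<String> window(String text, int chunkSize, int chunkOverlap) {
        // Assumed validation: without it the step below would be zero or negative.
        if (chunkSize <= 0 || chunkOverlap < 0 || chunkOverlap >= chunkSize) {
            throw new IllegalArgumentException(
                    "require chunkSize > 0 and 0 <= chunkOverlap < chunkSize");
        }
        List<String> chunks = new ArrayList<>();
        int step = chunkSize - chunkOverlap;
        for (int start = 0; start < text.length(); start += step) {
            int end = Math.min(text.length(), start + chunkSize);
            chunks.add(text.substring(start, end));
            if (end == text.length()) {
                break; // the last window already reaches the end of the text
            }
        }
        return chunks;
    }
}
```

With a chunk size of 4 and an overlap of 2, `OverlapSketch.window("abcdefghij", 4, 2)` advances two characters per window, so every chunk repeats the last two characters of the chunk before it.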
  • Method Details