java.lang.Object
org.nasdanika.ai.TextFloatVectorChunkingEmbeddingModel<T>
- Type Parameters:
T- a container of tokens, e.g. int[] or char[] or List<Integer>
- All Implemented Interfaces:
Coordinates,EmbeddingGenerator<String,,List<List<Float>>> EmbeddingModel<String,,List<List<Float>>> FloatVectorEmbeddingGenerator<String>,Model,TextEmbeddingGenerator<List<List<Float>>>,TextFloatVectorEmbeddingModel,VectorEmbeddingGenerator<String,,Float> Composable<EmbeddingGenerator<String,List<List<Float>>>>
- Direct Known Subclasses:
TextFloatVectorCharChunkingEmbeddings,TextFloatVectorEncodingChunkingEmbeddingModel
public abstract class TextFloatVectorChunkingEmbeddingModel<T>
extends Object
implements TextFloatVectorEmbeddingModel
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.nasdanika.ai.EmbeddingGenerator
EmbeddingGenerator.Requirement -
Constructor Summary
ConstructorsModifierConstructorDescriptionprotectedTextFloatVectorChunkingEmbeddingModel(TextFloatVectorEmbeddingModel target, int chunkSize, int overlap) -
Method Summary
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.nasdanika.ai.EmbeddingGenerator
adapt, compose, compose, composeAsync, generate, generateAsync, then
-
Constructor Details
-
TextFloatVectorChunkingEmbeddingModel
protected TextFloatVectorChunkingEmbeddingModel(TextFloatVectorEmbeddingModel target, int chunkSize, int overlap) - Parameters:
target-chunkSize- Chunk size, if non-positive, then target max input tokens is used as chunk sizeoverlap-
-
-
Method Details
-
getProvider
Description copied from interface:CoordinatesProvider name - OpenAI, Ollama, ...- Specified by:
getProviderin interfaceCoordinates- Returns:
-
getName
Description copied from interface:CoordinatesModel name- Specified by:
getNamein interfaceCoordinates- Returns:
-
getVersion
- Specified by:
getVersionin interfaceCoordinates
-
getMaxInputTokens
- Specified by:
getMaxInputTokensin interfaceModel
-
isTooLong
- Specified by:
isTooLongin interfaceTextFloatVectorEmbeddingModel- Parameters:
input-- Returns:
- true if the input is too long for a given model
-
getDimensions
- Specified by:
getDimensionsin interfaceVectorEmbeddingGenerator<String,Float> - Returns:
- number of dimensions, -1 if unknown
-
chunk
-
generateAsync
- Specified by:
generateAsyncin interfaceEmbeddingGenerator<String,List<List<Float>>>
-
generate
-
encode
Encodes a string into tokens- Parameters:
input-- Returns:
-
decode
Decodes a string from an array of tokens- Parameters:
tokens-- Returns:
-
size
-
slice
-