Yes, there is actually a way to do this, @zerodot. Let me explain.
The way end-user uploads work actually allows this.
Here’s a rundown:
- The end-user upload process always looks at the maximum input length setting of a Pickaxe. This can be 1,000 tokens or 100,000 tokens.
- Then it looks at the size of the document. This could be 1,000 tokens, 30,000 tokens, or 3,000,000 tokens.
- If the document fits into the maximum input length, then the system puts the entire document contents into the conversation context. No vector embeddings. If it does not fit, it's turned into vector embeddings.
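The takeaway for Pickaxe users is that you can select which process you prefer for your use case by changing the maximum input length: increase it if you want larger documents to go into the context in full instead of being turned into vector embeddings.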
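
If it helps to see the rule written out, here is a rough sketch of that size check in Python. This is only an illustration of the behavior described above, not Pickaxe's actual code: `choose_upload_route` and `UploadRoute` are made-up names, and I'm assuming "fits" means the document's token count is less than or equal to the maximum input length (the real check may also reserve room for the prompt and the conversation).

```python
from enum import Enum


class UploadRoute(Enum):
    """Hypothetical labels for the two ways an uploaded document can be handled."""
    FULL_CONTEXT = "full_context"  # entire document placed in the conversation context
    EMBEDDINGS = "embeddings"      # document turned into vector embeddings


def choose_upload_route(document_tokens: int, max_input_tokens: int) -> UploadRoute:
    """Hypothetical helper mirroring the size check described in the post.

    Assumption: "fits" means document_tokens <= max_input_tokens.
    """
    if document_tokens <= max_input_tokens:
        return UploadRoute.FULL_CONTEXT
    return UploadRoute.EMBEDDINGS


# Same 30,000-token document, two different maximum input length settings.
small_limit = choose_upload_route(document_tokens=30_000, max_input_tokens=1_000)
large_limit = choose_upload_route(document_tokens=30_000, max_input_tokens=100_000)
print(small_limit.value)
print(large_limit.value)
```

With the smaller setting, the 30,000-token document does not fit and takes the embeddings route; with the larger setting, the same document fits and goes into the context in full.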