Slide 10
Splitting (Text Segmentation)
▪ Problem: the document is too large, has too much content, or is not concise enough
▪ by size (text length)
▪ by separator character (e.g. \n\n)
▪ by paragraph, sentence, words (until small enough)
▪ by size (tokens)
▪ overlapping chunks (token-wise)
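
The splitting strategies listed on this slide could be sketched roughly as follows. This is a minimal illustration, not the speaker's implementation: the names `split_recursive`, `overlapping_chunks`, and `SEPARATORS` are hypothetical, the separator order is an assumption, and whitespace-separated words stand in for real model tokens.

```python
from typing import List, Sequence

# Assumed separator order, coarse to fine: paragraph, line, sentence, word.
SEPARATORS = ("\n\n", "\n", ". ", " ")


def split_recursive(text: str, max_len: int,
                    separators: Sequence[str] = SEPARATORS) -> List[str]:
    """Split on the coarsest separator first and recurse with finer ones
    until every piece is at most max_len characters long."""
    if len(text) <= max_len:
        # Small enough: keep it (drop whitespace-only pieces).
        return [text] if text.strip() else []
    if not separators:
        # No separator left: fall back to a hard cut by size.
        return [text[i:i + max_len] for i in range(0, len(text), max_len)]
    head, rest = separators[0], separators[1:]
    pieces: List[str] = []
    for part in text.split(head):
        pieces.extend(split_recursive(part, max_len, rest))
    return pieces


def overlapping_chunks(tokens: Sequence[str], size: int,
                       overlap: int) -> List[List[str]]:
    """Slide a window of `size` tokens over the input; each window shares
    `overlap` tokens with the previous one."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    step = size - overlap
    chunks: List[List[str]] = []
    for start in range(0, len(tokens), step):
        chunks.append(list(tokens[start:start + size]))
        if start + size >= len(tokens):
            break  # the last window already reaches the end of the input
    return chunks


if __name__ == "__main__":
    text = "aaa bbb\n\nccc ddd eee"
    print(split_recursive(text, max_len=7))
    # Whitespace split is only a stand-in for a real tokenizer.
    print(overlapping_chunks(text.split(), size=3, overlap=1))
```

Note that this sketch drops the separators and does not merge small neighbouring pieces back up to the size limit, which a production splitter would typically do. For token-accurate sizes, the whitespace split would be replaced by the tokenizer of the embedding or language model in use.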
Improving RAG solutions based on real-world experiences
`Talk to your data`