Dive deep into each of the chunking strategies supported by Aryn DocParse
Suppose you want the Section Header “Position-wise Feed-Forward Networks” to be in the same chunk as the Formula (2) “FFN(x)=…”. Calling DocParse with the following chunking options will group the two elements together:
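The options themselves are not reproduced in this text, so the block below is only a sketch of what such a call might look like. It assumes the `aryn-sdk` Python client, whose `partition_file` function accepts a `chunking_options` dictionary, and it assumes that `context_rich` is the strategy in question. The `max_tokens` value and the file name `paper.pdf` are illustrative placeholders, not values taken from this guide.

```python
# Sketch only: option names follow the aryn-sdk `chunking_options` dictionary
# as an assumption; check the SDK reference before relying on them.
chunking_options = {
    "strategy": "context_rich",  # assumed strategy that keeps headers with their content
    "max_tokens": 512,           # illustrative per-chunk token budget
}

# Hypothetical usage (requires the aryn-sdk package and an Aryn API key):
# from aryn_sdk.partition import partition_file
# with open("paper.pdf", "rb") as f:
#     result = partition_file(f, chunking_options=chunking_options)
```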
The Section Header and the Formula are now chunked together into one element:
The maximize_within_limit strategy is meant to be used when you want to merge several consecutive elements together into a large chunk. Take the following example:
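The merging idea can be sketched in a few lines of Python. This is only an illustration of greedy packing under a token limit, not DocParse's actual implementation: `Element`, `count_tokens`, and `merge_within_limit` are hypothetical names, the element texts are made up, and whitespace splitting stands in for a real tokenizer.

```python
from dataclasses import dataclass


@dataclass
class Element:
    type: str  # e.g. "Section-header", "Text", "Formula"
    text: str


def count_tokens(text: str) -> int:
    # Stand-in tokenizer: one token per whitespace-separated word.
    return len(text.split())


def merge_within_limit(elements: list[Element], max_tokens: int) -> list[str]:
    """Greedily pack consecutive elements into chunks of at most max_tokens.

    An element that alone exceeds the limit is kept as its own chunk
    rather than being split.
    """
    chunks: list[str] = []
    current: list[str] = []
    current_tokens = 0
    for element in elements:
        n = count_tokens(element.text)
        # Close the current chunk if adding this element would exceed the limit.
        if current and current_tokens + n > max_tokens:
            chunks.append("\n".join(current))
            current, current_tokens = [], 0
        current.append(element.text)
        current_tokens += n
    if current:
        chunks.append("\n".join(current))
    return chunks


elements = [
    Element("Section-header", "a b c"),
    Element("Text", "d e"),
    Element("Text", "f g h i"),
    Element("Formula", "j"),
]
merged = merge_within_limit(elements, max_tokens=5)
print(merged)
```

With a limit of 5 tokens, the four toy elements collapse into two chunks, each filled as far as the limit allows. Raising `max_tokens` produces fewer, larger chunks.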