Controls the post-processing of each segment type.
Allows you to generate HTML and Markdown from chunkr models for each segment type.
By default, the HTML and Markdown are generated manually using the segmentation information except for Table, Formula and Picture.
You can optionally configure custom LLM prompts and models to generate an additional llm field with LLM-processed content for each segment type.
The configuration of which content sources (HTML, Markdown, LLM, Content) of the segment
should be included in the chunk's embed field and counted towards the chunk length can be configured through the embed_sources setting.
Controls the post-processing of each segment type.
Allows you to generate HTML and Markdown from chunkr models for each segment type. By default, the HTML and Markdown are generated manually using the segmentation information except for
Table
,Formula
andPicture
. You can optionally configure custom LLM prompts and models to generate an additionalllm
field with LLM-processed content for each segment type.The configuration of which content sources (HTML, Markdown, LLM, Content) of the segment should be included in the chunk's
embed
field and counted towards the chunk length can be configured through theembed_sources
setting.