Pdf2MdOptions: {
    split_headings?: boolean | null;
    system_prompt?: string | null;
    use_pdf2md_ocr: boolean;
}

We plan to deprecate pdf2md in favor of chunkr.ai. This is a legacy option for using a vision LLM to convert a given file into markdown and then ingest it.

Type declaration

  • Optionalsplit_headings?: boolean | null

    Split headings is an optional field which allows you to specify whether or not to split headings into separate chunks. Default is false.

  • Optionalsystem_prompt?: string | null

    Prompt to use for the gpt-4o model. Default is None.

  • use_pdf2md_ocr: boolean

    Parameter to use pdf2md_ocr. If true, the file will be converted to markdown using gpt-4o. Default is false.