Takes a list of markdown text chunks and merges them into named sections. Each section name is extracted from the markdown header (# Title).
merge_text_chunks_named(
text_chunks,
remove_tables = TRUE,
remove_figure_captions = TRUE
)A named character vector where:
Names are section titles (without the # symbol)
Values are complete section contents (including the title line)
A list of character strings with markdown text from sequential PDF chunks
Logical. If TRUE, removes all table content including captions. Default is FALSE.
Logical. If TRUE, removes figure captions. Default is FALSE.