merge_text_chunks_named

Takes a list of markdown text chunks and merges them into named sections.
Each section name is extracted from the markdown header (# Title).

Provides comprehensive tools for extracting and analyzing scientific
content from PDF documents, including citation extraction, reference matching,
text analysis, and bibliometric indicators. Supports multi-column PDF layouts,
'CrossRef' API <https://www.crossref.org/documentation/retrieve-metadata/rest-api/> integration, and advanced citation parsing.

Massimo Aria

contentanalysis

Scientific Content and Citation Analysis from PDF Documents

Corrado Cuccurullo

merge_text_chunks_named function

<dl><dt>text_chunks</dt>
<dd>A list of character strings with markdown text from sequential PDF chunks</dd>
<dt>remove_tables</dt>
<dd>Logical. If TRUE, removes all table content including captions. Default is FALSE.</dd>
<dt>remove_figure_captions</dt>
<dd>Logical. If TRUE, removes figure captions. Default is FALSE.</dd></dl>

Arguments

Merge Text Chunks into Named Sections — merge_text_chunks_named

<dl>

<dt>text_chunks</dt>
<dd>A list of character strings with markdown text from sequential PDF chunks</dd>


<dt>remove_tables</dt>
<dd>Logical. If TRUE, removes all table content including captions. Default is FALSE.</dd>


<dt>remove_figure_captions</dt>
<dd>Logical. If TRUE, removes figure captions. Default is FALSE.</dd>

</dl>

merge_text_chunks_named: Merge Text Chunks into Named Sections

Description

Usage

Value

Arguments