Learn R Programming

daiR (version 1.0.0)

redraw_blocks: Inspect revised block bounding boxes

Description

Tool to visually check the order of block bounding boxes after manual processing (e.g. block reordering or splitting). Takes as its main input a token dataframe generated with build_token_df(), reassign_tokens(), or reassign_tokens2(). The function plots the block bounding boxes onto images of the submitted document. Generates an annotated .png file for each page in the original document.

Usage

redraw_blocks(json, token_df, dir = getwd())

Value

no return value, called for side effects

Arguments

json

filepath of a JSON file obtained using dai_async()

token_df

a token data frame generated with build_token_df(), reassign_tokens(), or reassign_tokens2().

dir

path to the desired output directory.

Details

Not vectorized, but documents can be multi-page.

Examples

Run this code
if (FALSE) {
redraw_blocks("pdf_output.json", revised_token_df, dir = tempdir())
}

Run the code above in your browser using DataLab