JATSdecoder (version 1.2.1)

A Metadata and Text Extraction and Manipulation Tool Set

Description

Provides a collection of functions to extract metadata, sectioned text, and study characteristics from scientific articles in 'NISO-JATS' format. Articles in PDF format can be converted to 'NISO-JATS' with the 'Content ExtRactor and MINEr' ('CERMINE'). For convenience, two functions bundle the extraction heuristics: JATSdecoder() converts 'NISO-JATS'-tagged XML files to a structured list with the elements title, author, journal, history, 'DOI', abstract, sectioned text, and reference list. study.character() extracts multiple study characteristics, such as the number of included studies, statistical methods used, alpha error, power, statistical results, correction method for multiple testing, and software used. The function get.stats() extracts all statistical results from text and recomputes p-values for many standard test statistics. It then checks the reported p-values for consistency with the recalculated ones. The involved sample size is estimated from textual reports within the abstract and from the degrees of freedom reported in the statistical results. In addition, the package contains some useful functions to process text (text2sentences(), text2num(), ngram(), strsplit2(), grep2()). See Böschen, I. (2021), Böschen, I. (2021), Böschen, I. (2023), and Böschen, I. (2024).
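As a rough illustration of the workflow described above, the following sketch first recomputes a two-sided p-value in base R, which is the general idea behind a consistency check of reported results, and then calls a few package functions on plain text. The example sentences are made up, "article.xml" is a hypothetical path to a NISO-JATS file, and the base-R part is not the package's own implementation. All package calls rely on default arguments only.

```r
# Example sentences with reported results (made up for illustration).
example_text <- c(
  "The mean difference of scale A was significant (t(18)=2.5, p<.05).",
  "The correlation of x and y was r=.37."
)

# Base-R illustration of the idea behind a p-value consistency check
# (an assumption about the general approach, not the package's code):
# recompute the two-sided p-value for t(18) = 2.5 and compare it with
# the reported "p < .05".
recomputed_p <- 2 * pt(-abs(2.5), df = 18)
is_consistent <- recomputed_p < 0.05

# Package calls, run only if JATSdecoder is installed.
if (requireNamespace("JATSdecoder", quietly = TRUE)) {
  # Extract statistical results from the text.
  stats <- JATSdecoder::get.stats(example_text)

  # Split the text into sentences.
  sentences <- JATSdecoder::text2sentences(paste(example_text, collapse = " "))

  # "article.xml" is a hypothetical local NISO-JATS file.
  if (file.exists("article.xml")) {
    article <- JATSdecoder::JATSdecoder("article.xml")      # structured list
    traits  <- JATSdecoder::study.character("article.xml")  # study characteristics
  }
}
```

The guards around the package calls keep the sketch runnable when the package or the example file is missing; in real use they would normally be dropped.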

Install

install.packages('JATSdecoder')

Monthly Downloads

286

Version

1.2.1

License

GPL-3

Maintainer

Ingmar Böschen

Last Published

July 29th, 2025

Functions in JATSdecoder (1.2.1)

get.doi
get.references
get.country
get.contrib
get.outlier.def
pCheck
preCheck
get.test.direction
grep2
get.tables
has.interaction
standardStats
get.sentence.with.pattern
letter.convert
vectorize.text
ngram
text2sentences
get.stats
get.subject
get.history
get.journal
study.character
get.title
get.text
text2num
get.keywords
get.method
get.type
get.software
get.sig.adjectives
get.vol
which.term
strsplit2
get.R.package
get.author
allStats
get.alpha.error
get.category
est.ss
get.assumptions
get.abstract
get.aff
JATSdecoder
get.multi.comparison
get.editor
get.n.studies
get.power