compute_propensity_scores

Compute inverse propensity scores based on a label distribution. Propensity
scores for extreme multi-label learning are proposed in Jain, H., Prabhu, Y.,
&amp; Varma, M. (2016). Extreme Multi-label Loss Functions for Recommendation,
Tagging, Ranking and Other Missing Label Applications. Proceedings of the
22nd ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining, 13-17-Aug, 935–944. tools:::Rd_expr_doi("10.1145/2939672.2939756").

Perform evaluation of automatic subject
indexing methods. The main focus of the package is to enable efficient
computation of set retrieval and ranked retrieval metrics across multiple
dimensions of a dataset, e.g. document strata or subsets of the label set.
The package also provides the possibility of computing bootstrap confidence
intervals for all major metrics, with seamless integration of parallel
computation and propensity scored variants of standard metrics.

Maximilian Kähler

casimir

Comparing Automated Subject Indexing Methods in R

Markus Schumacher

Deutsche Nationalbibliothek 

compute_propensity_scores function

<dl><dt>label_distribution</dt>
<dd>Expects a data.frame with columns <code>"label_id",
 "label_freq", "n_docs"</code>. <code>label_freq</code> corresponds to the number of
occurences a label has in the gold standard. <code>n_docs</code> corresponds to
the total number of documents in the gold standard.</dd>
<dt>a</dt>
<dd>A numeric parameter for the propensity score calculation, defaults
to 0.55.</dd>
<dt>b</dt>
<dd>A numeric parameter for the propensity score calculation, defaults
to 1.5.</dd></dl>

Arguments

Compute inverse propensity scores — compute_propensity_scores

<dl>

<dt>label_distribution</dt>
<dd>Expects a data.frame with columns <code>"label_id",
 "label_freq", "n_docs"</code>. <code>label_freq</code> corresponds to the number of
occurences a label has in the gold standard. <code>n_docs</code> corresponds to
the total number of documents in the gold standard.</dd>


<dt>a</dt>
<dd>A numeric parameter for the propensity score calculation, defaults
to 0.55.</dd>


<dt>b</dt>
<dd>A numeric parameter for the propensity score calculation, defaults
to 1.5.</dd>

</dl>

compute_propensity_scores: Compute inverse propensity scores

Description

Usage

Value

Arguments

Examples