a data frame of size n x 3 with columns (id, concept, property). The empirical distribution is generated from this data
new_words
integer greater than 0, corresponding to the number of words with frequency one that should be added to the empirical distribution
number_subjects
number of subjects to be sampled. Each sampled subject generates new properties
Value
a vector with the number of additional participants needed to achieve the specified coverage, and the estimated number of unique properties that would be listed with the suggested number of participants
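
As an illustration of the input format described in the arguments, the following minimal sketch builds a toy n x 3 data frame and tabulates how often each property was listed. The object names (toy_data, property_counts, empirical_dist, singletons) and the toy values are hypothetical. Using relative frequencies as the "empirical distribution" is an assumption made for illustration and may differ from what the function computes internally.

```r
# Hypothetical toy data: one row per property listed by a subject for a concept.
toy_data <- data.frame(
  id       = c(1, 1, 2, 2, 3),
  concept  = rep("dog", 5),
  property = c("barks", "has fur", "barks", "loyal", "barks"),
  stringsAsFactors = FALSE
)

# Number of times each property was listed across subjects.
property_counts <- table(toy_data$property)

# Relative frequencies: one plausible reading of "empirical distribution"
# (an assumption, not necessarily the function's internal definition).
empirical_dist <- property_counts / sum(property_counts)

# Properties listed exactly once, i.e. words with frequency one.
singletons <- names(property_counts)[as.vector(property_counts) == 1]
```

In this toy data, properties that appear only once play the same role as the words counted by new_words: they are the words with frequency one in the empirical distribution.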