Learn R Programming

Summarize Text by Ranking Sentences and Extracting Keywords

This repository contains an R package which handles summarizing text by using textrank.

For ranking sentences, this algorithm basically consists of.

  • Finding links between sentences by looking for overlapping terminology
  • Using Google Pagerank on the sentence network to rank sentences in order of importance

For finding keywords, this algorithm basically consists of.

  • Extract words following one another to construct a word network
  • Using Google Pagerank on the word network to rank words in order of importance
  • Constructing keywords - which are the combination of relevant words identified by the Pagerank algorithm which follow each other

Installation & License

The package is available under the Mozilla Public License Version 2.0. Installation can be done as follows. Please visit the package documentation and package vignette for further details.

install.packages("textrank")
vignette("textrank", package = "textrank")

For installing the development version of this package: devtools::install_github("bnosac/textrank", build_vignettes = TRUE)

Support in text mining

Need support in text mining? Contact BNOSAC: http://www.bnosac.be

Copy Link

Version

Install

install.packages('textrank')

Monthly Downloads

721

Version

0.3.1

License

MPL-2.0

Issues

Pull Requests

Stars

Forks

Maintainer

Jan Wijffels

Last Published

October 12th, 2020

Functions in textrank (0.3.1)

textrank_candidates_all

Get all combinations of sentences
joboffer

The text of a job offer, annotated with the package udpipe
textrank_jaccard

Calculate the distance between 2 vectors based on the Jaccard distance
summary.textrank_sentences

Extract the most important sentences which were identified with textrank_sentences
textrank_sentences

Textrank - extract relevant sentences
textrank_keywords

Textrank - extract relevant keywords
textrank_candidates_lsh

Use locality-sensitive hashing to get combinations of sentences which contain words which are in the same minhash bucket