Learn R Programming

⚠️There's a newer version (1.0.21) of this package.Take me there.

pubmed.mineR (version 1.0.1)

Text mining of PubMed Abstracts.

Description

An R package for text mining of PubMed Abstracts (http://www.ncbi.nlm.nih.gov/pubmed). The algorithms are designed for two formats (text and XML) from PubMed

Copy Link

Version

Install

install.packages('pubmed.mineR')

Monthly Downloads

355

Version

1.0.1

License

GPL-3

Maintainer

S Ramachandran

Last Published

March 18th, 2014

Functions in pubmed.mineR (1.0.1)

Yearwise-methods

Yearwise Year wise extraction of Abstracts
R2S4

S4 Converter
getabs

To get Abstracts for a given term.
Find_conclusion

To find the conclusion from the abstract(s).
Pathway_Info

To get the information of pathways for Genes
HGNCdata

R Data containing HGNC data.
HGNC-class

HGNC Class for package.
tdm_for_lsa

create Term Document Matrix for lsa analysis
common_words_new

R Data containing words which frequently in text
combineabs

To combine the abstracts
searchabsL-methods

Searching Abstracts
sendabs-methods

To send the Data into a File
cleanabs-methods

Methods for Function cleanabs
Genewise

To Search the number of abstracts for Genes
gene_atomization

To Extract Genes from the Abstracts
sendabs

To send abstracts
cos_sim_calc

To calculate the cosine similarity between terms.
cluster_words

To Find the highest frequency of words within clusters
removeabs-methods

removeabs To remove abstracts of a term from the data.
contextSearch

For Context Search
ready

To Initiate the Classes.
wordscluster

To cluster the words
find_intro_conc_html

To find the introduction and conclusion from the abstracts.
whichcluster

To fetch the cluster for words
getabsT-methods

To Get Abstracts
get_original_term

To get the original terms from the corpus.
subabs-methods

Getting subabstracts
getabsT

To get Abstracts for a given term.
searchabsT-methods

searchabsT Searching abstracts
searchabsL

To Search the abstracts of term(s) in a combination mode.
xmlreadabs

To read the abstracts from the PubMed saved in XML format.
readabs

To read Abstracts
word_atomizations

Atomization of words
printabs

To prind the total number of abstracts in an S4 object of class Abstracts , its start and end
contextSearch-methods

Method for Context Search
xmlgene_atomizations

Gene atomization of xml abstracts.
SentenceToken

To Tokenize the sentences
input_for_find_intro_conc_html

fetch the abstracts using E-utilities.
uniprotfun

To get information about gene from the UniProt.
GeneToEntrez

Data containing Entrez Ids
combineabs-methods

Abstracts Method to Combine Abstracts
Yearwise

To Search abstracts Year wise
Genewise-methods

method to find the abstracts for the given gene.
cleanabs

To clean the result of searchabsL
removeabs

To remove abstracts for the query term.
HGNC2UniprotID

R Data containing HGNC2UniprotID data mapping.
Pathway_Link

To get the Links of the pathways for given genes
searchabsT

To Search Abstracts
xmlword_atomizations

Word atomizations of abstracts from xml format.
Abstracts-class

Class "Abstracts" Absract Class
cos_sim_calc_boot

Cosine Similarity Calculation by Boot Strapping
subabs

To find sub-abstracts
wordsclusterview

To view the words in cluster
getabs-methods

getabs To Get abstracts for a term
pubtator_function

function for text annotation uisng online PubTator
currentabs_fn

To Retrive the Abstracts for year.
prevsymbol_fn

To Find Previous symbols of genes.
BWI

To Get the Buzz Word Index of terms form the Abstracts.
local_uniprotfun

To Get Information from Uniprot.
get_PMCIDS

To extract the PMC Ids of the abstracts.
altnamesfun

To Get Alternative names of Genes
get_original_term2

To get the original terms from the corpus.
official_fn

To extract the official gene symbol.
genes_BWI

Function to get the Buzz Word Index of Genes from the abstracts.
get_PMCtable

To fetch the given PMC article.
Give_Sentences

To extract sentences from the Abstracts
get_Sequences

To extract the Gene sequence from the NCBI.
previousabs_fn

To Retrive the Abstracts from the large corpus for given years.
subsetabs-methods

To make subset of Abstracts.
get_gene_sentences

To extract the sentences for genes from the corpus.
subsetabs

To make subsets of large corpus.
head_abbrev

To extract the abbreviated term.
DAVID_info

To retreive information from DAVID.
pubtator_result_list_to_table

Function to Convert Pubtator result from list into Table
names_fn

To extract the gene names from HGNC.
Give_Sentences_PMC

To fetch the sentence from the PMC full text article
alias_fn

To Find Alias of the Genes.
pmids_to_abstracts

To Find and match the PMIDs to the abstracts.
get_NMids

To extract NM ids from NCBI.