CreateDataCV(net , p = 0.75 , G = 50 ,
net_type = "directed" , deg_thresh = 0 , exclude_end = FALSE)(from_node id, to_node id, time_stamp). from_node id is the id of the source node. to_node id is the id of the destination node. time_stamp is the arrival time of the edge. from_node id and to_node id are assumed to be integers starting from \(0\). time_stamp can be either numeric or string. The value of a time-stamp can be arbitrary, but we assume that a smaller time_stamp (regarded so by the sort function in R) represents an earlier arrival time.0 and 1. Indicates the ratio of number of new edges in the learning data to that of the full data. Default is p = 0.75.
"directed" or "undirected". Default is "directed".
0,i.e. all the nodes.
TRUE, then for the testing data, at each time-step we only consider the new edges that connect to nodes with the current degrees less than \(deg\_max\), which is the maximum degree in the learning data. The motivation for this option is that in the learning phase, we can only learn the PA function up to \(deg_max\), so it makes sense to limit the degree in the testing phase to \(deg\_max\). From our experiences, this option does not matter. Default value is FALSE"CV_Data" containing the data needed for cross validation.library("PAFit")
net <- GenerateNet(N = 100 , m = 1 , mode = 1 , alpha = 1 , shape = 5 , rate = 5)
data_cv <- CreateDataCV(net$graph)
summary(data_cv)
Run the code above in your browser using DataLab