Generates a dataset used to exemplify clustering The cluster centers are set relatively far away to see how well the algorithm performs in simple scenarios
generate_split_data(n, sd)
Number of datapoints to generate
Standard deviation from the cluster center
Returns the datapoints and the cluster assignments. The cluster assignments can be used to calculate the performance of the clustering.