Originally Posted by
melysion I need to find a method that tells me which class has samples that are related to the greatest number of samples from other classes and also get a general feeling for the distribution of samples being related to other samples in different classes for each graph.
Hi, melysion.
For the method, you could compare the mean number of related samples. For example, suppose this is the data.
Code:
Class 1 Class 2
0 50 0 100
1 25 1 60
2 15 2 20
3 10 3 20
N 100 N 200
Sum 85 Sum 160
Mean .85 Mean .80
Class 1 has 100 samples which are related to a total of 85 samples from other classes. This is a mean of .85 related samples per sample. Class 2 has more samples and related samples, but the mean of .80 related samples per sample is lower. So Class 1 is on average related to a greater number of samples from other classes.
To get a general feeling for the distribution, you could combine all the samples and create a bar chart for that.