graph:::length.communities (sometimes) produces the wrong length

KeesP · 25 July 2022 15:01

# as expected.
g <- make_full_bipartite_graph(4,4)
clu <- structure( list(membership = c(1, 1, 1, 1, 2, 2, 2, 2)
                , algorithm = "onto"), class = "communities")
length(clu)
clu[length(clu)]
[1] 2
$`2`
[1] 5 6 7 8

# not as expected when membership vector in cummunities is corrupted.
g <- make_full_bipartite_graph(4,4)
clu <- structure( list(membership = c(1, 1, 1, 1, 4, 4, 4, 4)
                , algorithm = "outside (1,2)"), class = "communities")
length(clu)
[1] 4

This can harm applications that depend on ‘[[’, e.g. plot.

clu[[length(clu)]]
Error in groups(x)[[i]] : subscript out of bounds.

tamas · 28 July 2022 19:00

For the record: this is due to a bug in the C core, and it will be resolved automatically after introducing the fix in the C core:

> g <- make_empty_graph(1, directed=F)
> clu <- cluster_leiden(g)
> length(clu)
[1] 1
> clu$membership
[1] 1

KeesP · 29 July 2022 07:53

The point I would like to make: if the groups in the $membership vector are not numbered consecutively and starting at 1, then max($membership) is wrong. A safer alternative is length(unique()). But that’s a lot of overhead for an unlikely situation. Is that worth it?

tamas · 29 July 2022 09:20

In theory, the C core of igraph will always return membership vectors where the communities are numbered consecutively and there are no empty communities. We have a function for that in the C core called igraph_reindex_membership(). I think that all community detection functions in the C core call this function in the end, and if not, that’s a bug in the C core and should be fixed there.

szhorvat · 29 July 2022 10:34

I added a test to the C core to verify that membership vectors use proper indexing when detecting communities in one particular random graph

Topic		Replies	Views
How to find cluster assignment of a node? Usage C	3	180	18 December 2023
Operations on igraph_vector_int_t - membership Usage C	0	176	28 June 2023
R/igraph 1.2.7 Announcements R	0	426	15 October 2021
Missing edges when detecting communities in dynamic social networks Usage R	0	116	29 January 2024
C/igraph 0.9.9 Announcements C	0	569	4 June 2022

graph:::length.communities (sometimes) produces the wrong length

Related topics