Hi I have a heterogeneous graph and I am trying to compute the closeness centrality on a subset:
A - (SENDS_TO) - B
Imagine this is part of a larger graph, e.g.,
A - (SENDS_TO) - B - (LIKES) - C - (FRIENDS) - D - (BUYS_FROM) - A
I am currently using the tg_closeness_cent_approx algo as
v_type = [“A”, “B”],
e_type = “SENDS_TO”,
re_type = “SENDS_TO”
max_size = 5000
There are ~2k A nodes and 172k B nodes. Each A is connected to a B.
However my results are all score=0.
Would anyone have an idea why?
This is an old version, the new one has better performance on large graph.
While we would have to study your situation a little more to see why you are getting 0, I do think that you subgraph is probably not a good candidate for closeness centrality.
Closeness centrality only makes sense when every vertex CAN REACH every other vertex. The centrality measure is calculating how many steps it takes on average to get to those other vertices.
If I understand your schema correctly, by including only A and B and only one edge type, it probably is not true that every vertex can reach every other vertex. If you can say instead “well, it’s not one connected graph, but it’s a reasonable number of connected subgraphs”, then we can treat each group separately and average their results. That is what the wf (Wasserman-Faust) parameter is for.
Thanks @Victor_Lee I will look into the wf parameter