all_simple_paths returning fewer paths than expected

rayasare · 15 December 2023 22:02

I have a 60-node graph with one connected component, in which there is a path between every pair of nodes. I am trying to enumerate all simple paths between nodes. I expect the longest simple path to be of length 60 after iterating through all nodes and using get_all_simple_paths with a cutoff=-1, but instead the longest simple path is only of length 27. Might I be doing something wrong here / thinking about this incorrectly?

To reproduce

from tqdm import tqdm
from igraph import Graph 
import itertools

G = Graph.Load('graph60.gml', format='gml')
sp_dicts = {}
sp_list = []
end_num = 60

for i, j in tqdm(itertools.combinations(range(end_num), 2)):
    simple_path_lists = G.get_all_simple_paths(i, to=j, mode="OUT", cutoff=-1)

    sp_dicts[f'{i}_{j}'] = [simple_path for simple_path in simple_path_lists]

    for path_list in simple_path_lists:
        sp_list.append(sorted(path_list)) 

# Convert lists to tuples and then to a set to get unique tuples
unique_tuples = set(tuple(lst) for lst in sp_list)

# Convert unique tuples back to lists
unique_lists = [sorted(list(tpl)) for tpl in unique_tuples]
n_mer_list = sorted(unique_lists, key=lambda x: (len(x), x))
print(f'length of longest path: {len(n_mer_list[-1])}')

Version information
‘0.11.3’ of python-igraph from pip

vtraag · 16 December 2023 11:27

Why do you expect to find a path that visits every vertex in this graph? This is called a Hamiltonian path, and it is not guaranteed to exist.

szhorvat · 18 December 2023 13:19

It is easy to see that no such path exists: there are several vertices with zero in-degree.

rayasare · 18 December 2023 21:04

Thanks for the responses – it helped me realize that I made a mistake when generating the graph. The corrected graph should be undirected. I believe that a path should exist that visits every vertex since the graph is undirected and since it has only one connected component.

As a follow-up: the docs mention that get_all_simple_paths may run out of memory for exponentially many paths between two vertices. I noticed that on my newly generated, undirected graph, this same calculation appears to be suffering from this limitation. Do you have any suggestions to speed up this calculation of simple paths between nodes?

I have tried reducing the cutoff parameter of get_all_simple_paths to half of the node length, because I can intuit the paths that I need from only this truncation, but the calculation is still very slow. I suspect that part of the slowdown comes from creating the new, undirected graph with the full adjacency matrix, whereas the other (accidentally) directed graph was created with only the upper triangular adjacency matrix and was definitely sparser. Any suggestions are welcomed.

szhorvat · 20 December 2023 18:34

This function is already very efficient, and it is unlikely that it could be sped up significantly.

There is a feature request to add a version which does not store results, but instead calls a function with each path that is found. This won’t speed up the calculation, in fact it will slow it down. But it will eliminate the need to store all paths at the same time. You can instead process them one by one as they are found. If this will help your use case, add your vote here:

github.com/igraph/igraph

Wishlist: get_all_simple_paths() should have a callback version

opened 05:56PM - 01 Jun 23 UTC

szhorvat

wishlist

**What is the feature or improvement you would like to see?** `igraph_get_all…_simple_paths()` produces an exponentially large number of results. Among such functions, it is unique in not having a callback version, so results can be filtered using arbitrary criteria before storing them. This is a proposal to add a callback version. This would be better done with iterators, but we are a long way off from building a consistent iterator interface. **Use cases for the feature** Reducing the result size based on arbitrary criteria. **References** - #2355 - #2356 - User request: https://igraph.discourse.group/t/how-to-break-up-all-simple-paths-calculation/1570?u=szhorvat

If you know C, a pull request is very welcome. This is not a difficult task.

You can add your vote to the following two feature requests if they help your use case. These would make it possible to filter paths by length without needing a (slow) callback.

github.com/igraph/igraph

Wishlist: get_all_simple_paths() lower and upper bounds on path length

opened 05:48PM - 01 Jun 23 UTC

szhorvat

wishlist

**What is the feature or improvement you would like to see?** `igraph_get_all…_simple_paths()` has a `cutoff` argument specifying an upper bound on path length. There should also be a lower bound. **Use cases for the feature** While this does not affect the performance of calculation (short paths need to be computed in order to extend them to longer ones), pre-filtering results by length is practically useful for reducing the size of the output. Having both an upper and a lower bound makes it convenient to find paths of a specific length. **References** - Mathematica has this as a built-in: https://reference.wolfram.com/language/ref/FindPath.html - User request: https://igraph.discourse.group/t/how-to-break-up-all-simple-paths-calculation/1570?u=szhorvat

github.com/igraph/igraph

Wishlist: get_all_simple_paths() with weighted path length

opened 05:53PM - 01 Jun 23 UTC

szhorvat

wishlist

**What is the feature or improvement you would like to see?** `igraph_get_all…_simple_paths()` has an integer `cutoff` argument, specifying the maximum unweighted path length. #2355 proposes adding a minimum length as well. This should be extended to the weighted case. **Use cases for the feature** Many, if not most network datasets are weighted. In spatially embedded networks, for example road networks, the relevant measure is typically not the number of hops, but the physical length of paths. **References** - Mathematica has this as a built-in: https://reference.wolfram.com/language/ref/FindPath.html

Topic		Replies	Views
How to break up all simple paths calculation? Usage R	2	390	1 June 2023
How to limit length of paths in a graph？ Usage Python	6	251	6 April 2023
Speed Improvments on get_all_simple_paths Usage Python	5	1035	5 November 2020
How to get all the shortest paths from i to other vertices but within a certain number of edges Usage R	4	531	1 May 2021
How to set the route or path to any node/vertex in iGraph in python Usage Python	6	306	20 April 2022

all_simple_paths returning fewer paths than expected

Related topics