GEDLIB  1.0
analyze_dataset.py File Reference

Python script that computes statistics of a given dataset. More...

Go to the source code of this file.

Detailed Description

Python script that computes statistics of a given dataset.

Usage:

$ python sample.py <dataset> <dir> [-h] [--help] [--topology] [--max_size <maximal number of nodes>] [--distr <data file to store node and edge distribution>]

Arguments:

positional arguments
<dataset> path to existing graph collection XML file representing the dataset that should be analyzed; must respect GraphCollection.dtd
<dir> path to directory containing GXL files
optional arguments
-h show help
–help show help
–topology also compute mean number of connected components and ratios of acyclic and planar graphs
–max_size <maximal number of nodes> only consider graphs with at most <maximal number of nodes> many nodes
–distr <data file to store node and edge distribution> store distributions of number of nodes and edges as 2-dimensional histogram, i.e., as a data file whose rows are of the form “<number of nodes> <number of edges> <count>

Definition in file analyze_dataset.py.