Generating synthetic complex networks with realistic topology and ground-truth community structure

by James Gilbert (University of Nottingham)

16:00 (40 min) in CT 7.01

Much of the recent focus on module detection has been geared towards developing new algorithms capable of detecting biologically significant clusters. However, evaluating clusterings detected by different methods shows that there is little topological agreement or consensus in terms of meta-data despite most methods discovering modules with significant ontology.

In this talk I will discuss CiGRAM, a model of complex networks with known modular structure that is capable of generating realistic graph topology. CiGRAM generates large scale graphs with heterogeneous degree distributions, high clustering coefficients and assortative degree correlations observed in real data but often ignored in existing benchmarks. This model can be tuned to fit many empirical biological and non-biological datasets through fitting target graph summary statistics. The ground-truth structure allows the evaluation of module extraction algorithms in a domain specific context. Furthermore, it was found that degree assortativity appears to negatively impact several module extraction methods such as the popular infomap and modularity maximisation methods.