Deep Learning based Network Similarity for Model Selection

Tracking #: 670-1650

Authors:

	Name	ORCID
	Kushal Veer	https://orcid.org/0000-0003-1406-215X
	Ajay Kumar Verma	https://orcid.org/0000-0002-4163-0100
	Lovekesh Vig	https://orcid.org/0000-0001-9834-3308

Responsible editor:

Michael Maes

Submission Type:

Research Paper

Abstract:

Capturing data in the form of networks is becoming an increasingly popular approach for modeling, analyzing, and visualizing complex phenomena in order to understand the important properties of the underlying complex processes. Access to many large-scale network datasets is restricted due to privacy and security concerns. Also, for several applications (such as functional connectivity networks), generating large-scale real data is expensive. For these reasons, there is a growing need for advanced mathematical and statistical models (also called generative models) that can account for the structure of these large-scale networks without having to materialize them in the real world. The objective is to provide a comprehensible description of the network properties and to be able to infer previously unobserved properties. Various models have been developed by researchers, which generate synthetic networks that adhere to the structural properties of real networks. However, the selection of the appropriate generative model for a given real-world network remains an important challenge. In this paper, we investigate this problem and provide a novel technique (named TripletFit) for model selection (or network classification) and estimation of structural similarities of complex networks. The goal of network model selection is to select a generative model that is able to generate a structurally similar synthetic network for a given real-world (target) network. We consider six outstanding generative models as the candidate models. The existing model selection methods mostly suffer from sensitivity to network perturbations, dependency on the size of the networks, and low accuracy. To overcome these limitations, we considered a broad array of network features, with the aim of representing different structural aspects of the network, and employed deep learning techniques such as a deep triplet network architecture and a simple feed-forward network for model selection and estimation of structural similarities of complex networks. Our proposed method outperforms existing methods with respect to accuracy, noise tolerance, and size independence on a number of gold-standard datasets used in previous studies.

Manuscript:

ds-paper-670.pdf

Previous Version:

Deep Learning based Network Similarity for Model Selection

Special issue (if applicable):

Data repository URLs:

Date of Submission:

Monday, December 14, 2020

Date of Decision:

Tuesday, March 23, 2021

Nanopublication URLs:

Decision:

Solicited Reviews:

Review #1 submitted on 04/Jan/2021

By Viktoria Spaiser

https://orcid.org/0000-0002-5892-245X

Review Details

Reviewer has chosen not to be Anonymous

Overall Impression: Average
Suggested Decision: Undecided
Technical Quality of the paper: Average
Presentation: Average
Reviewer's confidence: Low
Significance: Moderate significance
Background: Reasonable
Novelty: Clear novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The length of this manuscript is about right

Summary of paper in a few sentences (summary of changes and improvements for second round reviews):

The authors have revised the paper in accordance with the reviews they received. This includes addressing concerns about the assortativity feature and its dependence on network size, exploring network similarity measures for the generated models in the case study, revising Table 2, and adding Table 3, which makes the case study more convincing. They, however, dismissed a reviewer's concern about insufficient variability without explaining this further. The discussion section has been modified but not much expanded, which is something that the reviewers explicitly requested.

Reasons to accept:

The authors propose a methodological advancement that might be relevant to network researchers.

Reasons to reject:

The paper still suffers from some weaknesses, not least the language. The authors would be advised to get a professional editor to make sure the paper is grammatically correct and readable. I would make acceptance conditional on proper proof-reading and editing.
I also think that the discussion section needs to be expanded.

Nanopublication comments:

Further comments:

Review #2 submitted on 12/Mar/2021

By Tobias Kuhn

https://orcid.org/0000-0002-1267-0234

Review Details

Reviewer has chosen not to be Anonymous

Overall Impression: Average
Suggested Decision: Undecided
Technical Quality of the paper: Average
Presentation: Average
Reviewer's confidence: Low
Significance: Moderate significance
Background: Reasonable
Novelty: Limited novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The length of this manuscript is about right

Summary of paper in a few sentences (summary of changes and improvements for second round reviews):

This manuscript describes an approach to estimate the similarity of networks based on a Deep Learning model. The goal is to predict the best generative model to approximate a given real network.

Reasons to accept:

I am focusing my review here on how well the comments by the previous reviewers have been taken into account.

The authors have, in my view, sufficiently addressed the issue of assortativity and provided more information on the case study.

Reasons to reject:

The problem of variability in the network properties / parameters is, in my view, not sufficiently taken care of. Table 1 still looks very suspicious, and the mentioned range of network sizes seems to confirm that suspicion. Moreover, Figure 4 is highly confusing, as it includes results obtained under different settings. It seems likely that TripletFit had an unfair advantage. See my points below for more details.

Nanopublication comments:

Further comments:

Main points:

- "The size of the network is randomly chosen from 1000 to 5000 nodes": This seems to be quite a narrow range. Why was this range not made larger (e.g., from 10 to 1 million nodes)?

- This is related to the issue of variability raised by one of the previous reviewers. Covering larger, and in particular also smaller, networks would increase this variability and probably change Table 1 significantly. (For example, networks of just 10 nodes are probably often indiscernible, and so no perfect accuracy will be possible.)

- It seems the dataset and process used by the authors of [28] were not identical to the ones presented here, and therefore Figure 4 is highly confusing (the top performance of TripletFit was achieved under different settings than the other methods).

Further points:

- The data points in the plot in Figure 3 are made of numbers, but they mostly overlap so much that the numbers cannot be read. Therefore, the data points should rather be small dots (or crosses) instead of numbers (possibly even semi-transparent, to convey the density visually too).

- It's unclear how the triplets are generated (one per dataset?). It seems this can't be exhaustive, so I assume some elements are picked randomly, but it's not clear how.

- Are the equations (10)-(14) different from what other such approaches normally use? This should be stated, and if they are the same, possibly not all these equations need to be shown here.

Minor points:

- Abstract: "network's" > "networks"
- Introduction: "Lots of literature" should be phrased better (e.g. "Many existing works" or "A large amount of existing literature")
- Introduction: missing "and" in front of "classical graph similarity approaches"
- Materials and Methods: "network structural similarity" should probably be "structural network similarity"
- page 7: "Deep Learning algorithm" > "The Deep Learning algorithm" or just "Deep Learning"
- Figure 3: a legend with the color codes would be helpful.
- page 11: "iteartion" > "iteration"
- equation (15) is unnecessary, as the Euclidean distance is so well known and simple.
- Figure 6: also show numerical values in addition to color shades

RESPONSE TO REVIEWERS

We would like to thank the reviewers for their very valuable comments and suggestions for improvement. The revised version of the manuscript addresses their comments and suggestions, and we provide a point-by-point response below (quotations from the reviewers are indicated by prefixing lines with ‘>’).

> Reviewer # 1

> The authors engage with existing research in the field and establish the limitations of existing methods of generative model selection, which they seek to address.
> They suggest a new sophisticated methodology, which appears to be sound and results in convincing outcomes.

We thank the reviewer for these positive comments.

> The descriptions are not always clear, in particular how the two processes, (1) learning of network similarity and (2) model selection based on classification exactly interact/inform each other is not entirely clear to me.
> The authors are encouraged to proof-read their paper again and correct typos etc.

As requested by the reviewer, the manuscript was modified to improve readability.

> Reviewer # 2

> The authors use a grab-bag of network features, hoping to capture the topology of networks. My concern is with one of their features; the assortativity coefficient, has been shown to depend upon network size (Nelly Litvak and Remco van der Hofstad, Physical Review E 87, 022801 (2013)). A size-dependent feature is at odds with the authors’ aim to have a “distance metric that is agnostic” to network size. This is especially relevant for at least two of the models used in their set of generative models: the Barabasi-Albert model and the Watts-Strogatz model.

We thank the reviewer for the comments and the input on the size dependence of the assortativity feature. A paragraph (subsection 4.4) has been added to the revised manuscript to specifically address this issue. Here is an explanation of the changes introduced.

We carried out an extensive analysis of this issue and plotted the assortativity feature as a function of network size for all types of networks considered in this study. Further, we examined how critical this particular feature is for the conclusions reached with our proposed method by re-evaluating the whole model in the absence of this feature. In Appendix A (Figures 7 to 12), the revised manuscript shows boxplots of assortativity in various network size ranges for the different network types. We confirm that for a few network types, particularly the Barabasi-Albert model, assortativity slightly increases with network size. The major implication of this observation is that the proposed approach should not be used for a very large range of network sizes. Note that the dependencies are rather weak within a reasonable range of network sizes; hence the proposed model can be used effectively, as demonstrated in our study. Further, we carried out an additional experiment in which we removed the assortativity feature from our model, and the results show that this feature is not critical to the overall strategy of the proposed network comparison methodology. Thus, if users intend to reuse our model for a very large range of network sizes, they are advised to remove the assortativity feature; the cost of doing so in terms of model performance is not very high.

> In the results section, figure 3 is a convincing, low dimension, demonstration of why the model selection method as presented is effective. The figure clearly shows that the distance measure learned is an effective discriminator between network models. However their evaluation of the model selection approach (Table 1) seems to indicate that their generated model instances (and I am speculating here in the absence of any indication of the range of model parameters used) do not have enough variability.

The reviewer finds the results shown in Figure 3 convincing, which means the generative models are well separated (not clustered together). Table 1 shows that the predictability of instances of the different generative models is almost 100 percent. The reviewer's comment about the variability in Table 1 does not seem to be relevant.

> While the authors take some care to present their method, the main result, called Case Study, for real networks is simply given as a table with the closest generative model selected for each real network. There is no way for the reader to assess how well the generative model fits the data. The method developed by the authors learns a metric. It would be useful to see the closeness of the fit for each generative models to each real network. This could be achieved by giving a measure of the distance of each real network from each generative model.

In this paper, we consider model selection as a classification problem. As discussed in the manuscript, we can also utilize network similarity measures for the model selection problem. As suggested by the reviewer, we added Table 3 to the revised manuscript; it shows the average Euclidean distance between the embedded features of a real network and all instances of a particular generative model. The smallest average distance indicates the strongest similarity between the real network and a generative model. In our study, the model selected through model selection (using the classification scheme) and the model selected through network similarity (using the Euclidean distance measure) agreed completely (see Tables 2 and 3).

> In the current form, I would not accept the paper for publication. Perhaps with significant changes, the paper may be acceptable. The reasons for the decision: The result, in Table 2, for one of the real networks citHepTh, the selected generative model is the Erdos-Renyi random graph model. This result only indicates that none of the other 5 models fit the data well - fitting to a random graph model is like a “base-line” fit. There is no discussion of the results. I am not sure what the “Discussion” section is trying to convey.

We thank the reviewer for pointing out the error in Table 2, where the selected generative model should be FF instead of ER. We have corrected this mistake. As requested by the reviewer, Section 5 “Discussion” in the revised manuscript has been modified.

> The paper has many typos and grammatical errors. Examples (not exhaustive) are:
> page 2, line 13: ‘ ...to perform an effective model selection.....’, remove ‘an’.
> page 2, lines15-17: A run-on sentence that is out of place.
> page 3, line 4: ‘estimte’
> page 11, line 20: ‘1000 of network instances......’
> page 11, line 22: Missing ‘is’ in the first sentence.
> page 11, line 30: ‘ecah iteartion’
> page 12, line 8: Missing space before ‘More’.
> page 12, line 37: ‘....the randomly chosen pair of nodes.’
> page 12, line 37: ‘....the the....’
> page 13, line 34: ‘....we computes.....’
> page 13, line 34: ‘The question is, Is the euclidean....’ should be: The question: is the Euclidean......
> page 13, line 39: heatmap should not be capitalized.
> page 13, line 40: ‘....feture....’
> page 13, line 40: ‘.....diffrent.....’
> page 15, line 34: ‘ Despite most of the existing methods [19, 25, 27], the proposed distance based method.....’
> I am not sure of what the authors mean.
> page 15, line 42: ‘perhaps smaller from the size of the target network’ the ‘from’ should be ‘than’

Thank you, and we apologize for the inconvenience. The above concerns are addressed in the revised manuscript.
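
For readers who want to see the distance-based selection behind Table 3 spelled out, the following is a minimal sketch, not the authors' implementation. It assumes each network has already been mapped to an embedding vector; the function names, the model labels, and the two-dimensional toy embeddings are all hypothetical and chosen only for illustration.

```python
import math
from statistics import mean


def mean_distance(target, instances):
    """Average Euclidean distance from one embedding to a list of embeddings."""
    return mean(math.dist(target, inst) for inst in instances)


def select_model(target, embeddings_by_model):
    """Return the model with the smallest mean distance, plus all mean distances.

    `embeddings_by_model` maps a model label to the embeddings of its
    generated instances (hypothetical data layout, assumed for this sketch).
    """
    distances = {
        model: mean_distance(target, instances)
        for model, instances in embeddings_by_model.items()
    }
    best = min(distances, key=distances.get)
    return best, distances


# Toy, made-up 2-D embeddings; real embeddings would come from the trained network.
embeddings_by_model = {
    "ER": [[0.0, 0.0], [0.0, 2.0]],
    "BA": [[3.0, 4.0], [3.0, 4.0]],
}
target = [3.0, 4.0]

best, distances = select_model(target, embeddings_by_model)
print(best, distances)
```

Reporting the full dictionary of mean distances, rather than only the winning label, is what lets a reader judge how close the runner-up models are to the selected one.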

1 Comment

Meta-Review by Editor

Submitted by Tobias Kuhn on Tue, 03/23/2021 - 01:31

We ask you to carefully address all points raised by the reviewers, focusing on the size of the studied networks and on improving Figure 4 and Table 1. I also echo Reviewer 1's recommendation to extend the discussion section and to carefully proofread your paper.

Michael Maes (https://orcid.org/0000-0001-9416-3211)

Data Science
