Graph classification algorithm based on graph structure embedding


Bibliographic details
Published in:	Expert systems with applications Vol. 161; p. 113715
Main authors:	Ma, Tinghuai, Pan, Qian, Wang, Hongmei, Shao, Wenye, Tian, Yuan, Al-Nabhan, Najla
Format:	Journal Article
Language:	English
Published:	New York: Elsevier Ltd, 15 December 2020; Elsevier BV
Subjects:	Algorithms; Bioinformatics; Classification; Criteria; Data mining; Data structures; Embedding; Feature extraction; Graph classification; Graph embedding; Graph theory; Graphs; Natural language processing; Neural network; Neural networks; Topology; Words (language)
ISSN:	0957-4174, 1873-6793

Description
Summary:
• Construct a “word list” of graph data from its subgraphs.
• Design a neural network for automatically training graph embeddings.
• Propose a graph classification algorithm based on graph embedding.

As data mining is applied in fields such as information science, bioinformatics, and network intrusion detection, more and more data exhibit new characteristics such as strong structure and complex relationships between data items. A graph, as a complex data structure, can describe the relationships between things. Traditional graph classification methods based on feature vector construction must select a construction criterion in advance, such as graph-theoretic indicators or occurrences of topological substructures, and then extract features from each graph in the graph set according to that criterion. However, this way of constructing graph feature vectors tends to lose structural information and requires substantial domain expertise. Inspired by the Word2Vec and Doc2Vec models in Natural Language Processing (NLP), this paper first constructs a “word list” of graph data consisting of subgraphs. It then designs a neural network for training graph embeddings that takes the graph itself as input and uses the “words” in the graph and the graph's attribute features as output, so that the network automatically learns the embedding corresponding to each graph. The graph embedding reflects not only the features of the graph itself but also the relative relationships among graphs. Finally, a common classifier can be applied to the trained graph embeddings to classify graphs. Experiments on real-world bioinformatics and social data sets demonstrate that the proposed algorithm has advantages over existing graph classification algorithms based on feature vector construction.
DOI:	10.1016/j.eswa.2020.113715
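
Illustrative sketch:	The summary describes building a “word list” of subgraphs for each graph, but this record does not say how the subgraphs are extracted. The Python sketch below is only an illustration under that gap: it assumes Weisfeiler-Lehman-style rooted neighbourhoods as the subgraph “words”, which is a stand-in and not necessarily the authors' method. The function names (`wl_words`, `build_vocabulary`, `count_vector`) and the toy graphs are hypothetical. The count vectors it produces are a plain bag-of-words baseline, not the learned embedding; in the approach summarized above, a neural network would instead be trained to predict each graph's words from the graph itself.

```python
from collections import Counter


def wl_words(adj, labels, iterations=2):
    """Return the subgraph "words" of one graph (assumed WL-style extraction).

    adj:    dict mapping node -> list of neighbouring nodes (undirected graph)
    labels: dict mapping node -> initial node label (string)
    Each word encodes the rooted neighbourhood of a node up to a given depth.
    """
    current = dict(labels)
    # Depth-0 words are just the node labels.
    words = [f"0:{current[v]}" for v in adj]
    for depth in range(1, iterations + 1):
        relabeled = {}
        for v in adj:
            # Sort neighbour labels so the word does not depend on node order.
            neighbours = sorted(current[u] for u in adj[v])
            relabeled[v] = f"{current[v]}({','.join(neighbours)})"
        current = relabeled
        words.extend(f"{depth}:{current[v]}" for v in adj)
    return words


def build_vocabulary(graph_words):
    """Assign an integer index to every distinct word across all graphs."""
    vocab = {}
    for words in graph_words:
        for word in words:
            vocab.setdefault(word, len(vocab))
    return vocab


def count_vector(words, vocab):
    """Bag-of-words vector: how often each vocabulary word occurs in a graph."""
    counts = Counter(words)
    vector = [0] * len(vocab)
    for word, count in counts.items():
        vector[vocab[word]] = count
    return vector


# Two toy labelled graphs (hypothetical data): a path A-B-A and a triangle of A nodes.
path = ({0: [1], 1: [0, 2], 2: [1]}, {0: "A", 1: "B", 2: "A"})
triangle = ({0: [1, 2], 1: [0, 2], 2: [0, 1]}, {0: "A", 1: "A", 2: "A"})

graph_words = [wl_words(adj, lab, iterations=1) for adj, lab in (path, triangle)]
vocab = build_vocabulary(graph_words)
vectors = [count_vector(words, vocab) for words in graph_words]
```

The two graphs share only the depth-0 word for label A, so their vectors differ in every other position. A standard classifier could be trained on such vectors, or on learned embeddings in their place.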