随着大数据时代的到来,对异构和分布式的模糊XML数据管理显得越来越重要。在模糊XML数据的管理中,模糊XML文档的分类是关键问题。针对模糊XML文档的分类,提出采用双隐层极限学习机模型来实现模糊XML文档自动分类。这个模型可以分为两个部分:第一层采用极限学习机提取模糊XML文档的相应特征,第二层利用核极限学习机根据这些特征进行最终的模糊XML文档分类。通过实验验证了所提方法的性能优势。首先对主要的调节参数包括隐藏层节点的数目L,常量C和核参数γ进行了研究,接下来的对比实验说明提出的基于双隐层ELM(ExtremeLearningMachine)的方法相较于传统单隐层ELM以及SVM(SupportVectorMachine)方法,分类精度得到较大提高,训练时间进一步缩减。
With the arrival of the era of big data, the management of distributed and heterogeneous fuzzy XML data is also becoming more and more important. In the management of fuzzy XML data, the classification of fuzzy XML documents is the key problem. In order to study the classification for fuzzy XML documents, in this paper, a new ELM-based double hidden layer framework is proposed. The proposed architecture is divided into two main components:the feature extraction of fuzzy XML documents are performed using Extreme Learning Machine in first layer, and then use these characteristics to classify the fuzzy XML documents by KELM Kernel Extreme Learning Machine in second layer. Finally, the performance advantages of the proposed method are verified by experiments. Firstly, the parameters including the number of hidden neuron, and the constant parameter C and kernel parameter γ are investigated in detail. Compared with the traditional single hidden layer ELM(Extreme Learning Machine)and SVM(Support Vector Machine)method, the classification accuracy has been greatly improved and the training time has been decreased by approach based on the double hidden layer ELM proposed in this paper.