基于CI−GAT的煤矿安全事故文本分类研究

CI−GAT−Based Text Classification for Coal Mine Safety Accidents

  • 摘要: 针对煤矿生产领域事故的复杂性、类别的不平衡性以及事故致因和事故类别之间的因果性,提出了一种基于因果效应和图注意力网络的煤矿安全事故文本图数据分类算法CI−GAT,根据事故潜在致因预测煤矿安全事故类别。算法以CI−GNN模型为基础框架,首先优化了GraphVAE模块,编码器部分通过增加GCN层构建更深的GCN结构,解码器部分引入BatchNorm和Dropout,更加全面地解码事故文本图的致因节点。在算法的分类器模块使用GAT网络代替GIN,更好地捕获事故节点之间的依赖关系。此外,通过引入类别原型存储器实现事故的类别增强,降低类别不平衡的影响,在多粒度特征融合模块引进门控机制FusionGate以融合事故的全局特征和节点特征,将结果传入包含两个自适应残差块的MLP的解码器进行解码,输出事故类别预测结果。在自建的煤矿安全事故文本图数据集上进行实验,准确率、精确率、召回率和F1值分别为:96.3%、89.8%、93% 和0.913,验证了所提出的算法在煤矿安全事故文本图数据集上分类的优势。

     

    Abstract: To address the complexity of coal mine production accidents, the imbalance in accident categories, and the causal relationship between accident causes and accident consequences, this paper proposed a coal mine safety accident text−graph classification algorithm, CI−GAT, based on causal effects and graph attention networks. The algorithm was designed to intelligently classify text data related to coal mine safety accidents and predict the category of impending accidents based on the potential causes of accidents, and it took the CI−GNN model as its basic framework, with adaptive optimizations carried out on the GraphVAE module and the classifier module. Specifically, the algorithm first optimized the GraphVAE module: it enhanced the encoder with additional GCN layers to form a deeper GCN structure, while for the decoder part, it increased the number of MLP hidden units and incorporated BatchNorm and Dropout layers, which enabled the module to more comprehensively decode the causal nodes of accident text graphs. For the classifier module of the algorithm, the following optimizations were implemented: first, it replaced the Graph Isomorphism Network (GIN) with the Graph Attention Network (GAT) which incorporated an attention mechanism to better capture the dependency relationships between accident nodes; second, it introduced a category prototype memory to enhance minority classes and mitigate the effects of category imbalance; finally, in the multi−granularity feature fusion module, it employed a gating mechanism (FusionGate) to fuse global accident features with node features, and the fused results were then input to an MLP decoder containing two adaptive residual blocks to output the classification results of coal mine accident text. Comparative experiments and ablation experiments were conducted on a self−constructed text−graph dataset of coal mine safety accidents. The performance metrics of the proposed algorithm are as follows: accuracy is 96.3%, precision is 89.8%, recall is 93%, and F1−score is 0.913; these experimental results outperform the benchmark text−graph classification models, verifying the superiority of the proposed algorithm in classifying the text−graph dataset of coal mine safety accidents.

     

/

返回文章
返回