To address limitations in structured representation and in the automated extraction of complex knowledge during transportation knowledge graph construction, this study proposed TransKG, a large model for transportation knowledge, and TransKG-Chat, a method for automatic knowledge graph construction. First, based on domain-specific transportation corpora, instruction fine-tuning, high-level supervised parameter optimization, and a multi-task joint loss were employed to strengthen the model's understanding and structured extraction of transportation knowledge. Then, a multi-level quintuple system was designed and combined with automatic parsing and hierarchical normalization algorithms to achieve high-precision batch extraction of knowledge and hierarchical organization of complex semantic attributions. Finally, building on automatic quintuple extraction, a knowledge graph-driven intelligent application framework was constructed to support knowledge visualization and decision support in scenarios such as freight hub monitoring and multimodal transport. Experimental results showed that the TransKG model markedly improved the Pass@1 metric on a transportation-domain question-answering set compared with mainstream models of the same parameter scale, and that quintuple extraction accuracy reached 95%. In terms of automation efficiency, the TransKG-Chat method built the graph 2.98 times and 12.83 times faster than manual construction for texts of 500 and 20,000 characters, respectively. These results indicate that the proposed method holds a leading advantage in automatic transportation knowledge extraction and can effectively support intelligent service applications in the industry.