基于异质信息网络的古代科技文献知识挖掘研究

  • 打印
  • 收藏
收藏成功


打开文本图片集

Knowledge Mining of Ancient Chinese Scientific Documents Based on Heterogeneous Information Networks

Pan Jun,Hu Pengfei, Tao Xiangxing

Abstract To address the issues of multi-source heterogeneity and semantic association deficiencyin the knowledge organization of ancient scientific and technological documents, this paper proposes a knowledge mining and visualization approach based on heterogeneous information networks.Firstly,a domain knowledge representation model is designed to construct an initial knowledge base.Next,online encyclopedic data are collected,from which triplesare extracted through rule templates and large language models.Finaly,the triple dataset is transformed intoa heterogeneous information network,and key metrics such as degree distribution, centrality,and community structureare analyzed.Avisualization application is built based on multi-dimensional datasets to intuitively present the semantic relationships among knowledge units in the ancient scientific and technological system,providing tool support for the digital organization and knowledge discovery of ancient scientific and technological documents.

KeyWordsHeterogeneous informationnetwork.Relationextraction.Ancient Chinese scientificandtechnologicaldocuments Knowledgemining.Digital humanities.

0 引言

古代科技文献是我国传统科技文明的重要载体,涵盖天文历算、医药农学、营造工艺等多个门类,形成了严谨完备的科学体系1。(剩余12797字)

monitor
客服机器人