Real-World Data Problems and Governance Practices for Multi-source Heterogeneous Data


doi:10.3969/J.ISSN.1672-7274.2025.06.031

CLC number: TP393.08  Document code: B  Article ID: 1672-7274(2025)06-0092-03

Real-World Data Problems and Governance Practices of Multi-source Heterogeneous Data

WANG Weiyu, DIAO Jiaxing, WANG Hui (Harbin Institute of Information Engineering, Harbin 150431, China)

Abstract: This article focuses on hospital HIS data, analyzes in depth the complex process of cross-hospital data governance, and systematically summarizes common real-world data problems and their corresponding solutions. It aims to provide a practical research framework and ideas for addressing data governance challenges in the field of real-world research, promoting the efficient utilization of data resources and the scientific translation of research results.

Keywords: real-world research; multi-source heterogeneous data; HIS data; medical big data; data governance

1 Characteristics of Real-World Data

1.1 Insufficient Accuracy

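Although the hospital information system (HIS) is the source of data collection, every hospital has missing or erroneous entries in its original data. Following a unified standard, this study compiled 156 common HIS fields in total: ① basic information, such as sex, age, and occupation; ② admission and discharge records, such as admission time, chief complaint, past medical history, discharge time, and course of diagnosis and treatment; ③ traditional Chinese and Western medicine diagnoses, such as disease name, disease code, syndrome name, and syndrome code; ④ vital signs, such as examination time, examination item, and result; ⑤ medical orders, such as content, time, specification, frequency, and quantity; ⑥ laboratory test results, such as test item, test result, unit, and reference range; ⑦ examination results, such as examination site, examination description, conclusion, and time.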
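To make the notion of missing entries concrete, the seven field categories above can be written down as a small catalogue and used to measure how often values are absent. The following is a minimal sketch, not the authors' method: the English field identifiers, the `FIELD_CATEGORIES` name, and the rule that `None` or a blank string counts as missing are all assumptions made for illustration, and only a handful of the 156 fields are listed.

```python
# Hypothetical catalogue: the seven categories follow the text, but the
# field identifiers are illustrative and cover only the examples it names.
FIELD_CATEGORIES = {
    "basic_information": ["sex", "age", "occupation"],
    "admission_discharge": ["admission_time", "chief_complaint", "past_history",
                            "discharge_time", "treatment_course"],
    "diagnosis": ["disease_name", "disease_code", "syndrome_name", "syndrome_code"],
    "vital_signs": ["vital_time", "vital_item", "vital_result"],
    "medical_orders": ["order_content", "order_time", "order_specification",
                       "order_frequency", "order_quantity"],
    "laboratory_tests": ["lab_item", "lab_result", "lab_unit", "lab_reference_range"],
    "examinations": ["exam_site", "exam_description", "exam_conclusion", "exam_time"],
}


def is_missing(value):
    """Assumed rule: None or a blank/whitespace-only string is a missing entry."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def missing_rate_by_category(records, categories=FIELD_CATEGORIES):
    """Return, per category, the fraction of cells that are missing.

    `records` is a list of dicts keyed by field identifier; a field that is
    absent from a record is counted as missing.
    """
    rates = {}
    for category, fields in categories.items():
        total = len(records) * len(fields)
        missing = sum(1 for rec in records for f in fields if is_missing(rec.get(f)))
        rates[category] = missing / total if total else 0.0
    return rates


if __name__ == "__main__":
    # Two made-up records, used only to exercise the function.
    sample = [
        {"sex": "F", "age": 54, "occupation": ""},
        {"sex": None, "age": 61, "occupation": "teacher"},
    ]
    for category, rate in missing_rate_by_category(sample).items():
        print(f"{category}: {rate:.2f}")
```

Reporting the rate per category rather than per record is one possible design choice; it points to which part of the HIS export is least complete at a given hospital, which may help when comparing sources before merging them.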
