To support safe, rational, and efficient medication decisions by doctors and patients, this paper proposes MaLDA (Medication Analysis based on LDA), a medication analysis method built on Latent Dirichlet Allocation (LDA). MaLDA combines medication records with diagnostic records and treats each drug as a document, each drug function as a topic, and each disease as a word, so that the LDA topic model can infer the latent functions of drugs. These functions link related drugs to one another, related diseases to one another, and drugs to diseases. Drugs are then clustered according to their distributions over functions, each cluster is described by its associated diseases, and clinical medication is analyzed on the basis of the clustering results. MaLDA can identify not only the drugs that are more effective for a given class of diseases but also latent drug combinations, which lays a foundation for mining drug side effects and disease complications. The method is evaluated on the medication and diagnostic records of 137,510 patients from a hospital in Shanghai. The experimental results confirm the advantages of MaLDA over baseline methods for medication analysis on electronic medical records.
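The modeling idea described above (drugs as documents, drug functions as topics, diseases as words) can be illustrated with a minimal sketch. It is not the authors' implementation: the toy records, the function names `build_drug_documents` and `lda_gibbs`, the hyperparameters, and the use of collapsed Gibbs sampling for inference are all assumptions made for illustration, and the final step simply assigns each drug to its most probable function rather than reproducing the clustering procedure of the paper.

```python
import random
from collections import defaultdict

# Hypothetical toy records: (patient_id, drug) and (patient_id, disease).
medication_records = [
    ("p1", "metformin"), ("p1", "insulin"), ("p2", "metformin"),
    ("p3", "insulin"), ("p4", "amlodipine"), ("p5", "amlodipine"),
    ("p5", "losartan"), ("p6", "losartan"),
]
diagnostic_records = [
    ("p1", "diabetes"), ("p2", "diabetes"), ("p3", "diabetes"),
    ("p3", "hyperglycemia"), ("p4", "hypertension"),
    ("p5", "hypertension"), ("p6", "hypertension"), ("p6", "angina"),
]


def build_drug_documents(med_records, diag_records):
    """Join both record sets on patient: each drug becomes a 'document'
    whose 'words' are the diseases of the patients who received it."""
    diseases_of = defaultdict(list)
    for patient, disease in diag_records:
        diseases_of[patient].append(disease)
    docs = defaultdict(list)
    for patient, drug in med_records:
        docs[drug].extend(diseases_of[patient])
    return dict(docs)


def lda_gibbs(docs, n_topics, alpha=0.1, beta=0.01, n_iter=200, seed=0):
    """Collapsed Gibbs sampling for LDA (an assumed inference scheme).
    Returns a dict mapping each drug to its distribution over functions."""
    rng = random.Random(seed)
    vocab_size = len({w for words in docs.values() for w in words})
    doc_topic = {d: [0] * n_topics for d in docs}        # drug-function counts
    topic_word = [defaultdict(int) for _ in range(n_topics)]  # function-disease counts
    topic_total = [0] * n_topics

    def bump(d, w, k, delta):
        doc_topic[d][k] += delta
        topic_word[k][w] += delta
        topic_total[k] += delta

    # Random initial function assignment for every disease occurrence.
    assign = {}
    for d, words in docs.items():
        assign[d] = [rng.randrange(n_topics) for _ in words]
        for w, k in zip(words, assign[d]):
            bump(d, w, k, +1)

    for _ in range(n_iter):
        for d, words in docs.items():
            for i, w in enumerate(words):
                bump(d, w, assign[d][i], -1)  # remove the current assignment
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = k
                bump(d, w, k, +1)

    theta = {}
    for d, counts in doc_topic.items():
        total = sum(counts) + n_topics * alpha
        theta[d] = [(c + alpha) / total for c in counts]
    return theta


docs = build_drug_documents(medication_records, diagnostic_records)
theta = lda_gibbs(docs, n_topics=2)

# Simplified grouping: assign each drug to its most probable function.
clusters = defaultdict(list)
for drug, dist in theta.items():
    clusters[dist.index(max(dist))].append(drug)

for function_id, drugs in sorted(clusters.items()):
    print(function_id, sorted(drugs))
```

On real data, the per-drug distributions in `theta` could be passed to any standard clustering algorithm, and each resulting cluster could be described by the diseases with the highest counts under its dominant functions.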