105基于两种文摘数据的作者相似性探测席崇俊1丁楷1刘文斌1张洁2(1.中国科学技术信息研究所北京100038;2.内蒙古农业大学马克思主义学院呼和浩特010000)摘要[目的/意义]作者相似性探测一直是图书情报领域的热点研究问题之一,现有基于作者合著关系、作者关键词耦合、作者文献耦合等分析方法多假设关键词、标题、引文数据之间相互独立,难以真实准确地反映作者研究内容的相似性。[方法/过程]构建作者的关键词-标题和引文-标题2模矩阵,分别以标题向量表征关键词和引文,再以各关键词和引文的夹角余弦平均值表征作者相似性,并对关键词和引文加权从非对称视角下考察作者的相似性。[结果/结论]实验结果表明,基于加权的关键词-标题和引文-标题数据可以从非对称视角下较为准确地分析作者的相似性。关键词关键词引文2模矩阵余弦相似度作者相似性非对称视角分类号G350引用本文格式席崇俊,丁楷,刘文斌,等.基于两种文摘数据的作者相似性探测[J].图书情报研究,2023,16(2):105-112.AuthorSimilarityDetectionBasedonTwoAbstractDataXiChongjun1,DingKai1,LiuWenbin1,ZhangJie21.ChinaInstituteofScienceandTechnologyInformation,Beijing100038,China2.SchoolofMarxism,InnerMongoliaAgriculturalUniversityHuhhot010000,ChinaAbstract[Purpose/significance]Researchonauthorsimilarityhasalwaysbeenoneofthehotissuesinthefieldoflibraryandinformationscience.Theexistingmethodsbasedonauthorco-authoringrelationship,authorkeywordcoupling,authordocumentcoupling,etc.assumethatkeywords,titles,andcitationdataareindependentofeachother,whichisdifficulttotrulyandaccuratelyreflectthesimilarityoftheauthor’sresearchcontent.[Method/process]Thispaperintendstoconstructtheauthor’skeywordtitleandcitation-title2-modulematrix,respectively,usingthetitlevectortorepresentthekeywordsandcitations,andthenusingthecosinesimilaritymeanofeachkeywordandcitationtorepresenttheauthor’ssimilarity,andtoinvestigatethechangeofauthor’ssimilarityundertheasymmetricperspectivebeforeandaftertheweightingofkeywordsandcitations.[Result/conclusion]Theexperimentshowsthattheweightedkeywordtitleandcitationtitledatacanaccuratelyanalyzethesimilaritybetweenauthorsfromanasymmetricperspective.Keywordskeyword;citation;2-modul...