研究与开发差异化需求下的非关系型分布式报送信息大数据分类方法韩璐1,陈威宇1,张斐2,何建锋1,苏怀振3(1.国网甘肃省电力公司,甘肃兰州730030;2.国网思极飞天(兰州)云数科技有限公司,甘肃兰州730020;3.国网甘肃省电力公司定西供电公司,甘肃定西743000)摘要:针对多源异构、分布广泛报送信息差异化应用需求较多、无法区分可用性信息的问题,研究了差异化需求下的非关系型分布式报送信息大数据分类方法。首先,分析了非关系型分布式报送信息数据库的可用性、开放性和拓展性等特征,结合字段类型的基本要求,采用非结构化数据库存储文本检索信息处理(TRIP)存储非关系型分布式报送信息;然后,分析了汉明散列家族内散列过程,在线性级要求约束下,利用多吸引子优化元胞自动机,通过遗传算法改进多吸引子元胞自动机分类器的最优参数,进而改进大数据分类方法。实验结果表明,该方法能够有效识别并分类非关系型分布式报送信息中的结构化数据与非结构化数据,具有较高的分类精度。关键词:差异化需求;非关系型;分布式;报送信息;大数据分类;元胞自动机中图分类号:TP311文献标志码:Adoi:10.11959/j.issn.1000−0801.2023122BigdataclassificationmethodofnonrelationaldistributedsubmissioninformationunderdifferentiatedrequirementsHANLu1,CHENWeiyu1,ZHANGFei2,HEJianfeng1,SUHuaizhen31.StateGridGansuElectricPowerCompany,Lanzhou730030,China2.StateGridLanzhouSijiFeitianCloudDateScienceTechnologyCo.,Ltd.,Lanzhou730020,China3.StateGridGansuElectricPowerCompanyDingxiPowerSupplyCompany,Dingxi743000,ChinaAbstract:Theclassificationmethodofnon-relationaldistributedsubmittedinformationbigdataunderthedifferen-tiateddemandwasstudied,aimingattheproblemofmulti-sourceheterogeneous,widelydistributedsubmittedinfor-mationwithmoredifferentiatedapplicationrequirementsandinabilitytodistinguishtheavailableinformation.Firstly,theusability,opennessandexpansibilityofthenon-relationaldistributedsubmissioninformationdatabasewereana-lyzed.TheunstructureddatabasestorageTRIPwasusedtostorenon-relationaldistributedsubmissioninformationbycombiningthebasicrequirementsoffieldtypes.Then,thehashingprocesswithintheHamminghashfamilywasanalyzed.Undertheconstraintoflinearitylevelrequir...