A Load Balancing Framework for the Metadata Service of Distributed File Systems

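ISSN: 1000-9825
Journal: Journal of Software (《软件学报》)
Date: 0
Classification: TP316 [Automation and Computer Technology: Computer Software and Theory; Automation and Computer Technology: Computer Science and Technology]
Author affiliations: [1] Institute of Software, Chinese Academy of Sciences, Beijing 100190; [2] University of Chinese Academy of Sciences, Beijing 100190
Funding: National Natural Science Foundation of China (61170074, 61202065, U1435220); National High Technology Research and Development Program of China (863 Program) (2013AA041301); National Key Technology R&D Program of China (2015BAH18F02)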

Keywords: metadata server, distributed file system, load balancing, prefetching, caching

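Chinese abstract (translated):

Request load balancing is the core problem that metadata management in a distributed file system must address. With the goal of maximizing the throughput of the metadata server cluster, a distributed cache framework is designed and implemented on top of the existing metadata management layer. It manages only hotspot metadata and balances the constantly changing load. Compared with existing metadata load balancing architectures, this two-tier architecture is more flexible and more sensitive to load, and it avoids the damage to the metadata namespace structure that redistributing and migrating hotspot metadata would cause. Observation and analysis show that metadata items are small and numerous, and that prefetching the wrong metadata costs far less than prefetching the wrong data. Based on these characteristics, a metadata prefetching strategy and a prefetching-based metadata cache replacement algorithm are proposed to improve the performance of the distributed cache layer. The two-tier framework also takes cache consistency into account. Finally, the effectiveness of the framework and methods is verified in a real distributed file system.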

English abstract:

Request load balancing is the core issue in distributed file system metadata management. To maximize the throughput of the metadata service, an adaptive request load balancing framework is critical. This paper presents a distributed cache framework that sits above existing distributed metadata management schemes and achieves request load balancing by managing only hotspots rather than all metadata. Compared with existing distributed metadata load balancing frameworks, the two-tier load balancing structure is more flexible and has a stronger perception of the overall load. It also avoids the hotspot redistribution and namespace structure destruction caused by metadata migration. Compared with data, metadata has distinct characteristics, such as small size and large quantity, so the cost of prefetching metadata that is never used is much lower than the cost of prefetching unused data. Based on this observation, a time period-based prefetching strategy and a prefetching-based adaptive replacement cache algorithm are devised to improve the performance of the distributed caching layer and adapt it to constantly changing workloads. Finally, the presented approach is evaluated on a Hadoop distributed file system cluster.

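Projects with papers in the same journal

Big Data Analysis Methods and Decision Support for Complex Intelligence

Journal papers: 6

Research on Mashup Methods and Techniques for Intelligent Personal Software Production Environments

Journal papers: 13; conference papers: 4

Intermediate Data Management Optimization Techniques for Parallel Dataflow Programs on Cloud Platforms

Journal papers: 3; conference papers: 7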

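Journal papers from the same project

An Incremental Synchronization Algorithm for Cloud Storage Service Clients

A Component-Based Big Data Analysis Service Platform

FlowS: A Fair Scheduling Method for MapReduce Dataflows

A Filter-Based Secure Single Sign-On Scheme for Legacy Systems

Automatic Java Code Generation Based on the SHH Framework

Design of a Page Integration Framework Based on Interaction Logic Reuse

An Exploratory Service Composition Method for Mobile Applications

An Efficient Service Selection Method Based on Service Functional Specifications

An Operation Log-Based Multi-Device Synchronization Algorithm for Cloud Storage Services

Customizable Data Statistics Services for End Users

Automatic Java Code Generation Based on the SSH Framework

A Local Distance-Based Outlier Detection Method Using Density-Biased Sampling

Intelligence Analysis and Applications of Satellite Imagery Big Data

A Spark-Based Workflow-Oriented Machine Learning Analysis Method

An Adaptive Gain-Scheduling Sliding Mode Control Algorithm for High-Precision Position Tracking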

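Journal information

Journal of Software (《软件学报》)
Peking University Core Journal (2011 edition)

Supervising body: Chinese Academy of Sciences
Sponsors: Institute of Software, Chinese Academy of Sciences; China Computer Federation
Editor-in-chief: Zhao Chen (赵琛)
Address: Institute of Software, Chinese Academy of Sciences, P.O. Box 8718, Beijing
Postal code: 100190
Email: jos@iscas.ac.cn
Phone: 010-62562563

International standard serial number: ISSN 1000-9825
Domestic unified serial number: CN 11-2560/TP
Postal distribution code: 82-367

Awards:
Selected as a "Double Hundred Journal" of the China Periodical Phalanx in 2001; awarded the First Prize for Outstanding Scientific and Technical Journals of the Chinese Academy of Sciences in 2000

Indexed in domestic and international databases:
Abstract Journal (Russia), Mathematical Reviews, online edition (USA), Index Copernicus (Poland), Zentralblatt MATH (Germany), Scopus (Netherlands), Engineering Index (USA), Cambridge Scientific Abstracts (USA), Science Abstracts database (UK), Japan Science and Technology Agency database (Japan), China Science and Technology Core Journals (China), Peking University Core Journals (China; 2000, 2004, 2008, 2011 and 2014 editions)

Citations: 54609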