Feng Tian (田锋)

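Email:
Affiliation: School of Computer Science and Technology
Position: Deputy Director, Faculty of Electronic and Information Engineering
Education: Doctoral graduate
Office location:
Gender: Male
Contact:
Degree: Doctorate
Academic title: Professor
Main appointment: Executive Deputy Director, National Engineering Research Center for Visual Information and Applications
Other appointments: Shaanxi Province Key Laboratory of Big Data Knowledge Engineering
Doctoral supervisor: Yes
Master's supervisor: Yes
Discipline: Computer Science and Technology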
Publications
An optimized approach for storing and accessing small files on cloud storage
Posted: 2025-04-30

Paper title: An optimized approach for storing and accessing small files on cloud storage

Journal: Journal of Network and Computer Applications

Abstract: This paper first analyzes the causes of the small file problem in HDFS: (1) large numbers of small files impose a heavy burden on the HDFS NameNode; (2) correlations between small files are not considered in data placement; and (3) no optimization mechanism, such as prefetching, is provided to improve I/O performance. Second, in the context of HDFS, the cut-off point between large and small files is determined experimentally, which answers the question of 'how small is small'. Third, according to their correlation features, files are classified into three types: structurally-related files, logically-related files, and independent files. Finally, based on the above three steps, an optimized approach is designed to improve the storage and access efficiency of small files on HDFS. A file merging and prefetching scheme is applied to structurally-related small files, while a file grouping and prefetching scheme is used to manage logically-related small files. Experimental results demonstrate that the proposed schemes effectively improve the storage and access efficiency of small files compared with native HDFS and a Hadoop file archiving facility. (http://www.sciencedirect.com/science/article/pii/S1084804512001610)
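
The merging-and-prefetching idea in the abstract can be illustrated with a small in-memory sketch. This is not the paper's implementation and does not use any HDFS API: the names `MergedFile`, `PrefetchingReader`, and `SMALL_FILE_CUTOFF`, along with the 4 MB threshold value, are hypothetical and chosen only for illustration. It shows related small files packed into one blob with an offset index, so that a single backend read can serve the whole group.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Hypothetical cut-off; the paper derives its own value experimentally.
SMALL_FILE_CUTOFF = 4 * 1024 * 1024


@dataclass
class MergedFile:
    """One large blob plus an index of (offset, length) per small file."""

    data: bytearray = field(default_factory=bytearray)
    index: dict[str, tuple[int, int]] = field(default_factory=dict)

    def append(self, name: str, content: bytes) -> None:
        """Pack one small file at the end of the blob and record where it is."""
        if len(content) >= SMALL_FILE_CUTOFF:
            raise ValueError(f"{name} is not a small file")
        self.index[name] = (len(self.data), len(content))
        self.data += content

    def read(self, name: str) -> bytes:
        """Return one member file by slicing the blob with its index entry."""
        offset, length = self.index[name]
        return bytes(self.data[offset:offset + length])


class PrefetchingReader:
    """On a cache miss, loads every member of the merged file into a cache."""

    def __init__(self, merged: MergedFile) -> None:
        self.merged = merged
        self.cache: dict[str, bytes] = {}
        self.backend_reads = 0  # counts simulated trips to storage

    def read(self, name: str) -> bytes:
        if name not in self.cache:
            # Cache miss: one simulated backend read prefetches the group.
            self.backend_reads += 1
            for member in self.merged.index:
                self.cache[member] = self.merged.read(member)
        return self.cache[name]
```

In this sketch, reading any one member of a merged file makes later reads of its siblings cache hits, which is the effect prefetching aims for when related files tend to be accessed together. The storage system holds one large object instead of many small ones, which is the kind of change that would reduce per-file metadata on a NameNode.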

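Co-authors: Bo Dong (董博), Qinghua Zheng (郑庆华), Feng Tian (田锋, corresponding author), et al.

Volume: **

Pages: **

Translated work:

Publication date: 2012-08-03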