amazon

Monday, August 17, 2015

HBA: Distributed Metadata Management for Large Cluster-Based Storage Systems

HBA: Distributed Metadata Management for
Large Cluster-Based Storage Systems








Abstract
An efficient and distributed scheme for file mapping or file lookup is critical in decentralizing metadata management within a group of metadata servers, here the technique used called HIERARCHICAL BLOOM FILTER ARRAYS (HBA) to map filenames to the metadata servers holding their metadata. The Bloom filter arrays with different levels of accuracies are used on each metadata server. The first one with low accuracy and used to capture the destination metadata server information of frequently accessed files. The other array is used to maintain the destination metadata information of all files. Simulation results show our HBA design to be highly effective and efficient in improving the performance and scalability of file systems in clusters with 1,000 to 10,000 nodes (or superclusters) and with the amount of data in the petabyte scale or higher. HBA is reducing metadata operation by using the single metadata architecture instead of 16 metadata server.





EXISTING SYSTEM
                       File mapping or file lookup is critical in decentralizing metadata management within a group of metadata servers. Following approaches are used in the Existing system.
      1. Table-Based Mapping: It fails to balance the load.
     2. Hashing-Based Mapping: It has slow directory operations, such                                  
         as listing the  directory  contents  And renaming directories .                                                                                  
   3. Static Tree Partitioning    : Cannot balance the load and has a medium 
        lookup  time.
4. Dynamic Tree Partitioning: Small memory overhead, incurs a large
           Migration overhead.

2.3 PROPOSED SYSTEM
                            Here we are using the new approaches called HIERARCHICAL BLOOM FILTER ARRAYS (HBA), efficiently route metadata request within a group of metadata servers. There are two arrays used here. First array is used to reduce memory overhead, because it captures only the destination metadata server information of frequently accessed files to keep high management efficiency.   And the second one is used to maintain the destination metadata information of all files.  Both the arrays are mainly used for fast local lookup.










3. SYSTEM SPECIFICATION
3.1 Hardware CONFIGURATION

         SYSTEM             : Pentium IV 2.4 GHz
         HARD DISK         : 40 GB
         FLOPPY DRIVE         : 1.44 MB
         MONITOR                   : 15 VGA colour
         MOUSE               : Logitech.
         RAM                    : 256 MB
         KEYBOARD      : 110 keys enhanced.

3.2 Software CONFIGURATION

         Operating system   :-  Windows XP 3
         Front End                       :-  Microsoft Visual Studio .Net 2010
         Coding Language           :-  C#  2.0
         Database                :- SQL SERVER 2008



No comments:

Post a Comment