Conquest-2

Improving Energy Efficiency and Performance
 Through a Disk/Hybrid File System






NSF Grant: CNS-0410908 (Jul 2004 to Jun 2007)
UCLA PI:  Peter Reiher (Laboratory for Advanced Systems Research)
FSU PI:  An-I (Andy) Wang (FSU Computer Science Department)





Conquest-2 is a three-year NSF-funded collaborative research grant with Florida State University.  In conjunction with Andy Wang of Florida State University, we are developing a file system that can simultaneously provide high file systems performance and energy efficiency.  The key for reconciling these generally conflicting goals is based on the observation that RAM is becoming an inexpensive resource that can deliver all file system services with the single exception of large storage capacity.  In essence, instead of storing most files on disks, Conquest-2 stores them only in RAM.  Also, it stores all metadata in RAM.  Since  most files can then be accessed directly from RAM, there is no need to keep the disk spinning.  Only large files need to be kept on disk, and such files will be accessed infrequently.  While RAM running in its normal mode consumes a great deal of power, RAM can be put into an extremely power-efficient mode and returned to its normal mode much more rapidly than disk can.

Our research suggests that one can achieve better power savings than LRU caching by instead using RAM with battery backup as the primary persistent storage for most files, and providing two separate data paths to memory and disk storage.  By specializing the data paths, the memory data path contains no-disk-related complexity.  The disk data path consists only of optimizations for the specialized disk usage pattern. 

Conquest-2 is based on the original Conquest disk/RAM hybrid file system (NSF grant CCR-0098363).  Under the original Conquest, all metadata (e.g., file attributes) and all files smalled than a certain threshold are stored in RAM, while the disk stores only the data content of large files.  The performance improvement is achieved by matching the user behavior with physical characteristics, and having two specialized data paths.  Conquest-2 will extend the original Conquest to handle multiple disks.  Minimizing the energy use for multiple disks involves creating a bimodal distribution of active and idle disks, so we can put idle disks to sleep to conserve power.  The idea is to design and implement an energy-efficient RAID that can "gear shift" disks on and off to adapt for changing loads.  This energy-efficient RAID also needs to address existing load balancing and failure recovery concerns by exploting user access patterns, applying erasure encoding, and striping data in a non-uniform way.