Toward efficient search for ultrascale storage systems

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

As the rate at which scientific computing generates data continues to increase, managing this data, in all its facets, is quickly becoming more challenging. In many facilities with large-scale storage needs, this massive amount of data is stored in distributed, multi-tiered storage systems. It has become imperative to allow for efficient and effective search within these kinds of environments. For some search problems, specifically file system metadata, traditional relational databases have been used with initially good results. As the scale of supercomputing has grown, though, we find that it is becoming increasingly difficult for databases to scale with the volume of metadata that they manage. In this paper, we propose a new direction for database management techniques within the context of high-performance computing: search within ultrascale storage systems. Instead of using databases as a layer sitting above the storage system, we suggest moving database components into the storage system itself. By taking this approach, we aim to leverage the decades of research and tuning that have made relational database technology successful. At the same time, this integration gives us the ability to maintain a better view of the storage system for search optimization. Through this effort, we can position these techniques to scale to the degree required by the high-performance computing community, both now and in the future.

Original language: English (US)
Title of host publication: HPCDB'11 - Proceedings of the 2011 Workshop on High-Performance Computing Meets Databases, Co-located with SC'11
Pages: 1-4
Number of pages: 4
State: Published - 2011
Event: 1st Annual 2011 Workshop on High-Performance Computing Meets Databases, HPCDB'11, Co-located with Supercomputing, SC'11 - Seattle, WA, United States
Duration: Nov 13 2011 to Nov 13 2011

Publication series

Name: HPCDB'11 - Proceedings of the 2011 Workshop on High-Performance Computing Meets Databases, Co-located with SC'11

Other

Other: 1st Annual 2011 Workshop on High-Performance Computing Meets Databases, HPCDB'11, Co-located with Supercomputing, SC'11
Country/Territory: United States
City: Seattle, WA
Period: 11/13/11 to 11/13/11

Keywords

  • Databases
  • Exascale
  • File systems
  • Indexing
  • Search
