Genomic data modeling

Jake Yue Chen, John V. Carlis

Research output: Contribution to journalArticlepeer-review

23 Scopus citations

Abstract

Researchers face many challenges in representing biological data, including: (1) inherent complexity of biological data, (2) domain knowledge barrier, (3) constantly evolving knowledge, and (4) lack of expert data-modeling skills. We have studied how to represent biological sequences and sequence-related genomics concepts using logical data structure. From our multiple experiences in genomic data modeling, we present results in three areas: genomic schema elements, genomic schema fragments, and genomic data modeling lessons. A genomic schema element is a data model that contains only one basic biological sequence notion. Genomic schema elements provide biology data modelers with baseline thoughts in genomic data modeling. A genomic schema fragment is a data model that contains only one genomic topic area. Genomic schema fragments provide biology data modelers with successful design solutions that they can adapt to fit their own problem's needs. Genomic data modeling lessons address issues particularly important to genomic data modeling such as modeling contextual information, modeling intermediate and derived data, modeling inconsistent data, and modeling categorical rules. Genomic data modeling lessons provide novice biology data modelers with enriched principles from content-neutral data modeling techniques. In all, we have demonstrated how to manage evolving genomic knowledge concepts and discovery results using data modeling techniques extended into the genomics domain.

Original languageEnglish (US)
Pages (from-to)287-310
Number of pages24
JournalInformation Systems
Volume28
Issue number4
DOIs
StatePublished - Jun 2003
Externally publishedYes

Bibliographical note

Copyright:
Copyright 2004 Elsevier Science B.V., Amsterdam. All rights reserved.

Keywords

  • Bioinformatics
  • Data modeling
  • Genomics
  • Knowledge representation

Fingerprint

Dive into the research topics of 'Genomic data modeling'. Together they form a unique fingerprint.

Cite this