TY - GEN
T1 - Generating GO slim using relational database management systems to support proteomics analysis
AU - Onsongo, Getiria
AU - Xie, Hongwei
AU - Griffin, Timothy J.
AU - Carlis, John
PY - 2008
Y1 - 2008
N2 - The Gene Ontology Consortium built the Gene Ontology database (GO) to address the need for a common standard in naming genes and gene products. Using different names for the same concepts and different concepts with the same name makes it effectively impossible for humans and computers alike to analyze biological processes across different organisms. The consortium addresses this need by defining terms for categorizing genes and gene products. A convention in GO is that each gene or gene product is annotated to the most specific GO term in the GO database. It is, however, also useful for researchers to be able to group genes or gene products into broad biological categories that give a higher-level view of their function when analyzing results of an experiment. A GO Slim is a subset of the GO ontology that provides such a higher-level view of functions. Existing GO Slim generation tools have two important limitations: programming language dependence, and an inability to dynamically generate a GO Slim while analyzing. We have extended the relational database engine to dynamically generate a GO Slim overcoming this limitations. Using this extention, we have developed a tool (DynamicGOSlim) that dynamically generates a GO Slim and uses the generated GO Slim to categorize genes or gene products. This tool is being used in an ongoing proteomics project aimed at identifying possible oral cancer biomarkers in saliva.
AB - The Gene Ontology Consortium built the Gene Ontology database (GO) to address the need for a common standard in naming genes and gene products. Using different names for the same concepts and different concepts with the same name makes it effectively impossible for humans and computers alike to analyze biological processes across different organisms. The consortium addresses this need by defining terms for categorizing genes and gene products. A convention in GO is that each gene or gene product is annotated to the most specific GO term in the GO database. It is, however, also useful for researchers to be able to group genes or gene products into broad biological categories that give a higher-level view of their function when analyzing results of an experiment. A GO Slim is a subset of the GO ontology that provides such a higher-level view of functions. Existing GO Slim generation tools have two important limitations: programming language dependence, and an inability to dynamically generate a GO Slim while analyzing. We have extended the relational database engine to dynamically generate a GO Slim overcoming this limitations. Using this extention, we have developed a tool (DynamicGOSlim) that dynamically generates a GO Slim and uses the generated GO Slim to categorize genes or gene products. This tool is being used in an ongoing proteomics project aimed at identifying possible oral cancer biomarkers in saliva.
UR - http://www.scopus.com/inward/record.url?scp=51849099551&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51849099551&partnerID=8YFLogxK
U2 - 10.1109/CBMS.2008.77
DO - 10.1109/CBMS.2008.77
M3 - Conference contribution
AN - SCOPUS:51849099551
SN - 9780769531656
T3 - Proceedings - IEEE Symposium on Computer-Based Medical Systems
SP - 215
EP - 217
BT - Proceedings of the 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008
T2 - 21st IEEE International Symposium on Computer-Based Medical Systems, CBMS 2008
Y2 - 17 June 2008 through 19 June 2008
ER -