RESEARCH INTEREST
Dr. Shyu has concentrated his informatics research efforts in three major areas – visual phenotypes to genotypes correlation, large-scale genomic data mining and retrievals, and computational structure biology. His research group has developed several unique bioinformatics tools for protein tertiary structure retrievals (ProteinDBS), repetitive/highly conserved sequence retrieval system (ACMES), and plant visual phenotypes/genotypes retrieval system (VPhenoDBS).
RESEARCH
Real-Time Protein Tertiary Structure (3D) Retrievals and Classifications
Protein fold is known to be an important clue of detecting possible biological functions. The study of the structure-to-function relationships usually relies on an effective protein structure retrieval and classification method. The task of protein structure retrieval compares a query structure and each known proteins from a database and returns the ones with high similarities. The classification of protein structures categorizes and annotates a newly-discovered protein to possible folds, which could be relevant to the functional properties. With efforts of Structural Genomics (SG) projects, a large amount of protein structures has been identified in recent years via the high-throughput structural determination techniques such as X-ray crystallography and nuclear magnetic resonance (NMR). In the future, more new structures could be solved. To meet the needs of retrieving and classifying these high-throughput protein data, the research activities of this project are designed to face four central challenges.
1) To compare globally similar 3D tertiary structures using content-based information retrieval (CBIR) and high-dimensional indexing techniques in real time.
2) To efficiently classify newly-discovered proteins into the fold hiereachy of the Structural Classification of Protein (SCOP) database based on the structural similarity.
3) To fast retrieve locally similar protein substructures with the non-contiguous structural core identifications in a large-scale protein database.
4) To fuse the retrieval and classification results from different structure cores and provide suggestions to assist the functional predictions.
The proposed system will be the first in the research community that allows a life science researcher or an educator to submit an unknown protein tertiary structure and ask, "What proteins in Protein Data Bank (PDB) have similar non-contiguous structure cores to the query protein?" or “Which fold of SCOP database maintains similar 3D structures to the query protein?”
Visual Phenotype Database
Discoveries in biology often require extensive knowledge of the genetics of an organism, a keen eye for phenotypes, a deep understanding of related species, and efficient strategies for collecting, combining, analyzing, and comparing data. Currently, public database tools that retrieve phenotypic and genomic information allow only relatively simplistic queries, and viable software tools to capture, parse, and return information from digital images are lacking. We hope to enable biologists to simultaneously query phenotype data by image example, sequence, ontology, genetic and physical map information, and text annotations by developing the first web-based visual phenotypic information management system to allow such complex queries.
The database framework will consist of five modules:
(1) A system to extract and quantify low-level features from phenotypic images
(2) A high-dimensional database indexing system to manage and cluster images for real-time retrievals
(3) A linking hub to correlate visual features already attributed to a given locus with relevant genetic and physical maps
(4) A text mining and ontology utilization system for parsing annotations
(5) A results visualization system.
SELECTED PUBLICATIONS
Pin-Hao Chi, Bin Pang, Dmitry Korkin, and Chi-Ren Shyu. Efficient SCOP fold classification and retrieval using index-based protein substructure alignments (IPSA), in Bioinformatics 2009; (to appear)
Adrian Barb, and Chi-Ren Shyu. Visual semantic modeling in content-based geospatial information retrieval using associative mining techniques, in Geoscience and Remote Sensing Letters 2009; (to appear)
Y. Liu, Lawrence W C. Chan, and Chi-Ren Shyu. Editorial for the special issue of knowledge discovery and management in biomedical information systems, in Information Systems Frontiers 2009
Wu He, Sanda Erdelez, Feng-Kwei Wang, and Chi-Ren Shyu. The effects of conceptual description and search practice on users’ mental models and information seeking in a case-based reasoning retrieval system, in Information Processing and Management, Vol. 44, No. 1 , January 2008; 294-309
Grant Scott, and Chi-Ren Shyu. Knowledge Driven Multidimensional Indexing Structure for Biomedical Media Database Retrieval, in IEEE Transactions on Information Technology in Biomedicine, Vol. 11, No. 3 , May 2007; 320-331
X. F. Wan, X. Wu, G. Lin, S. Holton, R. A. Desmone, Chi-Ren Shyu, Y. Guan, and M. Emch. Computational identification of reassortments in avian influenza viruses, in Avian Diseases, Vol. 51 2007; 434-439
H. Oh, S. -Y. Yoon, and Chi-Ren Shyu. How Does Virtual Reality Reshape Furniture Retailing, in Clothing and Textile Research Journal, to appear 2007
Chi-Ren Shyu, Matt Klaric, Grant Scott, Adrian Barb, Curt Davis, and Kannappan Palaniappan. GeoIRIS: Geospatial Information Retrieval and Indexing System - Content Mining, Semantics Modeling, and Complex Queries, in IEEE Transactions on Geoscience and Remote Sensing, Special Issue on Image Mining, Vol. 45, No. 4 , April 2007; 839-852
Chi-Ren Shyu, Jason Green, D. P. K. Lun, Toni Kazic, M. Schaeffer, and Ed Coe. Image Analysis for Mapping Immeasurable Phenotypes in Maize, in IEEE Signal Processing Magazine, Vol. 24, No. 3 , May 2007; 116-119
Wannapa Kay Mahamaneerat, Chi-Ren Shyu, Shih-Chun Ho, and C. Alec Chang. Domain-Concept Association Rules Mining for Large Scale and Complex Cellular Manufacturing Tasks, in Journal of Manufacturing Technology Management, Vol. 18, No. 7 , November 2007
Chi-Ren Shyu, Jaturon Harnsomburana, Jason Green, Adrian Barb, Toni Kazic, M. Schaeffer, and Ed Coe. Searching and mining visually-observed phenotypes of maize mutants, in Journal of Bioinformatics and Computational Biology, Special Issue on Making Sense of Mutations Requires Knowledge Management, Vol. 5, No. 6 , December 2007; 1193-1213
