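³Åã¬Â graduated from the Department of Computer Science at National Chengchi University and is currently a student in the Bioinformatics Program of the Institute of Biochemistry at National Yang-Ming University.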

Internship organization: IBM Almaden Research Center, CA, USA

        http://www.almaden.ibm.com/

Abstract: 

To meet the need to integrate multiple information sources, whether for deeper insight or for other applications, IBM has developed a semi-automatic schema mapping tool called Clio. Given my background and the prevalence of text files in the life sciences, I set out to help biological labs use Clio to merge diverse data sources, with guidance from my manager at IBM, Howard Ho, and my advisor, U-C Yang. Because Clio accepts only XML or relational databases as sources, I designed a procedure that processes well-structured flat files before they are loaded into Clio (flat file -> XML schema -> XML files) and, at the end, produces an SQL script from the Clio mapping file to load the data into physical databases. This not only broadens the range of source types Clio can work with but also extends the usefulness of biological data.
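The flat-file-to-XML step can be illustrated with a minimal sketch. It is not the procedure actually used during the internship: it assumes a tab-delimited file whose first row holds column names that are also valid XML element names. The function name `flat_to_xml`, the tags `records` and `record`, and the sample fields are hypothetical. Generating the XML schema and the Clio mapping itself are not shown.

```python
import csv
import io
import xml.etree.ElementTree as ET


def flat_to_xml(text, root_tag="records", row_tag="record", delimiter="\t"):
    """Convert delimited text with a header row into an XML element tree.

    Assumption: every header is a valid XML element name and every row
    has the same number of fields as the header.
    """
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    root = ET.Element(root_tag)
    for row in reader:
        record = ET.SubElement(root, row_tag)
        for field, value in row.items():
            # Each column becomes a child element named after its header.
            ET.SubElement(record, field).text = value
    return root


# Hypothetical sample data standing in for a well-structured flat file.
sample = "gene_id\tsymbol\torganism\nG001\tTP53\tHomo sapiens\n"
root = flat_to_xml(sample)
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```

Mapping one column to one element keeps the resulting XML regular, so a matching XML schema is straightforward to write by hand or derive from the header row.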

 

What I've learned:

- Usage of Clio and DB2, and XML-related techniques.

- A more open mind and a wider vision: During my summer internship I met many people from different countries and backgrounds, which gave me many opportunities to learn from others' strengths, respect unfamiliar but rich cultures, and share in people's valuable experiences. Having stayed in Taiwan for so long, I often forget how large the world is, and I am thankful that I could come to IBM.

- Working harder and harder: Competition from other countries is hard to avoid, and I know I have to work more diligently to keep up with outstanding students around the world. The number of students from India and Mainland China is striking, and they make good use of their time for work and study. Such strength is rising and cannot be ignored.