³Åã¬Â, ²¦·~©ó¬F¤j¸ê°T¬ì¾Ç¨t,
²{¬°¶§©ú¤j¾Ç¥Í¤Æ©Ò¥Íª«¸ê°T¾Çµ{¾Ç¥Í.
¹ê²ß³æ¦ì: IBM Almaden Research Center, CA, USA
Abstract:
For the needs to integrate multiple information
sources for more insights or other applications, IBM has developed a
semi-automatic schema mapping tool, called Clio, to help users. Based on my
background and the prevalence of text files in Life Science, I hope to make
efforts to benefit biological lab in using Clio to merge diverse data sources
by the helps from my manager in IBM, Howard Ho, and my advisor, U-C Yang. Since
Clio only accepts XML or relational database as sources, I design a procedure
to deal with well-structured flat files before loading into Clio (flat-> XML
schema->XML files) and in the end produce SQL script through Clio mapping
file to manipulate data into physical databases. It not only increases the
flexibility of Clio¡¦s source types but indeed extends the usage of biological
data.
What I¡¦ve
learned:
u
Usage of
Clio and DB2. XML-related techniques.
u
A more open
mind and wider vision ¡V During my summer intern, I met lots of people from
different countries with different backgrounds and it gave me so many
opportunities to learn other¡¦s advantages, respect unfamiliar but rich cultures
and share people¡¦s valuable experiences. Staying in Taiwan so long often makes
me forget how large the world is and I thank I can come to IBM.
u
Working
harder and harder ¡V Facing the competitions from other countries is hardly to
avoid and I know I have to work more diligently to keep up with those
outstanding students in the world. The amounts of India and Mainland China
students are alarming and they take good use of time for working and studying.
Such strong power is raising and couldn¡¦t be ignored.