Korea Univ • College of Information & Communication Seoul Korea 136-705

OFFICE (02) 3290-4840Cell (010) 8573-1779 Fax (02) 953-0771 • kangj@korea.ac.kr

http://infos.korea.ac.kr

Jaewoo Kang

Education

 

University of Wisconsin-Madison                                                  8/98 – 7/03

Ph.D. in Computer Science.  Database Systems

Advisor: Prof. Jeffrey F. Naughton

University of Colorado at Boulder                                                 8/94 – 5/96

M.S. in Computer Science.  Artificial Intelligence, Machine Learning.

Advisor: Prof. Andreas Weigend

Korea University, Seoul South Korea                                            3/87 – 2/94

B.S. in Computer Science. (On leave: 3/89-2/92 Military Service)

Research Interests

 

My research interests in a broad sense focus on understanding the fundamental aspects of building a large-scale information system that can answer complex queries over a large number (billions) of heterogeneous data sources. I focus on tackling the challenge particularly in data integration, Web data management and mining, Internet-scale distributed systems, large-scale data analytics and bio-medical informatics.  

Experience

 

Professor,  Korea University                                       3/14present

Seoul, Korea.

§  Database Systems, Data Mining, Bio-medical Informatics, Large-scale Internet Information Systems

Associate Professor,  Korea University                                       3/092/14

Seoul, Korea.

§  Database Systems, Data Mining, Bio-medical Informatics, Large-scale Internet Information Systems

Assistant Professor,  Korea University                                        9/062/09

Seoul, Korea.

§  Database Systems, Data Mining, Bio-medical Informatics, Large-scale Internet Information Systems

§  Jointly appointed with Computer Science and Medical School

Assistant Professor,  North Carolina State University                  8/03 – 8/06

Raleigh, NC.

§  Database Systems, Data Integration, Data Mining, Large-scale Internet Information Systems, Scientific Data Management, Grid Computing/Data Grid, Biomedical Databases and Informatics

§  Taught under and graduate database courses for three years with good student evaluation consistently exceeding departmental average.

§  Won Microsoft Bioinformatics Program Award for project titled, “Enabling Large-scale Comparative Analysis across Disparate Science Data”

§  Participated as a coPI in the NOAA ISET Center project (12.5M USD)

Chief Technology Officer/Founder,  WISEngine Inc.                 2/00 – 9/01

Seoul South Korea/Santa Clara CA.

§  40+ employees (26 R&D).  Built and led R&D team, assembled advisory board, presented to 20+ top class VCs, raised first round funding of 1.7 million USD.

§  Developed IDB internet database integration engine.

Technical Staff,   Savera Systems                                                11/97 – 7/98

Summit, NJ.

§  Conducted research and development of semi-real time on-line telecommunication billing system with materialized view optimization.

Consultant/Senior Technical Staff,  AT&T Labs Research      5/96 – 11/97

Murray Hill, NJ/Florham Park, NJ. (Formerly Bell Labs.)

§  Semi-structured Data Management: Co-invented an innovative Web-site management system – STRUDEL. US Patent filed and approved. Successfully demonstrated in SIGMOD 1997.

§  Participated in the Information Manifold Web-source integration system project. Worked with the world-class researchers in AT&T (Bell) Labs.

Awards

 

Microsoft Graduate Fellowship,  2000-2001

Best Paper Award, The 33rd KIPS Spring Conference, April 23-24, 2010

Patents

 

Jaewoo Kang and Hanjun Shin: Apparatus for Processing of EEG Inputted through Single Channel and Processing Method Using the Same. Korea Patent No. 10-1007965 January 6, 2011

Jaewoo Kang, Hanjun Shin, Yoonkyu Kang and Kihoon Kim: Electrodiagnosis Support Apparatus and Method for Diagnosing Neural Injury Using the Same. Korea Patent No. 10-1007964 January 6, 2011.

Mary Fernandez, Daniela Florescu, Jaewoo Kang, Alon Levy, and Dan Suciu: Method and Apparatus for Web Site Management. U.S. Patent No. 5,956,720 September 21, 1999.

Professional Activities

 

Editorial Board Member:

Journal of Computing Science and Engineering, 2008 – present

Journal of Information Processing Systems, 2009 – present

Korea Information Processing Society Review, 2010 - present

Workshop Co-Chair:

ACM SIGMOD PhD Workshop on Innovative Database Research (IDAR), Providence, Rhode Island, USA, June 28, 2009

Workshop PC Co-Chair:

International Workshop on Bio-inspired computing for Hybrid Information Technology (BHIT), Gwangju, Korea, December 9-11, 2010

Conference PC Track Chair:

International Conference on Internet, Mactan Island Resort, Philippines, December 16-20, 2010

Program Committee:

International Database Engineering & Applications Symposiums (IDEAS), Portugal, 2011

ACM SIGMOD International Conference on Management of Data (SIGMOD), Athens, Greece, June 12-16, 2011

International Conference on Database Systems for Advanced Applications (DASFAA), Hong Kong, China, April 22-25, 2011

 

IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Hong Kong, China, December 19-22, 2010

IEEE BIBM Workshop on Data Mining for Biomarker Discovery (DMBD), Hong Kong, China, December 18–21, 2010

International Computer Symposium (ICS), Workshop on Biomedical Informatics, Tainan, Taiwan, December 16-18, 2010

International Workshop on Ubiquitous Computing & Applications (IWUCA), Sanya, China, December 16-18, 2010

International Symposium on Semantic Mining in Biomedicine (SMBM), Hinxton, Cambridgeshire, UK, October 25-26, 2010

International Conference on Database Systems for Advanced Applications (DASFAA), Tsukuba, Japan, April 1-4, 2010

IEEE International Conference on Data Engineering (ICDE), Long Beach, California, USA, March 1-6, 2010

 

International Conference on Information Integration and Web-based Applications & Services (iiWAS), Kuala Lumpur, Malaysia, December 14-16, 2009

International Symposium in Languages in Biology and Medicine (LBM), Seogwipo-si, Jeju Island, Korea, November 8-10, 2009

ACM Conference on Information and Knowledge Management (CIKM), Hong Kong, China, November 2-6, 2009

International Conference on Frontiers of Information Technology, Applications and Tools (FITAT), Cheongju, Chungbuk, Korea, Oct 22-23, 2009

ISIBM International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing (IJCBS), Shanghai, China, August 3-5, 2009

IEEE International Conference on Data Engineering (ICDE), Shanghai, China, March 29-April 4, 2009

 

IEEE International Conference on Data Engineering (ICDE), Cancun, Mexico, April 7-12, 2008

IEEE ICDE Workshop on Mining Multimedia Streams in Large-scale Distributed Environments (MMSDE), Cancun, Mexico, April 7, 2008

International Workshop on Intelligent Informatics in Biology and Medicine (IIBM), Barcelona, Spain, March 4–7, 2008

International Workshop on Scalable Stream Processing Systems (SSPS), Nantes, France, March 29, 2008

 

AAAI Conference on Artificial Intelligence Nectar Paper Track (AAAI Nectar), Vancouver, British Columbia, Canada, July 22-26, 2007

ACM/IEEE Joint Conference on Digital Libraries (JCDL), Vancouver, British Columbia, Canada,   June 18 - 23, 2007

IEEE ICDE Workshop on Scalable Stream Processing Systems (SSPS), Istanbul, Turkey, April 17-20, 2007

 

VLDB Workshop on Clean Databases (CleanDB), Seoul, Korea, September 11, 2006

ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD), Philadelphia, PA, USA, August 20-23, 2006

AAAI Conference on Artificial Intelligence Nectar Paper Track (AAAI Nectar), Boston, MA, USA,  July 16–20, 2006

ACM/IEEE Joint Conference on Digital Libraries (JCDL), Chapel Hill, NC, USA, June 11-15, 2006

 

ACM International Workshop on Web Information and Data Management (WIDM), Bremen, Germany, November 5, 2005

ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD), Chicago, IL, USA, August 21, 2005

 

ACM International Workshop on Web Information and Data Management (WIDM), Washington, DC, USA, November 12-13, 2004

 

Selected Publications   (Available online at http://infos.korea.ac.kr)

 

International Journals:

Pankaj Chopra begin_of_the_skype_highlightingend_of_the_skype_highlighting , Jinseung Lee, Jaewoo Kang begin_of_the_skype_highlightingend_of_the_skype_highlighting , Sunwon Lee: Improving Cancer Classification Accuracy Using Gene Pairs PLoS ONE, 5(12), Dec 2010

Yoojin Hong begin_of_the_skype_highlightingend_of_the_skype_highlighting, Jaewoo Kang, Dongwon Lee begin_of_the_skype_highlightingend_of_the_skype_highlighting, Damian B. van Rossum: Adaptive GDDA-BLAST: Fast and Efficient Algorithm for Protein Sequence Embedding. PLoS ONE, 5(10), Oct 2010

Hanjun Shin, Ki Hoon Kim, Chihwan Song, Injoon Lee, Kyubum Lee, Jaewoo Kang, Yoon Kyoo Kang: Electrodiagnosis support system for localizing neural injury in an upper limb. Journal of the American Medical Informatics Association (JAMIA), Vol. 17, Issue 3, Pages 345-347, May 2010

Gayathri Tambaram Kailasam, and Jin-Seung Lee, Jae-Won Rhee, Jaewoo Kang: Efficient skycube computation using point and domain-based filtering. Information Sciences, Vol. 180, Issue 7, Pages 1090-1103, Apr 2010

HyungJun Cho, Jaewoo Kang, Jae K. Lee: Empirical Bayes analysis of unreplicated microarray data. Computational Statistics, Vol. 24, No. 3, Pages 393-408, Aug 2009

Sungbo Seo, Jaewoo Kang, Keun Ho Ryu: Multivariable stream data classification using motifs and their temporal relations. Information Sciences, Vol. 179, Issue 20, Pages 3489-3504, Aug 2009

Kuan-ming Lin, Jaewoo Kang, Hanjun Shin, Jusang Lee: A Cube Framework for Incorporating Inter-gene Information into Biological Data Mining. Int. J. Data Mining and Bioinformatics, 3(1):3-22, 2009

HyungJun Cho, Ami Yu, Sukwoo Kim, Jaewoo Kang, Seung-Mo Hong: Robust Likelihood-Based Survival Modeling with Microarray Data. J. Statistical Software, 29(1):1548-7660, January 2009

Jaewoo Kang, Jeffrey F. Naughton: Schema Matching Using Interattribute Dependencies. IEEE Transactions on Knowledge and Data Engineering, 20(10):1393-1407, October 2008

Sunshin Kim, Jaewoo Kang, Yong Je Chung, Jinyan Li, Keun Ho Ryu: Clustering orthologous proteins across phylogenetically distant species. Proteins: Structure, Function, and Bioinformatics, 71(3), May 2008

Pankaj Chopra, Jaewoo Kang, Jiong Yang, HyungJun Cho, Heenam S Kim, Min-Goo Lee: Microarray data mining using landmark gene-guided clustering. BMC Bioinformatics, 9(92), February 2008

Dongwon Lee, Jaewoo Kang, Prasenjit Mitra, C. Lee Giles, Byung-Won On: Are Your Citations Clean?: New Scenarios and Challenges in Maintaining Digital Libraries. Communications of the ACM, 2007

Bin Song, Jeong-Hyeon Choi, Guangyu Chen, Jacek Szymanski, Guo-Qiang Zhang, Anthony K. H. Tung, Jaewoo Kang, Sun Kim, and Jiong Yang: ARCS: An Aggregated Related Column Scoring Scheme for Aligned Sequences. Bioinformatics, 22(19):2326-2332, October 2006.

Jeffrey F. Naughton, David J. DeWitt, David Maier, Ashraf Aboulnaga, Jianjun Chen, Leonidas Galanis, Jaewoo Kang, Rajasekar Krishnamurthy, Qiong Luo, Naveen Prakash, Ravishankar Ramamurthy, Jayavel Shanmugasundaram, Feng Tian, Kristin Tufte, Stratis Viglas: The Niagara Internet Query System. IEEE Data Engineering Bulletin, Vol. 24, No. 2, Pages 27-33, Jun 2001

Mary F. Fernández, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: Overview of Strudel - A Web-Site Management System. Networking and Information Systems Journal, Vol. 1, Pages 115-140, 1998

International Conferences:

Quinsong Jin, Jaewoo Kang, Injoon Lee: Metadata-driven Subspace Collaborative Filtering. The 1st International Conference on Internet (ICONI), Dec 2009

Hanjun Shin, Himchan Kim,Sangjun Lee, Jaewoo Kang: Online Removal of Ocular Artifacts from Single Channel EEG for Ubiquitous Healthcare Applications. The 4th International Conference on Ubiquitous Information Technologies & Applications (ICUT), Dec 20-22, 2009

Pankaj Chopra, Jaewoo Kang, Seung-Mo Hong: Meta-analysis of cancer microarray data reveals signaling pathway hotspots. International Workshop on Data Mining for Biomarker Discovery (DMBD), Washington D.C., USA, Nov 1-4, 2009

Pankaj Chopra, Jaewoo Kang, Jinseung Lee: Using Gene Pair Combinations to Improve the Accuracy of the PAM Classifier. Proceedings of the IEEE International conference on Bioinformatics and Biomedicine (BIBM), Washington D.C., USA, Nov 2009

Pankaj Chopra begin_of_the_skype_highlightingend_of_the_skype_highlighting, Han Jun Shin, Jaewoo Kang: Global gene map for cancer reveals pathway hotspots. Proceedings of the IEEE International conference on Bioinformatics and Biomedicine (BIBM), Philadelphia, USA. Nov 3-5, 2008

Yoojin Hong, Tao Yang, Jaewoo Kang, Dongwon Lee: Record Linkage as DNA Sequence Alignment Problem. Proceedings of the 6th International Workshop on Quality in Databases and Management of Uncertain Data(QDB), Auckland, New Zealand, August 25, 2008

Qiankun Zhao, Prasenjit Mitra, Dongwon Lee, Jaewoo Kang: HICCUP: Hierarchical Clustering Based Value Imputation using Heterogeneous Gene Expression Microarray Datasets. Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering (BIBE), Harvard Medical School, Boston, MA, USA, October 14-17, 2007: 71-78

Tae Sik Han, Seung-Kyu Ko, Jaewoo Kang: Efficient Subsequence Matching Using the Longest Common Subsequence with a Dual Match Index.

Amit Awekar, Jaewoo Kang: Selective approach to handling topic oriented tasks on the world wide web. Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining (IEEE CIDM), Honolulu, Hawaii, April 2007.

Kuan-ming Lin, Jaewoo Kang: Exploiting Inter-gene Information for Microarray Data Integration. Proceedings of the 22nd Annual ACM Symposium on Applied Computing (ACM SAC), Seoul, Korea, March 2007.

ByungWon On, Ergin Elmacioglu, Dongwon Lee, Jaewoo Kang, Jian Pei: Improving Grouped-Entity Resolution using Quasi-Cliques. To appear in Proceedings of the IEEE International Conference on Data Mining (IEEE ICDM), Hong Kong, China, December 2006.

Sungbo Seo, Jaewoo Kang, Dongwon Lee, Keun Ho Ryu: Multivariate Stream Data Classification Using Simple Text Classifiers. To appear in Proceedings of the 17th International Conference on Database and Expert Systems Applications (DEXA), Krakow, Poland, September 4-8, 2006.

ByungWon On, Ergin Elmacioglu, Dongwon Lee, Jaewoo Kang, Jian Pei: An Effective Approach to Entity Resolution Problem Using QuasiClique and its Application to Digital Libraries (short paper). In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (ACM/IEEE JCDL), Chapel Hill, NC, June 11-15, 2006.

Amit C. Awekar, Pabitra Mitra, Jaewoo Kang: Selective Hypertext Induced Topic Search (poster paper). In Proceedings of the 15th International World Wide Web Conference (WWW), Edinburgh, Scotland, May 23-26, 2006.

Sungbo Seo, Jaewoo Kang, Keun Ho Ryu: Multivariate Stream Data Transmission in Sensor Network Applications. In Proceedings of the International Symposium on Ubiquitous Intelligence and Smart Worlds (UISW), p.198-207, Nagasaki, Japan, December 6-7, 2005, (LNCS 3823, Springer 2005, ISBN 3-540-30803-2)

Jaewoo Kang, Dongwon Lee, and Prasenjit Mitra: Identifying Value Mappings for Data Integration: An Unsupervised Approach. In Proceedings of the International Conference on Web Information Systems Engineering (WISE), p.544-551, New York, NY, November 20-22, 2005, (LNCS 3806, Springer 2005, ISBN 3-540-30017-1)

Jaewoo Kang, Tae Sik Han, Dongwon Lee, Prasenjit Mitra: Establishing Value Mappings Using Statistical Models and User Feedback. In Proceedings of the ACM Conference on Information and Knowledge Management (ACM CIKM), p.68-75, Bremen, Germany, Oct 31- Nov 5, 2005.

Jaewoo Kang, Jiong Yang, Wanhong Xu, Pankaj Chopra: Integrating Heterogeneous Microarray Data Sources using Correlation Signatures. In Proceedings of the International Workshop on Data Integration in the Life Sciences (DILS), p.105-120, San Diego, CA, July 20-22, 2005, (LNCS 3615, Springer 2005, ISBN 3-540-27967-9)

Dongwon Lee, Byung-Won On, Jaewoo Kang, Sanghyun Park: Effective and Scalable Solutions for Mixed and Split Citation Problems in Digital Libraries. In Proceedings of the ACM SIGMOD Workshop on Information Quality in Information Systems (ACM IQIS), p.69-76, Baltimore, MD, June 17, 2005.

Byung-Won On, Dongwon Lee, Jaewoo Kang, Prasenjit Mitra: Comparative Study of Name Disambiguation Problem using a Scalable Blocking-based Framework. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (ACM/IEEE JCDL), p.344-353, Denver, CO, June 7-11, 2005.

Ramrajprabu Balasubramanian, Injong Rhee, Jaewoo Kang: A Scalable Architecture for SIP Infrastructure using Content Addressable Networks. In Proceedings of the IEEE International Conference on Communications (IEEE ICC), p.1314-1318, Seoul, Korea, May 16-20, 2005.      

Jaewoo Kang, Jeffrey F. Naughton: On Schema Matching with Opaque Column Names and Data Values. In Proceedings of the ACM International Conference on Management of Data (ACM SIGMOD), p.205-216, San Diego, California, June 9-12, 2003.

Jaewoo Kang, Jeffrey F. Naughton, Stratis D. Viglas: Evaluating Window Joins over Unbounded Streams. In Proceedings of the IEEE International Conference on Data Engineering (IEEE ICDE), p.341-352, Bangalore, India, March 5-8, 2003.

Mary F. Fernandez, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: Catching the Boat with Strudel: Experiences with a Web-Site Management System. In Proceedings of the ACM International Conference on Management of Data (ACM SIGMOD), p.414-425, Seattle, Washington, June 2-4, 1998.

Mary F. Fernandez, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: System Demonstration - STRUDEL: A Web-Site Management System. In Proceedings of the ACM International Conference on Management of Data (ACM SIGMOD), p. 549-552, Tucson, Arizona, May 13-15, 1997.

Jaewoo Kang, Mark Choey, Andreas Weigend: Maximizing Risk-Adjusted Return in Financial Time Series. In Computing Science & Statistics (28th Symposium INTERFACE), p.677-681, Sidney, Austrailia, July 1996.