Korea Univ • College of Information & Communication • Seoul • Korea 136-705 |
OFFICE (02) 3290-4840 • Cell (010) 8573-1779 • Fax (02) 953-0771 • kangj@korea.ac.kr http://infos.korea.ac.kr |
Jaewoo Kang
Education |
|
|
University of Wisconsin-Madison 8/98 – 7/03 Ph.D. in Computer Science. Database Systems Advisor: Prof. Jeffrey F. Naughton University of Colorado at Boulder 8/94 – 5/96 M.S. in Computer Science. Artificial Intelligence, Machine Learning. Advisor: Prof. Andreas Weigend Korea University, Seoul South Korea 3/87 – 2/94 B.S. in Computer Science. (On leave: 3/89-2/92 Military Service) |
Research Interests |
|
|
My research interests in a broad sense focus on understanding the fundamental aspects of building a large-scale information system that can answer complex queries over a large number (billions) of heterogeneous data sources. I focus on tackling the challenge particularly in data integration, Web data management and mining, Internet-scale distributed systems, large-scale data analytics and bio-medical informatics. |
Experience |
|
|
Professor, Korea University 3/14 – present Seoul, Korea. § Database Systems, Data Mining, Bio-medical Informatics, Large-scale Internet Information Systems Associate Professor, Korea University 3/09 – 2/14 Seoul, Korea. § Database Systems, Data Mining, Bio-medical Informatics, Large-scale Internet Information Systems Assistant Professor, Korea University 9/06 – 2/09 Seoul, Korea. § Database Systems, Data Mining, Bio-medical Informatics, Large-scale Internet Information Systems § Jointly appointed with Computer Science and Medical School Assistant Professor, North Carolina State University 8/03 – 8/06 Raleigh, NC. § Database Systems, Data Integration, Data Mining, Large-scale Internet Information Systems, Scientific Data Management, Grid Computing/Data Grid, Biomedical Databases and Informatics § Taught under and graduate database courses for three years with good student evaluation consistently exceeding departmental average. § Won Microsoft Bioinformatics Program Award for project titled, “Enabling Large-scale Comparative Analysis across Disparate Science Data” § Participated as a coPI in the NOAA ISET Center project (12.5M USD) Chief Technology Officer/Founder, WISEngine Inc. 2/00 – 9/01 Seoul South Korea/Santa Clara CA. § 40+ employees (26 R&D). Built and led R&D team, assembled advisory board, presented to 20+ top class VCs, raised first round funding of 1.7 million USD. § Developed IDB internet database integration engine. Technical Staff, Savera Systems 11/97 – 7/98 Summit, NJ. § Conducted research and development of semi-real time on-line telecommunication billing system with materialized view optimization. Consultant/Senior Technical Staff, AT&T Labs Research 5/96 – 11/97 Murray Hill, NJ/Florham Park, NJ. (Formerly Bell Labs.) § Semi-structured Data Management: Co-invented an innovative Web-site management system – STRUDEL. US Patent filed and approved. Successfully demonstrated in SIGMOD 1997. § Participated in the Information Manifold Web-source integration system project. Worked with the world-class researchers in AT&T (Bell) Labs. |
Awards |
|
|
Microsoft Graduate Fellowship, 2000-2001 Best Paper Award, The 33rd KIPS Spring Conference, April 23-24, 2010 |
Patents |
|
|
Jaewoo Kang and Hanjun Shin: Apparatus for Processing of EEG Inputted through Single Channel and Processing Method Using the Same. Korea Patent No. 10-1007965 January 6, 2011 Jaewoo Kang, Hanjun Shin, Yoonkyu Kang and Kihoon Kim: Electrodiagnosis Support Apparatus and Method for Diagnosing Neural Injury Using the Same. Korea Patent No. 10-1007964 January 6, 2011. Mary Fernandez, Daniela Florescu, Jaewoo Kang, Alon Levy, and Dan Suciu: Method and Apparatus for Web Site Management. U.S. Patent No. 5,956,720 September 21, 1999. |
Professional Activities |
|
|
Editorial Board Member: Journal of Computing Science and Engineering, 2008 – present Journal of Information Processing Systems, 2009 – present Korea Information Processing Society Review, 2010 - present Workshop Co-Chair: ACM SIGMOD PhD Workshop on Innovative Database Research (IDAR), Providence, Rhode Island, USA, June 28, 2009 Workshop PC Co-Chair: International Workshop on Bio-inspired computing for Hybrid Information Technology (BHIT), Gwangju, Korea, December 9-11, 2010 Conference PC Track Chair: International Conference on Internet, Mactan Island Resort, Philippines, December 16-20, 2010 Program Committee: International Database Engineering & Applications Symposiums (IDEAS), Portugal, 2011 ACM SIGMOD International Conference on Management of Data (SIGMOD), Athens, Greece, June 12-16, 2011 International Conference on Database Systems for Advanced Applications (DASFAA), Hong Kong, China, April 22-25, 2011
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Hong Kong, China, December 19-22, 2010 IEEE BIBM Workshop on Data Mining for Biomarker Discovery (DMBD), Hong Kong, China, December 18–21, 2010 International Computer Symposium (ICS), Workshop on Biomedical Informatics, Tainan, Taiwan, December 16-18, 2010 International Workshop on Ubiquitous Computing & Applications (IWUCA), Sanya, China, December 16-18, 2010 International Symposium on Semantic Mining in Biomedicine (SMBM), Hinxton, Cambridgeshire, UK, October 25-26, 2010 International Conference on Database Systems for Advanced Applications (DASFAA), Tsukuba, Japan, April 1-4, 2010 IEEE International Conference on Data Engineering (ICDE), Long Beach, California, USA, March 1-6, 2010
International Conference on Information Integration and Web-based Applications & Services (iiWAS), Kuala Lumpur, Malaysia, December 14-16, 2009 International Symposium in Languages in Biology and Medicine (LBM), Seogwipo-si, Jeju Island, Korea, November 8-10, 2009 ACM Conference on Information and Knowledge Management (CIKM), Hong Kong, China, November 2-6, 2009 International Conference on Frontiers of Information Technology, Applications and Tools (FITAT), Cheongju, Chungbuk, Korea, Oct 22-23, 2009 ISIBM International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing (IJCBS), Shanghai, China, August 3-5, 2009 IEEE International Conference on Data Engineering (ICDE), Shanghai, China, March 29-April 4, 2009
IEEE International Conference on Data Engineering (ICDE), Cancun, Mexico, April 7-12, 2008 IEEE ICDE Workshop on Mining Multimedia Streams in Large-scale Distributed Environments (MMSDE), Cancun, Mexico, April 7, 2008 International Workshop on Intelligent Informatics in Biology and Medicine (IIBM), Barcelona, Spain, March 4–7, 2008 International Workshop on Scalable Stream Processing Systems (SSPS), Nantes, France, March 29, 2008
AAAI Conference on Artificial Intelligence Nectar Paper Track (AAAI Nectar), Vancouver, British Columbia, Canada, July 22-26, 2007 ACM/IEEE Joint Conference on Digital Libraries (JCDL), Vancouver, British Columbia, Canada, June 18 - 23, 2007 IEEE ICDE Workshop on Scalable Stream Processing Systems (SSPS), Istanbul, Turkey, April 17-20, 2007
VLDB Workshop on Clean Databases (CleanDB), Seoul, Korea, September 11, 2006 ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD), Philadelphia, PA, USA, August 20-23, 2006 AAAI Conference on Artificial Intelligence Nectar Paper Track (AAAI Nectar), Boston, MA, USA, July 16–20, 2006 ACM/IEEE Joint Conference on Digital Libraries (JCDL), Chapel Hill, NC, USA, June 11-15, 2006
ACM International Workshop on Web Information and Data Management (WIDM), Bremen, Germany, November 5, 2005 ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD), Chicago, IL, USA, August 21, 2005
ACM International Workshop on Web Information and Data Management (WIDM), Washington, DC, USA, November 12-13, 2004
|
Selected Publications (Available online at http://infos.korea.ac.kr) |
|
|
International Journals: Pankaj Chopra , Jinseung Lee, Jaewoo Kang , Sunwon Lee: Improving Cancer Classification Accuracy Using Gene Pairs PLoS ONE, 5(12), Dec 2010 Yoojin Hong , Jaewoo Kang, Dongwon Lee , Damian B. van Rossum: Adaptive GDDA-BLAST: Fast and Efficient Algorithm for Protein Sequence Embedding. PLoS ONE, 5(10), Oct 2010 Hanjun Shin, Ki Hoon Kim, Chihwan Song, Injoon Lee, Kyubum Lee, Jaewoo Kang, Yoon Kyoo Kang: Electrodiagnosis support system for localizing neural injury in an upper limb. Journal of the American Medical Informatics Association (JAMIA), Vol. 17, Issue 3, Pages 345-347, May 2010 Gayathri Tambaram Kailasam, and Jin-Seung Lee, Jae-Won Rhee, Jaewoo Kang: Efficient skycube computation using point and domain-based filtering. Information Sciences, Vol. 180, Issue 7, Pages 1090-1103, Apr 2010 HyungJun Cho, Jaewoo Kang, Jae K. Lee: Empirical Bayes analysis of unreplicated microarray data. Computational Statistics, Vol. 24, No. 3, Pages 393-408, Aug 2009 Sungbo Seo, Jaewoo Kang, Keun Ho Ryu: Multivariable stream data classification using motifs and their temporal relations. Information Sciences, Vol. 179, Issue 20, Pages 3489-3504, Aug 2009 Kuan-ming Lin, Jaewoo Kang, Hanjun Shin, Jusang Lee: A Cube Framework for Incorporating Inter-gene Information into Biological Data Mining. Int. J. Data Mining and Bioinformatics, 3(1):3-22, 2009 HyungJun Cho, Ami Yu, Sukwoo Kim, Jaewoo Kang, Seung-Mo Hong: Robust Likelihood-Based Survival Modeling with Microarray Data. J. Statistical Software, 29(1):1548-7660, January 2009 Jaewoo Kang, Jeffrey F. Naughton: Schema Matching Using Interattribute Dependencies. IEEE Transactions on Knowledge and Data Engineering, 20(10):1393-1407, October 2008 Sunshin Kim, Jaewoo Kang, Yong Je Chung, Jinyan Li, Keun Ho Ryu: Clustering orthologous proteins across phylogenetically distant species. Proteins: Structure, Function, and Bioinformatics, 71(3), May 2008 Pankaj Chopra, Jaewoo Kang, Jiong Yang, HyungJun Cho, Heenam S Kim, Min-Goo Lee: Microarray data mining using landmark gene-guided clustering. BMC Bioinformatics, 9(92), February 2008 Dongwon Lee, Jaewoo Kang, Prasenjit Mitra, C. Lee Giles, Byung-Won On: Are Your Citations Clean?: New Scenarios and Challenges in Maintaining Digital Libraries. Communications of the ACM, 2007 Bin Song, Jeong-Hyeon Choi, Guangyu Chen, Jacek Szymanski, Guo-Qiang Zhang, Anthony K. H. Tung, Jaewoo Kang, Sun Kim, and Jiong Yang: ARCS: An Aggregated Related Column Scoring Scheme for Aligned Sequences. Bioinformatics, 22(19):2326-2332, October 2006. Jeffrey F. Naughton, David J. DeWitt, David Maier, Ashraf Aboulnaga, Jianjun Chen, Leonidas Galanis, Jaewoo Kang, Rajasekar Krishnamurthy, Qiong Luo, Naveen Prakash, Ravishankar Ramamurthy, Jayavel Shanmugasundaram, Feng Tian, Kristin Tufte, Stratis Viglas: The Niagara Internet Query System. IEEE Data Engineering Bulletin, Vol. 24, No. 2, Pages 27-33, Jun 2001 Mary F. Fernández, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: Overview of Strudel - A Web-Site Management System. Networking and Information Systems Journal, Vol. 1, Pages 115-140, 1998 International Conferences: Quinsong Jin, Jaewoo Kang, Injoon Lee: Metadata-driven Subspace Collaborative Filtering. The 1st International Conference on Internet (ICONI), Dec 2009 Hanjun Shin, Himchan Kim,Sangjun Lee, Jaewoo Kang: Online Removal of Ocular Artifacts from Single Channel EEG for Ubiquitous Healthcare Applications. The 4th International Conference on Ubiquitous Information Technologies & Applications (ICUT), Dec 20-22, 2009 Pankaj Chopra, Jaewoo Kang, Seung-Mo Hong: Meta-analysis of cancer microarray data reveals signaling pathway hotspots. International Workshop on Data Mining for Biomarker Discovery (DMBD), Washington D.C., USA, Nov 1-4, 2009 Pankaj Chopra, Jaewoo Kang, Jinseung Lee: Using Gene Pair Combinations to Improve the Accuracy of the PAM Classifier. Proceedings of the IEEE International conference on Bioinformatics and Biomedicine (BIBM), Washington D.C., USA, Nov 2009 Pankaj Chopra , Han Jun Shin, Jaewoo Kang: Global gene map for cancer reveals pathway hotspots. Proceedings of the IEEE International conference on Bioinformatics and Biomedicine (BIBM), Philadelphia, USA. Nov 3-5, 2008 Yoojin Hong, Tao Yang, Jaewoo Kang, Dongwon Lee: Record Linkage as DNA Sequence Alignment Problem. Proceedings of the 6th International Workshop on Quality in Databases and Management of Uncertain Data(QDB), Auckland, New Zealand, August 25, 2008 Qiankun Zhao, Prasenjit Mitra, Dongwon Lee, Jaewoo Kang: HICCUP: Hierarchical Clustering Based Value Imputation using Heterogeneous Gene Expression Microarray Datasets. Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering (BIBE), Harvard Medical School, Boston, MA, USA, October 14-17, 2007: 71-78 Tae Sik Han, Seung-Kyu Ko, Jaewoo Kang: Efficient Subsequence Matching Using the Longest Common Subsequence with a Dual Match Index. Amit Awekar, Jaewoo Kang: Selective approach to handling topic oriented tasks on the world wide web. Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining (IEEE CIDM), Honolulu, Hawaii, April 2007. Kuan-ming Lin, Jaewoo Kang: Exploiting Inter-gene Information for Microarray Data Integration. Proceedings of the 22nd Annual ACM Symposium on Applied Computing (ACM SAC), Seoul, Korea, March 2007. ByungWon On, Ergin Elmacioglu, Dongwon Lee, Jaewoo Kang, Jian Pei: Improving Grouped-Entity Resolution using Quasi-Cliques. To appear in Proceedings of the IEEE International Conference on Data Mining (IEEE ICDM), Hong Kong, China, December 2006. Sungbo Seo, Jaewoo Kang, Dongwon Lee, Keun Ho Ryu: Multivariate Stream Data Classification Using Simple Text Classifiers. To appear in Proceedings of the 17th International Conference on Database and Expert Systems Applications (DEXA), Krakow, Poland, September 4-8, 2006. ByungWon On, Ergin Elmacioglu, Dongwon Lee, Jaewoo Kang, Jian Pei: An Effective Approach to Entity Resolution Problem Using QuasiClique and its Application to Digital Libraries (short paper). In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (ACM/IEEE JCDL), Chapel Hill, NC, June 11-15, 2006. Amit C. Awekar, Pabitra Mitra, Jaewoo Kang: Selective Hypertext Induced Topic Search (poster paper). In Proceedings of the 15th International World Wide Web Conference (WWW), Edinburgh, Scotland, May 23-26, 2006. Sungbo Seo, Jaewoo Kang, Keun Ho Ryu: Multivariate Stream Data Transmission in Sensor Network Applications. In Proceedings of the International Symposium on Ubiquitous Intelligence and Smart Worlds (UISW), p.198-207, Nagasaki, Japan, December 6-7, 2005, (LNCS 3823, Springer 2005, ISBN 3-540-30803-2) Jaewoo Kang, Dongwon Lee, and Prasenjit Mitra: Identifying Value Mappings for Data Integration: An Unsupervised Approach. In Proceedings of the International Conference on Web Information Systems Engineering (WISE), p.544-551, New York, NY, November 20-22, 2005, (LNCS 3806, Springer 2005, ISBN 3-540-30017-1) Jaewoo Kang, Tae Sik Han, Dongwon Lee, Prasenjit Mitra: Establishing Value Mappings Using Statistical Models and User Feedback. In Proceedings of the ACM Conference on Information and Knowledge Management (ACM CIKM), p.68-75, Bremen, Germany, Oct 31- Nov 5, 2005. Jaewoo Kang, Jiong Yang, Wanhong Xu, Pankaj Chopra: Integrating Heterogeneous Microarray Data Sources using Correlation Signatures. In Proceedings of the International Workshop on Data Integration in the Life Sciences (DILS), p.105-120, San Diego, CA, July 20-22, 2005, (LNCS 3615, Springer 2005, ISBN 3-540-27967-9) Dongwon Lee, Byung-Won On, Jaewoo Kang, Sanghyun Park: Effective and Scalable Solutions for Mixed and Split Citation Problems in Digital Libraries. In Proceedings of the ACM SIGMOD Workshop on Information Quality in Information Systems (ACM IQIS), p.69-76, Baltimore, MD, June 17, 2005. Byung-Won On, Dongwon Lee, Jaewoo Kang, Prasenjit Mitra: Comparative Study of Name Disambiguation Problem using a Scalable Blocking-based Framework. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries (ACM/IEEE JCDL), p.344-353, Denver, CO, June 7-11, 2005. Ramrajprabu Balasubramanian, Injong Rhee, Jaewoo Kang: A Scalable Architecture for SIP Infrastructure using Content Addressable Networks. In Proceedings of the IEEE International Conference on Communications (IEEE ICC), p.1314-1318, Seoul, Korea, May 16-20, 2005. Jaewoo Kang, Jeffrey F. Naughton: On Schema Matching with Opaque Column Names and Data Values. In Proceedings of the ACM International Conference on Management of Data (ACM SIGMOD), p.205-216, San Diego, California, June 9-12, 2003. Jaewoo Kang, Jeffrey F. Naughton, Stratis D. Viglas: Evaluating Window Joins over Unbounded Streams. In Proceedings of the IEEE International Conference on Data Engineering (IEEE ICDE), p.341-352, Bangalore, India, March 5-8, 2003. Mary F. Fernandez, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: Catching the Boat with Strudel: Experiences with a Web-Site Management System. In Proceedings of the ACM International Conference on Management of Data (ACM SIGMOD), p.414-425, Seattle, Washington, June 2-4, 1998. Mary F. Fernandez, Daniela Florescu, Jaewoo Kang, Alon Y. Levy, Dan Suciu: System Demonstration - STRUDEL: A Web-Site Management System. In Proceedings of the ACM International Conference on Management of Data (ACM SIGMOD), p. 549-552, Tucson, Arizona, May 13-15, 1997. Jaewoo Kang, Mark Choey, Andreas Weigend: Maximizing Risk-Adjusted Return in Financial Time Series. In Computing Science & Statistics (28th Symposium INTERFACE), p.677-681, Sidney, Austrailia, July 1996. |