UIUC Bioinformatics Seminar, Fall 2006 Login | Register
Spring 2006 talks

01/20/2006 Ping Ma, Statistics 
01/27/2006 Brendan Frey, Engineering (U. Toronto) 
02/03/2006 Charles Whitfield, Entomology 
02/17/2006 Jose Meseguer, Computer Science 
02/24/2006 Xinguang Zhu, Plant Science 
03/03/2006 Jing Jiang, Computer Science 
03/10/2006 Bioinformatics Summit Week 
03/17/2006 Carlos Santos, Bioinformatics (U. Mich.) 
03/24/2006 UIUC spring break 
03/31/2006 Mike Colvin, Natural Sciences (UC-Merced) 
04/07/2006 No meeting 
04/14/2006 Huixia (Judy) Wang, Statistics 
04/21/2006 Jay Mittenthal, Cell & Structural Biology 
04/28/2006 William Hersh, Medical Informatics (OHSU) 
05/05/2006 Michael Erdmann (Carnegie Mellon) 
Fall 2005 talks

08/26/2005 Sheng Zhong, Bioengineering 
09/02/2005 Richard LeDuc, NIDA Center for Neuroproteomics 
09/09/2005 Xifeng Yan, Computer Science 
09/16/2005 Xu Ling, Computer Science 
09/23/2005 Saurabh Sinha, Computer Science 
09/30/2005 Hui Fang, Computer Science 
10/07/2005 Bruce Schatz, Medical Information Sciences 
10/14/2005 Kathy Lu, Bioengineering 
10/21/2005 Peter Bajcsy, NCSA 
10/28/2005 Uriel Kitron, Veterinary Medicine 
11/04/2005 Denis Larkin, Animal Sciences 
11/11/2005 Matthew Hudson, Crop Sciences 
12/02/2005 Sandra Rodriguez-Zas, Animal Sciences 
Spring 2005 talks

Charles Whitfield (Entomology) 1/28/05 
Peter Bajcsy (Automated Learning Group) 2/4/05 
Wei Xie (Chemical & Biomolecular Engineering) 2/11/05 
Gustavo Caetano-Anolles (Crop Science) 2/18/05 
Bruce Schatz (GSLIS; IGB) 2/25/05 
Xinguang Zhu (Plant Science) 3/4/05 
Gary Olsen (Microbiology) 3/11/05 
Tao Tao (Computer Science) 3/18/05 
Sameer Varma (Biophysics and Computational Biology) 4/1/05 
Christine Elsik (Texas A&M Univ.) 4/8/05 
Xin He (Bioinformatics MS Option, Computer Science) 4/15/05 
Xinghua Lu (Medical Univ. of S. Carolina) 4/22/05 
Spring 2006 talks
Date Event Related link  
03/03/2006 Jing Jiang, Computer Science  

Title: Exploiting the Domain Structure for Gene Recognition from Biomedical Text

Abstract: Automatic recognition of gene names and protein names is a critical first step towards information extraction and text mining in biomedical literature. The state-of-the-art biomedical named entity taggers are mostly based on supervised learning methods, and their performance depends largely on the availability and the quality of the training data. Because of the different gene naming conventions among different biological species, the performance of a gene tagger trained on text containing gene names of certain species may degrade significantly when it is used to recognize gene names of new species. We propose an approach to gene recognition (and to named entity recognition in general) that exploits the domain structure in the training data to improve the performance of the tagger on new domains. We show that by combining the feature ranking in each training domain and imposing such generalizability-based new feature ranking as a prior, we can learn patterns that generalize well in all domains. Our results from three sets of experiments show that the proposed method outperforms a baseline method that does not consider the domain structure in the training data.

Biography: Jing Jiang is a third-year Ph.D. student in the Department of Computer Science at UIUC. Her research interests are mainly in information extraction and its application in biomedical literature.