TY - GEN
T1 - Biomedical text mining applied to document retrieval and semantic indexing
AU - Lourenço, Anália
AU - Carneiro, Sónia
AU - Ferreira, Eugénio C.
AU - Carreira, Rafael
AU - Rocha, Luis M.
AU - Glez-Peña, Daniel
AU - Méndez, José R.
AU - Fdez-Riverola, Florentino
AU - Diaz, Fernando
AU - Rocha, Isabel
AU - Rocha, Miguel
PY - 2009
Y1 - 2009
N2 - In Biomedical research, the ability to retrieve the adequate information from the ever growing literature is an extremely important asset. This work provides an enhanced and general purpose approach to the process of document retrieval that enables the filtering of PubMed query results. The system is based on semantic indexing providing, for each set of retrieved documents, a network that links documents and relevant terms obtained by the annotation of biological entities (e.g. genes or proteins). This network provides distinct user perspectives and allows navigation over documents with similar terms and is also used to assess document relevance. A network learning procedure, based on previous work from e-mail spam filtering, is proposed, receiving as input a training set of manually classified documents.
AB - In Biomedical research, the ability to retrieve the adequate information from the ever growing literature is an extremely important asset. This work provides an enhanced and general purpose approach to the process of document retrieval that enables the filtering of PubMed query results. The system is based on semantic indexing providing, for each set of retrieved documents, a network that links documents and relevant terms obtained by the annotation of biological entities (e.g. genes or proteins). This network provides distinct user perspectives and allows navigation over documents with similar terms and is also used to assess document relevance. A network learning procedure, based on previous work from e-mail spam filtering, is proposed, receiving as input a training set of manually classified documents.
KW - Biomedical document retrieval
KW - Document relevance
KW - Enhanced instance retrieval network
KW - Named entity recognition
KW - Semantic indexing document network
UR - https://www.scopus.com/pages/publications/77952577426
U2 - 10.1007/978-3-642-02481-8_146
DO - 10.1007/978-3-642-02481-8_146
M3 - Conference contribution
SN - 3642024807
SN - 9783642024801
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 954
EP - 963
BT - Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, Ambient Assisted Living - 10th Int. Work-Conf. Artificial Neural Networks, IWANN 2009 Workshops, Proceedings
T2 - 10th International Work-Conference on Artificial Neural Networks, IWANN 2009
Y2 - 10 June 2009 through 12 June 2009
ER -