Tackling the challenges of matching biomedical ontologies

Background Biomedical ontologies pose several challenges to ontology matching due both to the complexity of the biomedical domain and to the characteristics of the ontologies themselves. The biomedical tracks in the Ontology Matching Evaluation Initiative (OAEI) have spurred the development of matching systems able to tackle these challenges, and benchmarked their general performance. In this study, we dissect the strategies employed by matching systems to tackle the challenges of matching biomedical ontologies and gauge the impact of the challenges themselves on matching performance, using the AgreementMakerLight (AML) system as the platform for this study. Results We demonstrate that the linear complexity of the hash-based searching strategy implemented by most state-of-the-art ontology matching systems is essential for matching large biomedical ontologies efficiently. We show that accounting for all lexical annotations (e.g., labels and synonyms) in biomedical ontologies leads to a substantial improvement in F-measure over using only the primary name, and that accounting for the reliability of different types of annotations generally also leads to a marked improvement. Finally, we show that cross-references are a reliable source of information and that, when using biomedical ontologies as background knowledge, it is generally more reliable to use them as mediators than to perform lexical expansion. Conclusions We anticipate that translating traditional matching algorithms to the hash-based searching paradigm will be a critical direction for the future development of the field. Improving the evaluation carried out in the biomedical tracks of the OAEI will also be important, as without proper reference alignments there is only so much that can be ascertained about matching systems or strategies. Nevertheless, it is clear that, to tackle the various challenges posed by biomedical ontologies, ontology matching systems must be able to efficiently combine multiple strategies into a mature matching approach. Electronic supplementary material The online version of this article (doi:10.1186/s13326-017-0170-9) contains supplementary material, which is available to authorized users.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.1186/s13326-017-0170-9
PID pmid:29335022
PID pmc:PMC5769431
URL http://dx.doi.org/10.1186/s13326-017-0170-9
URL https://academic.microsoft.com/#/detail/2783038509
URL https://jbiomedsem.biomedcentral.com/articles/10.1186/s13326-017-0170-9
URL https://doi.org/10.1186/s13326-017-0170-9
URL http://link.springer.com/content/pdf/10.1186/s13326-017-0170-9.pdf
URL https://www.ncbi.nlm.nih.gov/pubmed/29335022
URL http://www.scopus.com/inward/record.url?scp=85040717454&partnerID=8YFLogxK
URL http://europepmc.org/articles/PMC5769431
URL https://doaj.org/toc/2041-1480
URL https://jbiomedsem.biomedcentral.com/track/pdf/10.1186/s13326-017-0170-9
URL http://link.springer.com/article/10.1186/s13326-017-0170-9/fulltext.html
URL http://europepmc.org/abstract/MED/29335022
URL https://www.research.manchester.ac.uk/portal/en/publications/tackling-the-challenges-of-matching-biomedical-ontologies(25ad6b56-60b6-4123-a639-ce4b1610f806).html
URL https://dx.doi.org/10.1186/s13326-017-0170-9
URL https://dblp.uni-trier.de/db/journals/biomedsem/biomedsem9.html#FariaPMMCC18
URL https://core.ac.uk/display/153752757
URL http://link.springer.com/article/10.1186/s13326-017-0170-9
URL https://link.springer.com/article/10.1186/s13326-017-0170-9
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Isabela Mott, 0000-0002-4697-296X
Author Catia Pesquita, 0000-0002-1847-9393
Author Daniel Faria, 0000-0003-1511-277X
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Europe PubMed Central; PubMed Central; ORCID; Datacite; UnpayWall; The University of Manchester - Institutional Repository; DOAJ-Articles; Crossref; Microsoft Academic Graph
Hosted By Europe PubMed Central; Journal of Biomedical Semantics; The University of Manchester - Institutional Repository
Publication Date 2018-01-15
Additional Info
Field Value
Country United Kingdom
Language English
Resource Type Other literature type; Article; UNKNOWN
system:type publication
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/publication?articleId=dedup_wf_001::0ce85a4f70e24045f10dc72cd9212cf1
Author jsonws_user
Last Updated 22 December 2020, 16:35 (CET)
Created 22 December 2020, 16:35 (CET)