PIDapalooza 2018 has ended

Welcome to PIDapalooza 2018...where anything goes...as long as it goes on forever.   

Wednesday, January 24 • 1:15pm - 2:00pm
The (ID) equivalency is in the eye of the beholder

Sign up or log in to save this to your schedule and see who's attending!

Determining identifier equivalency is key to data integration and to realizing the scientific discoveries that can only be made by collating our vast disconnected data stores.
There are two key problems in determining equivalency - conceptual and syntactic alignment. Conceptual alignment often relies on Xrefs and string-matching against synonyms. There is indeed a better way! Algorithmic determination of identifier equivalency across different sources can use a combination of Xrefs, priors rules, existing semantic relations, and synonyms to create equivalency cliques than can highlight the discrepancies in conceptual definitions for manual review. This is especially useful for data sources annotated with concept drift and differences, such as diseases. Syntactic issues are that there are so many variations of the same identifier, making data joins difficult. We present a framework to reconcile and provide authoritative and integration-ready prefixed identifiers (CURIES), to capture and consolidate prefixes and to build links across key resource registries. The combination of JSON-LD context technology with a prefix metadata repository provides the basis for the infrastructure to handle identifiers in a consistent fashion. Finally, this architecture also allows resources to be self describing "beacons" with respect to their identifiers.

avatar for Melissa Haendel

Melissa Haendel

Associate Professor, OHSU
Associate Professor Interim co-director, Oregon Health & Science University Library

Wednesday January 24, 2018 1:15pm - 2:00pm
Main Stage