How to represent and detect contextual identity links in a knowledge base: Application on experimental data in life sciences

Joe Raad, Nathalie Pernelle, Fatiha Saïs, Juliette Dibie, Liliana Ibanescu, Stéphane Dervaux

UMR MIA-PARIS, INRA, AgroParisTech, Université Paris-Saclay, Paris, France

LRI, Paris Sud University, Orsay, France

Corresponding Author Email: {joe.raad,juliette.dibie,liliana.ibanescu,stephane.dervaux}@agroparistech.fr; {nathalie.pernelle,fatiha.sais}@lri.fr

Page: 345-372

DOI: https://doi.org/10.3166/RIA.32.345-372

OPEN ACCESS

Abstract: 

Most Linked Data applications currently rely on owl:sameAs to link ontology instances. However, several studies have reported multiple misuses of this identity link, which can lead to erroneous statements or inconsistencies. In this paper, we propose a new contextual identity link that could replace owl:sameAs when linking instances that are identical in a specified context. To detect these contextual links, we define an algorithm named DECIDE, which has been tested on scientific knowledge bases from several INRA projects.

Keywords: 

context, identity links, knowledge base, scientific data

1. Introduction
2. Contextual identity
3. DECIDE: a method for detecting contextual identity links
4. Experiments
5. Related work
6. Conclusion
Acknowledgements
References
