Frequent patterns for improving categorization in semantic wiki

Yaya Traore, Cheikh Talibouya Diop, Fatou Kamara-Sangare, Sadouanouan Malo, Moussa Lo, Stanislas Ouaro

Université Gaston Berger de Saint-Louis, Saint-Louis, BP 234, Sénégal

Université Ouaga 1 Pr Joseph Ki-Zerbo, Ouagadougou, BP 7021, Burkina Faso

Université Polytechnique de Bobo Dioulasso, Bobo-Dioulasso, BP 1091, Burkina Faso

Corresponding Author Email: {cheikh-talibouya.diop, fatou.kamara, moussa.lo}@ugb.edu.sn, {yaytra, ouaro}@yahoo.fr, sadouanouan@yahoo.fr
Page: 83-106
DOI: https://doi.org/10.3166/ISI.21.5-6.83-106
Abstract: 

Semantic wikis allow the users of a community to collaborate in creating and sharing knowledge. Wiki pages are semantically annotated, and tags (keywords) can be freely associated with them. Categories, which are created by experts, organize the links between pages in the wiki. The data stored in a semantic wiki can therefore be extracted, and the resulting knowledge patterns reused to improve the organization of the wiki. In this paper, we propose a method that extracts useful frequent patterns of tags from the wiki in order to guide the discovery of new categories and improve categorization. We use an ontology external to the wiki to guide the experts in creating these new categories. Pages annotated with these new categories are then categorized. The originality of our approach lies in building an extraction context from the content of the wiki and extracting knowledge units from that content. This makes it possible to improve the category hierarchy and semantic search by category. Experiments on an annotated semantic wiki show substantial results.

Keywords: 

semantic wiki, frequent pattern, ontology, categorization
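
The abstract describes building an extraction context from wiki content and mining frequent patterns of tags. As a rough illustration only, the sketch below treats each page as a transaction of tags and enumerates frequent tag sets level by level (a simple Apriori-style search). It is not the authors' algorithm: the page names, the tags, the `frequent_tag_sets` function and the `min_support` threshold are all assumptions made for this example.

```python
from itertools import combinations

# Hypothetical extraction context: each wiki page mapped to its set of tags.
# Page names and tags are invented for illustration only.
context = {
    "Page_A": {"malaria", "fever", "treatment"},
    "Page_B": {"malaria", "fever"},
    "Page_C": {"malaria", "treatment"},
    "Page_D": {"fever", "cough"},
}


def frequent_tag_sets(context, min_support):
    """Return {frozenset(tags): support} for tag sets found on at least
    min_support pages, using a simple level-wise (Apriori-style) search."""
    transactions = list(context.values())
    # Level 1: every individual tag is a candidate.
    candidates = {frozenset([t]) for tags in transactions for t in tags}
    frequent = {}
    size = 1
    while candidates:
        # Support = number of pages whose tag set contains the candidate.
        counts = {
            c: sum(1 for tags in transactions if c <= tags) for c in candidates
        }
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        size += 1
        # Join step: merge frequent sets of this level into sets one tag larger.
        candidates = {
            a | b for a, b in combinations(level, 2) if len(a | b) == size
        }
    return frequent


patterns = frequent_tag_sets(context, min_support=2)
for tags, support in sorted(
    patterns.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))
):
    print(sorted(tags), support)
```

With this toy context and a threshold of two pages, tag pairs such as malaria and fever are retained, while rarer combinations are discarded. In the setting the abstract describes, retained tag sets of this kind would be the candidates shown to experts, who decide with the help of the external ontology whether a new category is warranted.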

1. Introduction
2. Preliminaries and problem statement
3. Related work
4. Proposed approach and methodology
5. Experiments
6. Conclusion
Acknowledgements