An efficient similarity-based approach for comparing XML documents.

dc.contributor.authorOliveira, Alessandreia Marta de
dc.contributor.authorTessarolli, Gabriel Piton
dc.contributor.authorMenezes, Gleiph Ghiotto Lima de
dc.contributor.authorPinto, Bruno
dc.contributor.authorCampello, Fernando
dc.contributor.authorMarques, Matheus
dc.contributor.authorOliveira, Carlos
dc.contributor.authorRodrigues, Igor
dc.contributor.authorKalinowski, Marcos
dc.contributor.authorSouza, Uéverton dos Santos
dc.contributor.authorMurta, Leonardo Gresta Paulino
dc.contributor.authorMurta, Vanessa Braganholo
dc.date.accessioned2019-04-08T14:15:35Z
dc.date.available2019-04-08T14:15:35Z
dc.date.issued2018
dc.description.abstractXML documents are widely used to interchange information among heterogeneous systems, ranging from office applications to scientific experiments. Independently of the domain, XML documents may evolve, so identifying and understanding the changes they undergo becomes crucial. Some syntactic diffapproaches have been proposed to address this problem. They are mainly designed to compare revisions of XML doc- uments using explicit IDs to match elements. However, elements in different revisions may not share IDs due to tool incompatibility or even divergent or missing schemas. In this paper, we present Phoenix, a similarity-based approach for comparing revisions of XML documents that does not rely on explicit IDs. Phoenix uses dynamic programming and optimization algorithms to compare different features (e.g., ele- ment name, content, attributes, and sub-elements) of XML documents and calculate the similarity degree between them. We compared Phoenix with X-Diffand XyDiff, two state-of-the-art XML diffalgorithms. XyDiffwas the fastest approach but failed in providing precise matching results. X-Diffpresented higher efficacy in 30 of the 56 scenarios but was slow. Phoenix executed in a fraction of the running time re- quired by X-Diffand achieved the best results in terms of efficacy in 26 of 56 tested scenarios. In our evaluations, Phoenix was by far the most efficient approach to match elements across revisions of the same XML document.pt_BR
dc.identifier.citationOLIVEIRA, A. M. de. et al. An efficient similarity-based approach for comparing XML documents. Information Systems, v. 78, p. 40-57, 2018. Disponível em: <https://www.sciencedirect.com/science/article/pii/S0306437916304926>. Acesso em: 15 fev. 2019.pt_BR
dc.identifier.issn03064379
dc.identifier.urihttp://www.repositorio.ufop.br/handle/123456789/10961
dc.identifier.uri2https://www.sciencedirect.com/science/article/pii/S0306437916304926pt_BR
dc.language.isoen_USpt_BR
dc.rightsrestritopt_BR
dc.subjectDiffpt_BR
dc.subjectMatchpt_BR
dc.titleAn efficient similarity-based approach for comparing XML documents.pt_BR
dc.typeArtigo publicado em periodicopt_BR
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
ARTIGO_EfficientSimilarityBased.pdf
Size:
3.74 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
924 B
Format:
Item-specific license agreed upon to submission
Description: