A tool for generating synthetic authorship records for evaluating author name disambiguation methods.

dc.contributor.authorFerreira, Anderson Almeida
dc.contributor.authorGonçalves, Marcos André
dc.contributor.authorAlmeida, Jussara Marques de
dc.contributor.authorLaender, Alberto Henrique Frade
dc.contributor.authorVeloso, Adriano Alonso
dc.date.accessioned2012-10-22T17:14:31Z
dc.date.available2012-10-22T17:14:31Z
dc.date.issued2012
dc.description.abstractThe author name disambiguation task has to deal with uncertainties related to the possible many-to-many correspondences between ambiguous names and unique authors. Despite the variety of name disambiguation methods available in the literature to solve the problem, most of them are rarely compared against each other. Moreover, they are often evaluated without considering a time evolving digital library, susceptible to dynamic (and therefore challenging) patterns such as the introduction of new authors and the change of research-ers’ interests over time. In order to facilitate the evaluation of name disambiguation meth-ods in various realistic scenarios and under controlled conditions, in this article we propose SyGAR, a new Synthetic Generator of Authorship Records that generates citation records based on author profiles. SyGAR can be used to generate successive loads of citation records simulating a living digital library that evolves according to various publication pat-terns. We validate SyGAR by comparing the results produced by three representative name disambiguation methods on real as well as synthetically generated collections of citation records. We also demonstrate its applicability by evaluating those methods on a time evolving digital library collection generated with the tool, considering several dynamic and realistic scenarios.pt_BR
dc.identifier.citationFERREIRA, A. A. et al. A tool for generating synthetic authorship records for evaluating author name disambiguation methods. Information Sciences, v. 206, p. 42-62, 2012. Disponível em: <https://www.sciencedirect.com/science/article/pii/S0020025512002861>. Acesso em: 22 out. 2012.pt_BR
dc.identifier.issn00200255
dc.identifier.urihttp://www.repositorio.ufop.br/handle/123456789/1728
dc.language.isoen_USpt_BR
dc.rights.licenseO periódico Information Sciences concede permissão para depósito do artigo no Repositório Institucional da UFOP. Número da licença: 3303030527825.
dc.subjectAuthor name disambiguationpt_BR
dc.subjectDigital librarypt_BR
dc.subjectBibliographic citationpt_BR
dc.subjectSynthetic generatorpt_BR
dc.titleA tool for generating synthetic authorship records for evaluating author name disambiguation methods.pt_BR
dc.typeArtigo publicado em periodicopt_BR
Files
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
ARTIGO_ToolGeneratingSynthetic.pdf
Size:
658.69 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: