- AutorIn
- Jochen Tiepmar
- Titel
- Release of the MySQL based implementation of the CTS protocol
- Zitierfähige Url:
- https://nbn-resolving.org/urn:nbn:de:bsz:15-qucosa-201773
- Quellenangabe
- Altertumswissenschaften in a Digital Age : Egyptology, Papyrology and beyond ; proceedings of a conference and workshop in Leipzig, November 4-6, 2015 / edited by Monica Berti and Franziska Naether. Leipzig, 2016. Beitrag 7
- Quellenangabe
- Altertumswissenschaften in a Digital Age
- Erstveröffentlichung
- 2016
- Abstract (EN)
- In a project called "A Library of a Billion Words" we needed an implementation of the CTS protocol that is capable of handling a text collection containing at least 1 billion words. Because the existing solutions did not work for this scale or were still in development I started an implementation of the CTS protocol using methods that MySQL provides. Last year we published a paper that introduced a prototype with the core functionalities without being compliant with the specifications of CTS (Tiepmar et al., 2013). The purpose of this paper is to describe and evaluate the MySQL based implementa-tion now that it is fulfilling the specifications version 5.0 rc.1 and mark it as finished and ready to use. Fur-ther information, online instances of CTS for all de-scribed datasets and binaries can be accessed via the projects website1. Reference Tiepmar J, Teichmann C, Heyer G, Berti M and Crane G. 2013. A new Implementation for Canonical Text Services. in Proceedings of the 8th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTeCH).
- Freie Schlagwörter (DE)
- Canonical Text Service, Text-Mining, Text-Repositorien, Textanalyse, Dokument-Repositorien
- Freie Schlagwörter (EN)
- Canonical Text Service, text mining, text repository, text analysis, document repository
- Klassifikation (DDC)
- 930
- Herausgeber (Institution)
- Universität Leipzig
- URN Qucosa
- urn:nbn:de:bsz:15-qucosa-201773
- Veröffentlichungsdatum Qucosa
- 20.04.2016
- Dokumenttyp
- Konferenzbeitrag
- Sprache des Dokumentes
- Englisch