Zur Kurzanzeige

[Zeitschriftenartikel]

dc.contributor.authorHeller, Markusde
dc.date.accessioned2009-02-04T17:00:00Zde
dc.date.accessioned2012-08-29T23:06:42Z
dc.date.available2012-08-29T23:06:42Z
dc.date.issued2006de
dc.identifier.issn0172-6404
dc.identifier.urihttp://www.ssoar.info/ssoar/handle/document/4998
dc.description.abstractHistorical documents have specific properties which make life hard for traditional information retrieval techniques. The missing notion of orthography and a general high degree of variation in the phonetic-graphemic representation, as well as in derivational morphology obstruct the possibility to find documents upon the entry of a modern word as the search term. The following paper gives an overview of existing string approximation technologies as used in bioinformatics, but also of phonetic approximation algorithms. It proposes an architecture of combining both notions, while using Jörg Michael’ phonet program to deduct from graphemes to a phonetic representation and a levenshtein automaton to allow for fast approximative matching. The final part of the paper evaluates the suitability of the approach, while using the levenshtein algorithm in its non-automaton-based implementation.en
dc.languagedede
dc.subject.ddcNews media, journalism, publishingen
dc.subject.ddcPublizistische Medien, Journalismus,Verlagswesende
dc.titleApproximative Indexierungstechnik für historische deutsche Textvariantende
dc.description.reviewbegutachtet (peer reviewed)de
dc.description.reviewpeer revieweden
dc.source.journalHistorical Social Researchde
dc.source.volume31de
dc.publisher.countryDEU
dc.source.issue3de
dc.subject.classozInformation Management, Information Processes, Information Economicsen
dc.subject.classozInformationsmanagement, informationelle Prozesse, Informationsökonomiede
dc.subject.thesozinformation retrievalen
dc.subject.thesozPhonetikde
dc.subject.thesozDatendokumentationde
dc.subject.thesozinformation retrievalde
dc.subject.thesozphoneticsen
dc.subject.thesozTextde
dc.subject.thesoztexten
dc.subject.thesozdata documentationen
dc.identifier.urnurn:nbn:de:0168-ssoar-49988de
dc.date.modified2009-03-05T12:28:00Zde
dc.rights.licenceCreative Commons - Attribution 4.0en
dc.rights.licenceCreative Commons - Namensnennung 4.0de
ssoar.gesis.collectionSOLIS;ADISde
ssoar.contributor.institutionGESISde
internal.status3de
internal.identifier.thesoz10060183
internal.identifier.thesoz10040537
internal.identifier.thesoz10054532
internal.identifier.thesoz10047326
dc.type.stockarticlede
dc.type.documentjournal articleen
dc.type.documentZeitschriftenartikelde
dc.rights.copyrightfde
dc.source.pageinfo288-307
internal.identifier.classoz1080502
internal.identifier.journal152de
internal.identifier.document32
internal.identifier.ddc070
dc.identifier.doihttps://doi.org/10.12759/hsr.31.2006.3.288-307
dc.description.pubstatusPublished Versionen
dc.description.pubstatusVeröffentlichungsversionde
internal.identifier.licence16
internal.identifier.pubstatus1
internal.identifier.review1
internal.check.abstractlanguageharmonizerCERTAIN
internal.check.languageharmonizerCERTAIN_RETAINED


Dateien zu dieser Ressource

Thumbnail

Das Dokument erscheint in:

Zur Kurzanzeige