Unscharfe Suche für Terme geringer Frequenz in einem großen Korpus

Please use this identifier to cite or link to this item: https://repositorium.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-201101107278
Title: Unscharfe Suche für Terme geringer Frequenz in einem großen Korpus
Other Titles: Fuzzy Search for Infrequent Terms in a Large Corpus
Authors: Gerhards, Karl
Thesis advisor: Prof. Dr. Kai-Uwe Kühnberger
Thesis referee: PD Dr. Helmar Gust
Abstract: Until now infrequent terms have been neglected in searching in order to save time and memory. With the help of a cascaded index and the introduced algorithms, such considerations are no longer necessary. A fast and efficient method was developed in order to find all terms in the largest freely available corpus of texts in the German language by exact search, part-word-search and fuzzy search. The process can be extended to include transliterated passages. In addition, documents that contain the term with a modified spelling, can also be found by a fuzzy search. Time and memory requirements are determined and fall considerably below the requests of common search engines.
URL: https://repositorium.ub.uni-osnabrueck.de/handle/urn:nbn:de:gbv:700-201101107278
Subject Keywords: Suche Retrieval Assoziativspeicher; Fuzzy Search Retrieval Corpus Assoziative Memory
Issue Date: 10-Jan-2011
License name: Namensnennung-NichtKommerziell-KeineBearbeitung 3.0 Unported
License url: http://creativecommons.org/licenses/by-nc-nd/3.0/
Appears in Collections:FB08 - E-Dissertationen

Files in This Item:
File Description SizeFormat 
thesis_gerhards.pdfPräsentationsformat2,13 MBAdobe PDFThumbnail
View/Open


This item is licensed under a Creative Commons License Creative Commons