×

A first approach to finding common motifs with gaps. (English) Zbl 1101.68562

Summary: We present three linear algorithms for as many formulations of the problem of finding motifs with gaps. The three versions of the problem are distinct in that they assume different constraints on the size of the gaps. The outline of the algorithm is always the same, although this is adapted each time to the specific problem, while maintaining a linear time complexity with respect to the input size. The approach we suggest is based on a re-writing of the text that uses a new alphabet made of labels representing words of the original input text. The computational complexity of the algorithm allows the use of it also to find long motifs. The algorithm is in fact general enough that it could be applied to several variants of the problem other than those suggested in this paper.

MSC:

68P30 Coding and information theory (compaction, compression, models of communication, encoding schemes, etc.) (aspects in computer science)
68P10 Searching and sorting
68W05 Nonnumerical algorithms
Full Text: DOI

References:

[1] Crochemore M., Text algorithms (1994)
[2] Crawford T., Computing in Musicology 11 pp 73–
[3] Farach M., Foundations of Computer Science (FOCS ’97) pp 137–
[4] Charras C., Handbook of Exact String Matching Algorithms (2004) · Zbl 1230.68001
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.