Centrum für Informations- und Sprachverarbeitung

BAS-CIS team wins international String Similarity Search/Join competition



A joint team of the Bulgarian Academy of Sciences (BAS) and the Center for Information and Language Processing (CIS) has dominated a competition for fast approximate search in very large string databases. Compared with other state-of-the-art approaches, the BAS-CIS team delivered the fastest search results. The competition was held as a workshop at EDBT 2013 (16th International Conference on Extending Database Technology).

(In computational linguistics, string databases represent lexicons, phrase collections, and sentence or text collections; in biology, they represent genome sequences. Fast approximate search aims to select the complete set of strings "similar" to an input string. The definition of "similar" depends on the application.)
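
To make the task concrete, the following is a minimal sketch of approximate search by brute force. It assumes that "similar" means a Levenshtein (edit) distance of at most a given threshold, which is only one possible definition. The function names `edit_distance` and `approximate_search` and the sample lexicon are illustrative assumptions; this is a naive baseline, not the algorithm the BAS-CIS team used.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute ca by cb (or match)
            ))
        prev = curr
    return prev[-1]


def approximate_search(query: str, database: list[str], max_distance: int) -> list[str]:
    """Return every string in the database within max_distance of the query.

    Assumption: similarity is edit distance; other applications may use
    a different measure.
    """
    return [s for s in database if edit_distance(query, s) <= max_distance]


# Hypothetical toy lexicon, used only for illustration.
lexicon = ["search", "starch", "march", "seat", "research"]
print(approximate_search("serch", lexicon, 1))  # ['search']
print(approximate_search("serch", lexicon, 2))  # ['search', 'starch', 'march']
```

This sketch compares the query against every entry, so its cost grows linearly with the size of the database. Avoiding that full scan is the point of fast approximate search in very large string databases.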

For further information:



S. Gerdjikov; S. Mihov; P. Mitankin; K.U. Schulz (2013), Good parts first - a new algorithm for approximate search in lexica and string databases. ArXiv e-prints, Jan. 2013.