Linux, Games, Programming, and some other random stuff: Advanced code completion filtering in kate / KDevelop

Wednesday, October 23, 2013

Advanced code completion filtering in kate / KDevelop

I have implemented two new ways to filter the code completion popup in kate: filtering the list by abbreviation, and filtering it by text that does not occur at the beginning of the word. This is probably best demonstrated with lots of pictures:

You can match completion items by their abbreviation. This works for both camel case and underscore notation.
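
To make the idea concrete, here is a minimal Python sketch of abbreviation matching. It is not the code kate uses; the function names (`word_starts`, `abbreviation`, `matches_abbreviation`) and the example identifiers are made up for illustration, and it assumes that a word starts at the first character, at every capital letter, and after every underscore, and that the typed text may be any prefix of the abbreviation.

```python
def word_starts(name):
    """Return the indices at which a new word begins (hypothetical helper)."""
    starts = []
    for i, ch in enumerate(name):
        if ch == "_":
            continue  # underscores separate words but belong to none
        if i == 0 or ch.isupper() or name[i - 1] == "_":
            starts.append(i)
    return starts


def abbreviation(name):
    """Build the lowercase abbreviation from the first letter of each word."""
    return "".join(name[i].lower() for i in word_starts(name))


def matches_abbreviation(typed, name):
    """Assumption: the typed text must be a prefix of the abbreviation."""
    return abbreviation(name).startswith(typed.lower())
```

With these definitions, both `CodeCompletionModel` and `code_completion_model` abbreviate to `ccm`, so typing `cc` or `ccm` would keep either entry in the list.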

You can also match entries by words they merely contain but do not start with. This matching is only allowed at word borders (capitals or underscores). This feature makes it far easier to find that damn class which has an unexpected name prefix, or the m_foo variable you thought was called foo.
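
As a rough illustration of the word-border rule (again a sketch under the same assumptions, not kate's implementation, with a made-up function name), the check below only accepts a match that begins at the first character, at a capital letter, or directly after an underscore:

```python
def matches_at_word_border(typed, name):
    """Return True if `typed` occurs in `name` starting at a word border.

    Assumption: a border is the first character, any capital letter,
    or the character right after an underscore. Case is ignored.
    """
    typed = typed.lower()
    lowered = name.lower()
    for i, ch in enumerate(name):
        if ch == "_":
            continue  # an underscore itself is never the start of a word
        at_border = i == 0 or ch.isupper() or name[i - 1] == "_"
        if at_border and lowered.startswith(typed, i):
            return True
    return False
```

Here `foo` matches `m_foo` and `model` matches `CodeCompletionModel`, while `odel` or `oo` do not, because those would start in the middle of a word.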

The abbreviation expansion engine also allows you to type parts of the words from the abbreviation, making your search more specific in a convenient way.

This feature is not specific to kdev-python; it works in all kate-based apps. It is available in kate's master branch, and will be available in KDE SC >= 4.12.

If you have more suggestions or cases which are not handled well, I'm happy to discuss this further. Have fun hacking!

P.S. If anyone can come up with an efficient algorithm for doing what is depicted in the last image, I'd be interested. The current one is quite slow for some corner cases.

10 comments:

Anonymous, October 23, 2013 at 9:05 AM
Can you tell what the current approach is? Without much thought IMHO it looks like an advanced version of KMP could be applicable...

scummos, October 23, 2013 at 12:41 PM
Currently, all the word borders of the matched string (the one in the completion list) are computed first. Then we walk through the typed string (the one in the document) and compare it letter by letter with the matched string: each letter must match either the next letter in the current word or the first letter of the next word. If it matches both, the function recursively calls itself to decide which one should be used, i.e. it calls itself with the remaining pieces of both strings to check whether there would be a match if the first letter of the next word was used; if there is no match, it continues with the next letter of the current word. That's very slow if there are lots of equal letters, but I don't see a different way.
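
For readers who prefer code, the description above can be sketched roughly as follows in Python. This is only an illustration of the idea, not the code in kate: the names `split_words`, `_match`, and `matches_with_parts` are made up, and it assumes that matching has to start at the first word and that the typed text does not need to cover every word.

```python
def split_words(name):
    """Split an identifier into lowercase words at capitals and underscores."""
    words = []
    for i, ch in enumerate(name):
        if ch == "_":
            continue
        if i == 0 or ch.isupper() or name[i - 1] == "_":
            words.append(ch.lower())   # a new word starts here
        else:
            words[-1] += ch.lower()    # continue the current word
    return words


def _match(typed, words, w, p):
    """w: index of the current word (-1 before the first match),
    p: index of the next unmatched letter inside that word."""
    if not typed:
        return True  # every typed letter was matched
    ch = typed[0]
    fits_current = w >= 0 and p < len(words[w]) and words[w][p] == ch
    fits_next = w + 1 < len(words) and words[w + 1][0] == ch
    # Try jumping to the next word first (this is the recursive branch)...
    if fits_next and _match(typed[1:], words, w + 1, 1):
        return True
    # ...and fall back to the next letter of the current word.
    if fits_current:
        return _match(typed[1:], words, w, p + 1)
    return False


def matches_with_parts(typed, name):
    return _match(typed.lower(), split_words(name), -1, 0)
```

With this sketch, `ccm`, `cocomo`, and `codecm` all match `CodeCompletionModel`. The slow cases are the ones where many letters fit both branches, because each such letter can double the number of paths that are tried.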

Francesco R., October 23, 2013 at 5:28 PM
wow

Anonymous, October 23, 2013 at 6:02 PM
QtCreator supports the same (when entering the letters directly in uppercase).
Maybe check what they are doing.

Anonymous, October 23, 2013 at 10:14 PM
This is not really an elegant solution, but it's interesting to note that you can reverse the search. So in the example you would start with "Matcher" and try to match (sorry, I could not resist) the longest possible string to the start of the word. It would work like this:
"t" does not equal "M"
"at" does not equal "Ma"
"mat" does equal "Mat"
"rmat" does not equal "Matc"
"rrmat" does not equal "Match"
stop, as "arrmat" would leave only two more characters, but there are three unmatched tokens left and each token needs at least one matching character.
⇒ The only possible combination is "mat".

So you could proceed from front to back and, if there are multiple possibilities, switch to processing from back to front, hoping that everything is unique this way. In general, I'd bounce on the remaining string and the remaining tokens, always take the way with fewer combinations, and use the trick outlined above to avoid trying matches which will definitely lead to a dead end later on.

In case of multiple possible combinations I would also try to get through as fast as possible before trying other combinations, so when in doubt, match as much as possible (greedy) and first follow one token to the end instead of building up the whole tree of combinations. This should be faster because sometimes there are multiple solutions and it does not matter which one matches, so on average it should be better to have a smaller subproblem left.

This might not really reduce the algorithm's worst-case complexity, but it should improve the complexity of some cases.

Prometheus, October 25, 2013 at 3:25 PM
Maybe the following (via Planet Gnome) is a good approach:
http://browse.feedreader.com/c/Planet_GNOME/627062185

stativ, November 20, 2013 at 11:38 AM
This is absolutely amazing! I've always been missing this feature in KDevelop and thanks to you it's everywhere, not just KDevelop.