matchit.txt 19 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406
  1. *matchit.txt* Extended "%" matching
  2. For instructions on installing this file, type
  3. :help matchit-install
  4. inside Vim.
  5. For Vim version 6.3. Last change: 2007 Aug 29
  6. VIM REFERENCE MANUAL by Benji Fisher
  7. *matchit* *matchit.vim*
  8. 1. Extended matching with "%" |matchit-intro|
  9. 2. Activation |matchit-activate|
  10. 3. Configuration |matchit-configure|
  11. 4. Supporting a New Language |matchit-newlang|
  12. 5. Known Bugs and Limitations |matchit-bugs|
  13. The functionality mentioned here is a plugin, see |add-plugin|.
  14. This plugin is only available if 'compatible' is not set.
  15. You can avoid loading this plugin by setting the "loaded_matchit" variable
  16. in your |vimrc| file: >
  17. :let loaded_matchit = 1
  18. {Vi does not have any of this}
  19. ==============================================================================
  20. 1. Extended matching with "%" *matchit-intro*
  21. *matchit-%*
  22. % Cycle forward through matching groups, such as "if", "else", "endif",
  23. as specified by |b:match_words|.
  24. *g%* *v_g%* *o_g%*
  25. g% Cycle backwards through matching groups, as specified by
  26. |b:match_words|. For example, go from "if" to "endif" to "else".
  27. *[%* *v_[%* *o_[%*
  28. [% Go to [count] previous unmatched group, as specified by
  29. |b:match_words|. Similar to |[{|.
  30. *]%* *v_]%* *o_]%*
  31. ]% Go to [count] next unmatched group, as specified by
  32. |b:match_words|. Similar to |]}|.
  33. *v_a%*
  34. a% In Visual mode, select the matching group, as specified by
  35. |b:match_words|, containing the cursor. Similar to |v_a[|.
  36. A [count] is ignored, and only the first character of the closing
  37. pattern is selected.
  38. In Vim, as in plain vi, the percent key, |%|, jumps the cursor from a brace,
  39. bracket, or paren to its match. This can be configured with the 'matchpairs'
  40. option. The matchit plugin extends this in several ways:
  41. You can match whole words, such as "if" and "endif", not just
  42. single characters. You can also specify a |regular-expression|.
  43. You can define groups with more than two words, such as "if",
  44. "else", "endif". Banging on the "%" key will cycle from the "if" to
  45. the first "else", the next "else", ..., the closing "endif", and back
  46. to the opening "if". Nested structures are skipped. Using |g%| goes
  47. in the reverse direction.
  48. By default, words inside comments and strings are ignored, unless
  49. the cursor is inside a comment or string when you type "%". If the
  50. only thing you want to do is modify the behavior of "%" so that it
  51. behaves this way, you do not have to define |b:match_words|, since the
  52. script uses the 'matchpairs' option as well as this variable.
  53. See |matchit-details| for details on what the script does, and |b:match_words|
  54. for how to specify matching patterns.
  55. MODES: *matchit-modes* *matchit-v_%* *matchit-o_%*
  56. Mostly, % and related motions (|g%| and |[%| and |]%|) work just like built-in
  57. |motion| commands in |Operator-pending| and |Visual| modes. However, you
  58. cannot make these motions |linewise| or |characterwise|, since the |:omap|s
  59. that define them start with "v" in order to make the default behavior
  60. inclusive. (See |o_v|.) In other words, "dV%" will not work. The
  61. work-around is to go through Visual mode: "V%d" will work.
  62. LANGUAGES: *matchit-languages*
  63. Currently, the following languages are supported: Ada, ASP with VBS, Csh,
  64. DTD, Entity, Essbase, Fortran, HTML, JSP (same as HTML), LaTeX, Lua, Pascal,
  65. SGML, Shell, Tcsh, Vim, XML. Other languages may already have support via
  66. the default |filetype-plugin|s in the standard vim distribution.
  67. To support a new language, see |matchit-newlang| below.
  68. DETAILS: *matchit-details* *matchit-parse*
  69. Here is an outline of what matchit.vim does each time you hit the "%" key. If
  70. there are |backref|s in |b:match_words| then the first step is to produce a
  71. version in which these back references have been eliminated; if there are no
  72. |backref|s then this step is skipped. This step is called parsing. For
  73. example, "\(foo\|bar\):end\1" is parsed to yield
  74. "\(foo\|bar\):end\(foo\|bar\)". This can get tricky, especially if there are
  75. nested groups. If debugging is turned on, the parsed version is saved as
  76. |b:match_pat|.
  77. *matchit-choose*
  78. Next, the script looks for a word on the current line that matches the pattern
  79. just constructed. It includes the patterns from the 'matchpairs' option.
  80. The goal is to do what you expect, which turns out to be a little complicated.
  81. The script follows these rules:
  82. Insist on a match that ends on or after the cursor.
  83. Prefer a match that includes the cursor position (that is, one that
  84. starts on or before the cursor).
  85. Prefer a match that starts as close to the cursor as possible.
  86. If more than one pattern in |b:match_words| matches, choose the one
  87. that is listed first.
  88. Examples:
  89. Suppose you >
  90. :let b:match_words = '<:>,<tag>:</tag>'
  91. < and hit "%" with the cursor on or before the "<" in "a <tag> is born".
  92. The pattern '<' comes first, so it is preferred over '<tag>', which
  93. also matches. If the cursor is on the "t", however, then '<tag>' is
  94. preferred, because this matches a bit of text containing the cursor.
  95. If the two groups of patterns were reversed then '<' would never be
  96. preferred.
  97. Suppose you >
  98. :let b:match_words = 'if:end if'
  99. < (Note the space!) and hit "%" with the cursor at the end of "end if".
  100. Then "if" matches, which is probably not what you want, but if the
  101. cursor starts on the "end " then "end if" is chosen. (You can avoid
  102. this problem by using a more complicated pattern.)
  103. If there is no match, the cursor does not move. (Before version 1.13 of the
  104. script, it would fall back on the usual behavior of |%|). If debugging is
  105. turned on, the matched bit of text is saved as |b:match_match| and the cursor
  106. column of the start of the match is saved as |b:match_col|.
  107. Next, the script looks through |b:match_words| (original and parsed versions)
  108. for the group and pattern that match. If debugging is turned on, the group is
  109. saved as |b:match_ini| (the first pattern) and |b:match_tail| (the rest). If
  110. there are |backref|s then, in addition, the matching pattern is saved as
  111. |b:match_word| and a table of translations is saved as |b:match_table|. If
  112. there are |backref|s, these are determined from the matching pattern and
  113. |b:match_match| and substituted into each pattern in the matching group.
  114. The script decides whether to search forwards or backwards and chooses
  115. arguments for the |searchpair()| function. Then, the cursor is moved to the
  116. start of the match, and |searchpair()| is called. By default, matching
  117. structures inside strings and comments are ignored. This can be changed by
  118. setting |b:match_skip|.
  119. ==============================================================================
  120. 2. Activation *matchit-activate*
  121. You can use this script as a plugin, by copying it to your plugin directory.
  122. See |add-global-plugin| for instructions. You can also add a line to your
  123. |vimrc| file, such as >
  124. :source $VIMRUNTIME/macros/matchit.vim
  125. or >
  126. :runtime macros/matchit.vim
  127. Either way, the script should start working the next time you start up Vim.
  128. (Earlier versions of the script did nothing unless a |buffer-variable| named
  129. |b:match_words| was defined. Even earlier versions contained autocommands
  130. that set this variable for various file types. Now, |b:match_words| is
  131. defined in many of the default |filetype-plugin|s instead.)
  132. For a new language, you can add autocommands to the script or to your vimrc
  133. file, but the recommended method is to add a line such as >
  134. let b:match_words = '\<foo\>:\<bar\>'
  135. to the |filetype-plugin| for your language. See |b:match_words| below for how
  136. this variable is interpreted.
  137. TROUBLESHOOTING *matchit-troubleshoot*
  138. The script should work in most installations of Vim. It may not work if Vim
  139. was compiled with a minimal feature set, for example if the |+syntax| option
  140. was not enabled. If your Vim has support for syntax compiled in, but you do
  141. not have |syntax| highlighting turned on, matchit.vim should work, but it may
  142. fail to skip matching groups in comments and strings. If the |filetype|
  143. mechanism is turned off, the |b:match_words| variable will probably not be
  144. defined automatically.
  145. ==============================================================================
  146. 3. Configuration *matchit-configure*
  147. There are several variables that govern the behavior of matchit.vim. Note
  148. that these are variables local to the buffer, not options, so use |:let| to
  149. define them, not |:set|. Some of these variables have values that matter; for
  150. others, it only matters whether the variable has been defined. All of these
  151. can be defined in the |filetype-plugin| or autocommand that defines
  152. |b:match_words| or "on the fly."
  153. The main variable is |b:match_words|. It is described in the section below on
  154. supporting a new language.
  155. *MatchError* *matchit-hl* *matchit-highlight*
  156. MatchError is the highlight group for error messages from the script. By
  157. default, it is linked to WarningMsg. If you do not want to be bothered by
  158. error messages, you can define this to be something invisible. For example,
  159. if you use the GUI version of Vim and your command line is normally white, you
  160. can do >
  161. :hi MatchError guifg=white guibg=white
  162. <
  163. *b:match_ignorecase*
  164. If you >
  165. :let b:match_ignorecase = 1
  166. then matchit.vim acts as if 'ignorecase' is set: for example, "end" and "END"
  167. are equivalent. If you >
  168. :let b:match_ignorecase = 0
  169. then matchit.vim treats "end" and "END" differently. (There will be no
  170. b:match_infercase option unless someone requests it.)
  171. *b:match_debug*
  172. Define b:match_debug if you want debugging information to be saved. See
  173. |matchit-debug|, below.
  174. *b:match_skip*
  175. If b:match_skip is defined, it is passed as the skip argument to
  176. |searchpair()|. This controls when matching structures are skipped, or
  177. ignored. By default, they are ignored inside comments and strings, as
  178. determined by the |syntax| mechanism. (If syntax highlighting is turned off,
  179. nothing is skipped.) You can set b:match_skip to a string, which evaluates to
  180. a non-zero, numerical value if the match is to be skipped or zero if the match
  181. should not be skipped. In addition, the following special values are
  182. supported by matchit.vim:
  183. s:foo becomes (current syntax item) =~ foo
  184. S:foo becomes (current syntax item) !~ foo
  185. r:foo becomes (line before cursor) =~ foo
  186. R:foo becomes (line before cursor) !~ foo
  187. (The "s" is meant to suggest "syntax", and the "r" is meant to suggest
  188. "regular expression".)
  189. Examples:
  190. You can get the default behavior with >
  191. :let b:match_skip = 's:comment\|string'
  192. <
  193. If you want to skip matching structures unless they are at the start
  194. of the line (ignoring whitespace) then you can >
  195. :let b:match_skip = 'R:^\s*'
  196. < Do not do this if strings or comments can span several lines, since
  197. the normal syntax checking will not be done if you set b:match_skip.
  198. In LaTeX, since "%" is used as the comment character, you can >
  199. :let b:match_skip = 'r:%'
  200. < Unfortunately, this will skip anything after "\%", an escaped "%". To
  201. allow for this, and also "\\%" (an excaped backslash followed by the
  202. comment character) you can >
  203. :let b:match_skip = 'r:\(^\|[^\\]\)\(\\\\\)*%'
  204. <
  205. See the $VIMRUNTIME/ftplugin/vim.vim for an example that uses both
  206. syntax and a regular expression.
  207. ==============================================================================
  208. 4. Supporting a New Language *matchit-newlang*
  209. *b:match_words*
  210. In order for matchit.vim to support a new language, you must define a suitable
  211. pattern for |b:match_words|. You may also want to set some of the
  212. |matchit-configure| variables, as described above. If your language has a
  213. complicated syntax, or many keywords, you will need to know something about
  214. Vim's |regular-expression|s.
  215. The format for |b:match_words| is similar to that of the 'matchpairs' option:
  216. it is a comma (,)-separated list of groups; each group is a colon(:)-separated
  217. list of patterns (regular expressions). Commas and backslashes that are part
  218. of a pattern should be escaped with backslashes ('\:' and '\,'). It is OK to
  219. have only one group; the effect is undefined if a group has only one pattern.
  220. A simple example is >
  221. :let b:match_words = '\<if\>:\<endif\>,'
  222. \ . '\<while\>:\<continue\>:\<break\>:\<endwhile\>'
  223. (In Vim regular expressions, |\<| and |\>| denote word boundaries. Thus "if"
  224. matches the end of "endif" but "\<if\>" does not.) Then banging on the "%"
  225. key will bounce the cursor between "if" and the matching "endif"; and from
  226. "while" to any matching "continue" or "break", then to the matching "endwhile"
  227. and back to the "while". It is almost always easier to use |literal-string|s
  228. (single quotes) as above: '\<if\>' rather than "\\<if\\>" and so on.
  229. Exception: If the ":" character does not appear in b:match_words, then it is
  230. treated as an expression to be evaluated. For example, >
  231. :let b:match_words = 'GetMatchWords()'
  232. allows you to define a function. This can return a different string depending
  233. on the current syntax, for example.
  234. Once you have defined the appropriate value of |b:match_words|, you will
  235. probably want to have this set automatically each time you edit the
  236. appropriate file type. The recommended way to do this is by adding the
  237. definition to a |filetype-plugin| file.
  238. Tips: Be careful that your initial pattern does not match your final pattern.
  239. See the example above for the use of word-boundary expressions. It is usually
  240. better to use ".\{-}" (as many as necessary) instead of ".*" (as many as
  241. possible). See |\{-|. For example, in the string "<tag>label</tag>", "<.*>"
  242. matches the whole string whereas "<.\{-}>" and "<[^>]*>" match "<tag>" and
  243. "</tag>".
  244. *matchit-spaces* *matchit-s:notend*
  245. If "if" is to be paired with "end if" (Note the space!) then word boundaries
  246. are not enough. Instead, define a regular expression s:notend that will match
  247. anything but "end" and use it as follows: >
  248. :let s:notend = '\%(\<end\s\+\)\@<!'
  249. :let b:match_words = s:notend . '\<if\>:\<end\s\+if\>'
  250. < *matchit-s:sol*
  251. This is a simplified version of what is done for Ada. The s:notend is a
  252. |script-variable|. Similarly, you may want to define a start-of-line regular
  253. expression >
  254. :let s:sol = '\%(^\|;\)\s*'
  255. if keywords are only recognized after the start of a line or after a
  256. semicolon (;), with optional white space.
  257. *matchit-backref* *matchit-\1*
  258. In any group, the expressions |\1|, |\2|, ..., |\9| refer to parts of the
  259. INITIAL pattern enclosed in |\(|escaped parentheses|\)|. These are referred
  260. to as back references, or backrefs. For example, >
  261. :let b:match_words = '\<b\(o\+\)\>:\(h\)\1\>'
  262. means that "bo" pairs with "ho" and "boo" pairs with "hoo" and so on. Note
  263. that "\1" does not refer to the "\(h\)" in this example. If you have
  264. "\(nested \(parentheses\)\) then "\d" refers to the d-th "\(" and everything
  265. up to and including the matching "\)": in "\(nested\(parentheses\)\)", "\1"
  266. refers to everything and "\2" refers to "\(parentheses\)". If you use a
  267. variable such as |s:notend| or |s:sol| in the previous paragraph then remember
  268. to count any "\(" patterns in this variable. You do not have to count groups
  269. defined by |\%(\)|.
  270. It should be possible to resolve back references from any pattern in the
  271. group. For example, >
  272. :let b:match_words = '\(foo\)\(bar\):more\1:and\2:end\1\2'
  273. would not work because "\2" cannot be determined from "morefoo" and "\1"
  274. cannot be determined from "andbar". On the other hand, >
  275. :let b:match_words = '\(\(foo\)\(bar\)\):\3\2:end\1'
  276. should work (and have the same effect as "foobar:barfoo:endfoobar"), although
  277. this has not been thoroughly tested.
  278. You can use |zero-width| patterns such as |\@<=| and |\zs|. (The latter has
  279. not been thouroughly tested in matchit.vim.) For example, if the keyword "if"
  280. must occur at the start of the line, with optional white space, you might use
  281. the pattern "\(^\s*\)\@<=if" so that the cursor will end on the "i" instead of
  282. at the start of the line. For another example, if HTML had only one tag then
  283. one could >
  284. :let b:match_words = '<:>,<\@<=tag>:<\@<=/tag>'
  285. so that "%" can bounce between matching "<" and ">" pairs or (starting on
  286. "tag" or "/tag") between matching tags. Without the |\@<=|, the script would
  287. bounce from "tag" to the "<" in "</tag>", and another "%" would not take you
  288. back to where you started.
  289. DEBUGGING *matchit-debug* *:MatchDebug*
  290. If you are having trouble figuring out the appropriate definition of
  291. |b:match_words| then you can take advantage of the same information I use when
  292. debugging the script. This is especially true if you are not sure whether
  293. your patterns or my script are at fault! To make this more convenient, I have
  294. made the command :MatchDebug, which defines the variable |b:match_debug| and
  295. creates a Matchit menu. This menu makes it convenient to check the values of
  296. the variables described below. You will probably also want to read
  297. |matchit-details| above.
  298. Defining the variable |b:match_debug| causes the script to set the following
  299. variables, each time you hit the "%" key. Several of these are only defined
  300. if |b:match_words| includes |backref|s.
  301. *b:match_pat*
  302. The b:match_pat variable is set to |b:match_words| with |backref|s parsed.
  303. *b:match_match*
  304. The b:match_match variable is set to the bit of text that is recognized as a
  305. match.
  306. *b:match_col*
  307. The b:match_col variable is set to the cursor column of the start of the
  308. matching text.
  309. *b:match_wholeBR*
  310. The b:match_wholeBR variable is set to the comma-separated group of patterns
  311. that matches, with |backref|s unparsed.
  312. *b:match_iniBR*
  313. The b:match_iniBR variable is set to the first pattern in |b:match_wholeBR|.
  314. *b:match_ini*
  315. The b:match_ini variable is set to the first pattern in |b:match_wholeBR|,
  316. with |backref|s resolved from |b:match_match|.
  317. *b:match_tail*
  318. The b:match_tail variable is set to the remaining patterns in
  319. |b:match_wholeBR|, with |backref|s resolved from |b:match_match|.
  320. *b:match_word*
  321. The b:match_word variable is set to the pattern from |b:match_wholeBR| that
  322. matches |b:match_match|.
  323. *b:match_table*
  324. The back reference '\'.d refers to the same thing as '\'.b:match_table[d] in
  325. |b:match_word|.
  326. ==============================================================================
  327. 5. Known Bugs and Limitations *matchit-bugs*
  328. Just because I know about a bug does not mean that it is on my todo list. I
  329. try to respond to reports of bugs that cause real problems. If it does not
  330. cause serious problems, or if there is a work-around, a bug may sit there for
  331. a while. Moral: if a bug (known or not) bothers you, let me know.
  332. The various |:vmap|s defined in the script (%, |g%|, |[%|, |]%|, |a%|) may
  333. have undesired effects in Select mode |Select-mode-mapping|. At least, if you
  334. want to replace the selection with any character in "ag%[]" there will be a
  335. pause of |'updatetime'| first.
  336. It would be nice if "\0" were recognized as the entire pattern. That is, it
  337. would be nice if "foo:\end\0" had the same effect as "\(foo\):\end\1". I may
  338. try to implement this in a future version. (This is not so easy to arrange as
  339. you might think!)
  340. ==============================================================================
  341. vim:tw=78:fo=tcq2: