|
Filename Masks |
Top Previous Next |
|
As duplicate file system scanning progresses, each file name encountered is compared against the current project's filename masks. If the mask does not match the name of the current file, that file is not included in the duplicate comparison process.
Filename masks can be defined using one of two syntaxes: basic wildcard patterns or more advanced regular expressions. Use the "Use regular expressions" switch in the filename masks section of the search filtering panel to switch between these two modes.
Wildcard Patterns
When operating in wildcard pattern mode, filename masks are comprised of patterns of characters, with multiple sets of masks separated by a semicolon. Wildcard characters include '?' and '*', which match either one instance or multiple instances of any character (respectively). Any other (non-wildcard) character matches itself.
Further, any filename mask can be preceded by a tilde character ( '~' ), which specifies that the mask is exclusionary. If a filename matches an exclusionary mask, the file will always be skipped.
Wildcard Pattern Examples
Here are a few examples of wildcard filename masks:
The filename masks section of the Search Filtering docking pane also includes a Presets button, which you can click for easy access to a range of built-in masks. These can help you to get started quickly. Presets can also be customized to suit your needs.
Important Note: When using wildcards in filename masks, keep in mind that a mask of '*.*' is subtly different than just '*'. The former requires that a dot (.) be included in the filename, while the latter does not. In other words, the *.* pattern will not match filenames that don't have an extension - if you wish to match these, use just a single asterisk (*) instead.
Regular Expressions
As noted above, filename masks can also be defined using powerful regular expression syntax. Regular expressions are formulas that can be used to match strings of text that follow some pattern. They allow their users to succinctly express a set of character matching rules that would otherwise require a large number of switches and logical operations.
This help file will not provide an in-depth tutorial on the formation of regular expressions, simply because a large number of such tutorials are freely available on the web (visit your favorite search engine and enter "regular expressions" into the search box to find them).
Please keep in mind, however, that there are subtle differences between the regular expression syntax engines that various applications employ. The charts below provide an overview of the regular expression metacharacters and abbreviations supported by Duplicate File Detective.
Abbreviations
\a Any alphanumeric character: ([a-zA-Z0-9]) \b White space (blank): ([ \\t]) \c Any alphabetic character: ([a-zA-Z]) \d Any decimal digit: ([0-9]) \h Any hexadecimal digit: ([0-9a-fA-F]) \n Newline: (\r|(\r?\n)) \q A quoted string: (\"[^\"]*\")|(\'[^\']*\') \w A simple word: ([a-zA-Z]+) \z An integer: ([0-9]+)
|