AO Field Analyzer
Advanced Object (AO) Field Analyzer scans a specified field and summarizes the most common word values for that field. This macro is useful for finding common "noise" words in Company or Firm fields that might cause over-matching. It accepts a single stream as input and produces one output. You may also optionally define K and S inputs, which are fields containing one or more patterns describing textual components to Keep together or Split apart. See the repository sample project AO Field Analyzer Sample for a demonstration of the macro in action.
AO Field Analyzer configuration parameters
AO Field Analyzer has a single set of configuration parameters in addition to the standard execution options:
Input
Data for analysis | Required. Field containing values to be scanned and summarized. Default: None. |
Noise characters
Symbols | Optional. If selected, the following non-alphanumeric characters are used to delineate words: |
Special characters | Optional. If selected, control characters (tab, newline, carriage return, form feed) are used to delineate words. Default: Yes. |
Custom characters | Optional. User-defined characters used to delineate words. Default: None. |
Optional parsing tables
Keep together values | Optional. Field containing one or more patterns (regular expressions) that you want to keep whole, even if they contain noise characters. Examples: 3.5 MM, 12 1/2 FT. Default: None. |
Split patterns | Optional. Field containing one or more patterns (regular expressions) describing textual components you want separated into two or more words. Default: None. |
Configure AO Field Analyzer
Select AO Field Analyzer, and then go to the Configuration tab on the Properties pane.
Select Data for analysis and select the field to analyze.
Optionally, specify Noise characters to delineate words:
Select Symbols to use the following characters to delineate words: ~!@#$%^& *()_+`=[];:",<>./?\ plus the space character. Enabled by default.
Select Special characters to use ASCII control characters (tab, newline, carriage return, form feed) to delineate words. Enabled by default.
Specify Custom characters to use characters other than those listed as Symbols and Special characters to delineate words.
Optionally, you may define a table containing one or more regular expressions describing textual components to keep together. Connect this to the K input, and then select the field containing the Keep together values.
Optionally, you may define a a table containing one or more patterns describing textual components to split apart. Connect this to the S input, and then select the field containing the Split values.
Optionally, go to the Execution tab, and then set Web service options.