Overview
Advanced Object (AO) Field Analyzer scans a specified field and summarizes the most common word values for that field. This macro is useful for finding common "noise" words in Company or Firm fields that might cause over-matching. It accepts a single stream as input and produces one output. You may also optionally define K and S inputs, which are fields containing one or more patterns describing textual components to Keep together or Split apart. See the repository sample project AO Field Analyzer Sample for a demonstration of the macro in action.
AO Field Analyzer configuration parameters
AO Field Analyzer has a single set of configuration parameters in addition to the standard execution options.
Input
|
Parameter |
Description |
|---|---|
|
Data for analysis |
(Required) Field containing values to be scanned and summarized.
|
Noise characters
|
Parameter |
Description |
|---|---|
|
Symbols |
(Optional) If selected, the following non-alphanumeric characters are used to delineate words:
|
|
Special characters |
(Optional) If selected, control characters (tab, newline, carriage return, form feed) are used to delineate words.
|
|
Custom characters |
(Optional) User-defined characters used to delineate words.
|
Optional parsing tables
|
Parameter |
Description |
|---|---|
|
Keep together values |
(Optional) Field containing one or more patterns (regular expressions) that you want to keep whole, even if they contain noise characters. Examples: 3.5 MM, 12 1/2 FT.
|
|
Split patterns |
(Optional) Field containing one or more patterns (regular expressions) describing textual components you want separated into two or more words.
|
Configure AO Field Analyzer
-
Select AO Field Analyzer.
-
Go to the Configuration tab on the Properties pane.
-
Select Data for analysis and choose the field to analyze.
-
Optionally, specify Noise characters to delineate words:
-
(Enabled by default) Select Symbols to use the following characters to delineate words: ~!@#$%^& *()_+`=[];:",<>./?\ plus the space character.
-
(Enabled by default) Select Special characters to use ASCII control characters (tab, newline, carriage return, form feed) to delineate words.
-
Specify Custom characters to use characters other than those listed as Symbols and Special characters to delineate words.
-
-
Optionally, you may define a table containing one or more regular expressions describing textual components to keep together. Connect this to the K input, and then select the field containing the Keep together values.
-
Optionally, you may define a a table containing one or more patterns describing textual components to split apart. Connect this to the S input, and then select the field containing the Split values.
-
Optionally, go to the Execution tab, and then set Web service options.