AO Business Name-Only Match
Overview
Advanced Object (AO) Business Name-Only Match identifies matches by using the business name only without address components (for example, looking for all franchisees within a state or nationwide). In this process, the object will parse Business Names into discrete components and look for matching values across the components. AO Business Name-Only Match affords a high degree of control over the matching process, with numerous parameters that can be adjusted to improve match quality. The default settings in the configuration dialog represent best business practices and provide a very good starting point.
AO Business Name-Only Match accepts a single stream as input and produces a single output with the Firm Group value added.
If the input stream contains multiple data sources, at least one of the sources must have an internal dedupe flag set to "Y".
AO Business Name-Only Match configuration parameters
AO Business Name-Only Match has four sets of configuration parameters in addition to the standard execution options: Input, Match, Options, and Table.
AO Business Name-Only Match: Input tab
Match fields
Parameter | Description |
---|---|
Business name | (Required) Company name used for matching.
|
Business name 2 | (Optional) Alternate company name that can also be used for cross-field matching (for example, Lotus & IBM). Allows for different companies to be matched to either field.
|
Business keyword | (Optional) Firm keyword that is matched using a separate score. This should be used in conjunction with a separate parsing step that separates the significant parts of a business name from "common" parts. For example, suppose that you have the following business names: FIRST NATIONAL BANK OF MIAMI; 1ST NATL BANK MIAMI; FIRST NATIONAL BANK OF ORLANDO; BANK OF MIAMI. The keywords here are "MIAMI" and "ORLANDO". Using upstream regular expressions and the text parsing tools, you can create a separate KEYWORD field that contains the distinguishing parts of a business name, and specify this field as the Business keyword. If you specify this field, also configure the Business keyword matching score on the Match tab.
|
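The upstream parsing step described for Business keyword could look roughly like the following Python sketch. It is illustrative only: the `COMMON_WORDS` list and the `extract_keyword` helper are hypothetical names, not part of the product, and a real project would build the KEYWORD field with its own parsing rules.

```python
import re

# Hypothetical list of "common" words to drop; tune this to your own data.
COMMON_WORDS = {"FIRST", "1ST", "NATIONAL", "NATL", "BANK", "OF"}

def extract_keyword(business_name: str) -> str:
    """Return the distinguishing words of a business name, joined by spaces."""
    # Split the upper-cased name into alphanumeric tokens.
    tokens = re.findall(r"[A-Z0-9]+", business_name.upper())
    # Keep only the tokens that are not in the common-word list.
    return " ".join(t for t in tokens if t not in COMMON_WORDS)

names = [
    "FIRST NATIONAL BANK OF MIAMI",
    "1ST NATL BANK MIAMI",
    "FIRST NATIONAL BANK OF ORLANDO",
    "BANK OF MIAMI",
]
keywords = {name: extract_keyword(name) for name in names}
```

With this word list, the three Miami names all yield the keyword MIAMI and the Orlando name yields ORLANDO, which is the value you would then supply as the Business keyword field.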
Unique record ID
Parameter | Description |
---|---|
Record ID | (Optional) Field containing the unique record ID.
|
AO Business Name-Only Match: Match tab
Business matching
Parameter | Description |
---|---|
Business score | Match threshold for the business name field after any optional business adjustments (described below) are taken into account.
|
Match nicknames | Allows personal names within a firm name to be standardized. For example, in Liz Smith Enterprises versus Elizabeth Smith Enterprises, Liz and Elizabeth would be treated as identical.
|
Match initials | If selected, enables Initials match option.
|
Initials | Recover match points when an initial matches a full name (for example, J Robin Smith Inc versus Jonathon Robert Smith, Inc).
|
Match abbreviations | If selected, enables Abbreviations match option.
|
Abbreviations | Recover match points due to an abbreviation identified by pattern rather than known value (for example, Halbert Construction Contractors versus Hlbrt Construction Contractors).
|
Match acronyms | If selected, enables Acronyms match option.
|
Acronyms | Assign acronym matches a specific score for their part of the match (for example, International Machine Parts versus IMP).
|
Missing words | Recover match points due to missing or disjoint words (for example, Halston Construction Contractors versus Halston Contractors).
|
Strip vowels if > N bytes | If selected, allows for fewer comparisons on large firm words by eliminating the vowels and comparing only the consonants within any word longer than N bytes.
|
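To make the Acronyms, Initials, and Strip vowels options more concrete, here is a minimal Python sketch of the kind of comparison each one describes. It is not the object's actual scoring logic, which is not documented here. All function names are hypothetical, and the handling of leading vowels is an assumption.

```python
import re

def acronym_of(name: str) -> str:
    """Build an acronym from the first letter of each word."""
    return "".join(word[0] for word in name.upper().split())

def is_acronym_match(a: str, b: str) -> bool:
    """True if one name is the acronym of the other."""
    a_up, b_up = a.upper().strip(), b.upper().strip()
    return a_up == acronym_of(b) or b_up == acronym_of(a)

def is_initial_match(word_a: str, word_b: str) -> bool:
    """True if one word is a single letter that starts the other word."""
    short, long_ = sorted((word_a.upper(), word_b.upper()), key=len)
    return len(short) == 1 and len(long_) > 1 and long_.startswith(short)

def strip_vowels(word: str, max_len: int) -> str:
    """Drop vowels from words longer than max_len (assumption: all vowels,
    including a leading one, are removed)."""
    if len(word) <= max_len:
        return word
    return re.sub(r"[AEIOUaeiou]", "", word)
```

For example, `is_acronym_match("International Machine Parts", "IMP")` and `is_initial_match("J", "Jonathon")` both return `True`, and `strip_vowels("Construction", 8)` returns `"Cnstrctn"`, while a shorter word such as `"Parts"` is left unchanged.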
Business keyword matching
Parameter | Description |
---|---|
Keyword score | Match threshold for business keyword field after optional business Keyword adjustments (described below) are taken into account.
|
Match blank keyword | Specifies method for matching blank business keyword input field. Options are:
The default is Blank vs. Blank Only. |
Match abbreviations | Allows abbreviated values in the business keyword field to be treated as an exact match (for example, MISS versus MISSISSIPPI).
|
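The keyword options above can be pictured with the following Python sketch. It rests on two loudly stated assumptions: that the default Blank vs. Blank Only setting means a blank keyword matches only another blank keyword, and that an abbreviation is a leading fragment of the full word. The `keywords_match` function is hypothetical and does not reproduce the object's scoring.

```python
def keywords_match(a: str, b: str, match_abbreviations: bool = True) -> bool:
    """Compare two keyword values under the assumptions described above."""
    a, b = a.strip().upper(), b.strip().upper()
    # Assumed "Blank vs. Blank Only" behavior: blank matches only blank.
    if not a or not b:
        return a == b
    if a == b:
        return True
    if match_abbreviations:
        # Assumption: an abbreviation is a prefix of the longer keyword.
        short, long_ = sorted((a, b), key=len)
        return long_.startswith(short)
    return False
```

Under these assumptions, `keywords_match("MISS", "MISSISSIPPI")` returns `True`, whereas `keywords_match("", "MIAMI")` returns `False`.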
Segmentation
Parameter | Description |
---|---|
Business letters | Segments the data on the first X characters of the business name, so that only records within the same segment are compared. This option improves performance; however, it may reduce the number of potential matches.
|
Max segment size | Maximum number of records compared in a single comparison group. The default setting allows for nearly-unlimited segment size. If you want to limit segment size to avoid runaway computation, potentially at the expense of missing a few record matches, set this to a lower value like 1000. Typically you can reduce this value unless you are matching within a very large segment like STATE.
|
Custom segment | (Optional) If segmentation other than above is desired, select the field containing the segment key.
|
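The trade-off behind the segmentation options can be sketched in Python as follows. The `candidate_pairs` helper is hypothetical, and truncating an oversized segment to its first `max_segment_size` records is an assumption made for illustration; the object's actual handling of oversized segments may differ.

```python
from collections import defaultdict
from itertools import combinations

def segment_key(name: str, letters: int) -> str:
    """Segment key: the first `letters` characters of the business name."""
    return name.strip().upper()[:letters]

def candidate_pairs(names, letters=3, max_segment_size=1000):
    """Return the record pairs that would be compared within each segment."""
    segments = defaultdict(list)
    for name in names:
        segments[segment_key(name, letters)].append(name)
    pairs = []
    for members in segments.values():
        # Assumption: records beyond the segment size limit are not compared.
        limited = members[:max_segment_size]
        pairs.extend(combinations(limited, 2))
    return pairs

names = [
    "Halston Construction Contractors",
    "Halston Contractors",
    "Halbert Construction Contractors",
    "Acme Tools",
]
pairs_3 = candidate_pairs(names, letters=3)
pairs_4 = candidate_pairs(names, letters=4)
```

With three letters, the two Halston records and the Halbert record share the segment HAL and produce three candidate pairs. With four letters, Halbert falls into its own segment and only one pair remains, which illustrates how a longer key reduces work but can also exclude potential matches.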
AO Business Name-Only Match: Options tab
Reporting
Parameter | Description |
---|---|
Output match score | If selected, outputs the overall score for the matched records as a percentage between 1 and 100.
|
Source control
Parameter | Description |
---|---|
Source | Field containing the logical description for input data source. This is usually defined in AO Define Source.
|
Internal dedupe flag | Field containing a Y/N flag indicating whether or not data from a particular source should be compared against itself (deduped) or solely against other sources. As a general rule, master databases are not internally deduped whereas update files are.
|
Compare sources not internally deduped | If a matching process has more than one source with the internal dedupe flag set to "N", selecting this option compares those sources against each other.
|
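One way to read the source control options is as a rule that decides whether two records are compared at all. The Python sketch below encodes that reading. The `should_compare` function and the record layout (`source` and `dedupe` keys) are hypothetical, and the rule is an interpretation of the descriptions above rather than the object's documented logic.

```python
def should_compare(rec_a: dict, rec_b: dict,
                   compare_non_deduped: bool = False) -> bool:
    """Decide whether two records are candidates for comparison."""
    a_dedupe = rec_a["dedupe"] == "Y"
    b_dedupe = rec_b["dedupe"] == "Y"
    if rec_a["source"] == rec_b["source"]:
        # Within one source, compare only if it is internally deduped.
        return a_dedupe
    if a_dedupe or b_dedupe:
        # Across sources, compare when at least one side is flagged "Y".
        return True
    # Both sources are flagged "N": compare only if the option is selected.
    return compare_non_deduped

master_1 = {"source": "MASTER", "dedupe": "N"}
master_2 = {"source": "MASTER", "dedupe": "N"}
update_1 = {"source": "UPDATE", "dedupe": "Y"}
update_2 = {"source": "UPDATE", "dedupe": "Y"}
archive_1 = {"source": "ARCHIVE", "dedupe": "N"}
```

Here two MASTER records are never compared with each other, two UPDATE records are, and a MASTER record is compared with an ARCHIVE record only when `compare_non_deduped` is `True`.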
Parallel processing
Parameter | Description |
---|---|
Parallelism level | Set to the lesser of the number of CPU cores on the Execution Server, or the number of threads configured in the project in which the macro is embedded.
|
Optimize for large segments | If you receive warnings like "Window Compare segment size for value (06828EAS3135) has exceeded 2000," enable this option. Comparing large candidate groups may reduce matching efficiency. Selecting this option increases the number of records that can be sent to a matching process without slowing processing.
|
AO Business Name-Only Match: Table tab
Parameter | Description |
---|---|
Business alias table | Path and name of optional DLD table used to augment or override the alias values defined within the macro. The table must be of the form below.
Where
|
Business noise table | Path and name of optional DLD table used to add "noise" words for Firm matching. The table must be a single-column DLD table of the form below.
Where
|
Configure AO Business Name-Only Match
Select AO Business Name-Only Match.
Go to the Input tab on the Properties pane.
Select the input fields.
To specify alternate input fields, you may optionally go to the Input2 tab.
Go to the Match tab to edit matching options.
Go to the Options tab to configure reporting, source control, and parallel processing options.
Optionally, go to the Table tab to specify alias and noise reference tables.
Optionally, go to the Execution tab, and then set Web service options.