Parse and Permutate Explained

A common problem with data we receive is the content often varies in standardization. Data feeds are sourced from other systems or could be the result of someone “keying” the data into your database. All scenarios result in inconsistent data.  QDS offers the ability to “clean” data before any search actions are performed.

QDS uses a two-step strategy to clean and generate name variations: Parse and Permutate.

The ability to parse data ensures the highest level of accuracy. Along with parsing data, name variations can be generated to ensure all possible searches are performed, resulting in the highest level of due diligence and compliance. The goal is to find additional hits, identify active SCRA, MLA or Bankruptcy when it’s absolutely critical and avoid the need to rescind a Forclosure Sale or some other legal action.

Name Parsing

Parsing ensures all name parts are accurate and any additional subjects are extracted from the original subject. If a name provided is mostly normalized but contains a suffix within the last name, parsing will detect. If the name is 100% standardized the name parsing will bypass the name. Parsing ensures that provided subject names are stored and represented as discrete data elements.

Ideally, orders adhere to this format:

  • Prefix: Mr.
  • First Name: John
  • Middle Name: A
  • Last Name: Doe
  • Suffix: Jr

However, in practice we see:

  • First Name: Mr John
  • Middle Name: A
  • Last Name: Doe Jr aka John Smith aka Johnathan Allen Smith

Or even:

  • John A Doe Jr aka Jane Smith fka Janice Smith aka Shiela Smith (Deceased) aka Shelia Ann Smith aka Estate of Mike Smith

Parsing the latter example generates six parsed contacts called “aliases”:

Alias 1Alias 2Alias 3Alias 4Alias 5Alias 6
John A Doe JrJane SmithJanice SmithShiela SmithSheila Ann SmithMike Smith

Parsing offers the following features:

FeatureDescription
Known Word RemovalRemoves known string patterns from last name such as Estate, AKA, deceased, etc. Additional words can be added to removal list.
Prefix RemovalRemoves known names prefixes from first name.
Space NormalizationReduces multiple spacing between name parts to a single space. Removes any leading and trailing spaces next to hyphens.
Word ScrubbingRemoves and/or substitutes all non-word characters from names including numbers and any type of whitespace with the exception of Non Removed Characters.
Segment ParsingSplits last name based on presence of Segment Delimiter and creates name aliases. Used when multiple full names are passed in last name. For example, last name Smith AKA John A. Doe Jr will generate two name segments: 1) last name Smith 2) John A. Doe Jr where the left most name is the primary segment and the subsequent names are aliases. Segments are evaluated to ensure that no duplicate alias are produced by redundant segments.

Upon completion, the name is either ready to be searched or permutated.

Name Permutations

Name Permutations offers the ability to generate name variations on a given subject. If a contact needs to be cleaned, it will be parsed before reaching any of the designated permutation features. This allows multiple searches to be performed when a name contains a multi-part name or suffixes. The ability to leverage name permutations along with name parsing ensures the highest level of Accuracy and Compliance.

Permutation offers the following features:

FeatureDescriptionOptionExample
TransposeTransposes name parts to generate all unique variations. Transposing occurs on any number of name parts provided.True | FalseOriginal Name: Smith Doe
Generates: Doe Smith
Merge NamesMerges compound name parts into a single name. Merging occurs on any number of name parts provided.True | FalseOriginal Name: Smith Doe
Generates: SmithDoe
Force HyphenGenerates additional name variations only on two part last names delimited by a spaces replacing the space with a hyphen.True | FalseOriginal Name: Smith Doe
Generates: Smith-Doe
Search OriginalAllows original name to be searched even in case of name containing known words along with any name variations generated.True | FalseOriginal Name: (Deceased) Jones

(Deceased) would be flagged as a known word and the original name would be disqualified.
Toggle Middle NameAbility to generate name variations with/without middle name when middle name is provided. Middle name toggle generates variations after transposing of name parts.True | FalseOriginal Name: John A Doe
Generates:
John Doe
John A Doe
Toggle Primary NameAbility to generate name variations with/without first and middle names.True | FalseOriginal Name: John A Doe
Generates: John A Doe
Doe
Middle Name ParseWill apply middle name and/or initial to alias or permutated names.
Option turned off.None
Inserts new contact for every unique middle name. Always uses the original alias middle name (even if initial). Shares all middle names across all original aliases and primaryShareOriginal Name: John Bailey Doe AKA James Michael Doe
Generates: John Bailey Doe
James Michael Doe
John Michael Doe
James Bailey Does
If a middle name is a middle initial and it matches first letter of alias contact’s middle name, then the full middle name is shared with the original contact.Share MergeOriginal Name: John B Doe AKA James Bailey Doe
Generates:John B Doe
John Bailey Doe
James Bailey Doe
Inserts new contact with middle initial when middle name is provided.Append Middle InitialOriginal Name: John Bailey Doe AKA James Michael Doe
Generates: John Bailey Doe
John B Doe
James Michael Doe
James M Doe
Inserts new contact with middle initial on ALL permutations/aliases when middle name is provided.Append Middle Initial AllOriginal Name: John Bailey Doe AKA James Michael Doe
Generates:
John Bailey Doe
John B Doe
John M Doe
James Michael Doe
James M Doe
James B Doe
Include SuffixAbility to generate name variations using name suffix at given levels:
Omits name suffix from any searchesNoneOriginal Name: John  Doe, Jr.
Generates: John Doe
Search with and without and without name suffix for the original namesOriginal OnlyOriginal Last Name: John Doe Jr AKA James Smith Jr
Generates:
John Doe
John Doe Jr
James Smith
James Smith Jr

± There are no compound names.
All name variations will be searched with and without suffix when a compound nameAllOriginal Last Name: John Doe Jr a.k.a. James A Smith Doe Jr
Generates:
James Smith Doe Jr
James Smith Doe
James Smith
James Smith Jr
James Doe
James Doe Jr

± John Doe searches are the same
±± Depending on permutation model each name will be generated with and without suffix
Mixed Case ParseSplits single names with mixed case names into compound name partsTrue | FalseOriginal Last Name: LaVerne
Generates: La
Verne
Middle Name ParseWill apply middle name and/or initial to alias or permutated names.
Option turned off.None
Inserts new contact for every unique middle name. Always uses the original alias middle name (even if initial). Shares all middle names across all original aliases and primaryShareOriginal Name: John Bailey Doe AKA James Michael DoeGenerates:
John Bailey Doe
James Michael Doe
John Michael Doe
James Bailey Does
If a middle name is a middle initial and it matches first letter of alias contact’s middle name, then the full middle name is shared with the original contact.Share MergeOriginal Name: John B Doe AKA James Bailey Doe
Generates:
John B Doe
John Bailey Doe
James Bailey Doe
Inserts new contact with middle initial when middle name is provided.Append Middle InitialOriginal Name: John Bailey Doe AKA James Michael Doe
Generates:
John Bailey Doe
John B Doe
James Michael Doe
James M Doe
Inserts new contact with middle initial on ALL permutations/aliases when middle name is provided.Append Middle Initial AllOriginal Name: John Bailey Doe AKA James Michael Doe
Generates:
John Bailey Doe
John B Doe
John M Doe
James Michael Doe
James M Doe
James B Doe
Suppress Hyphen ParseAllows for granular control over handling of hyphens in last names:
Defaults to standard hyphen-parse behaviorNeverOriginal Name: John Doe-Smith
Generates:
John Doe-Smith
John Doe Smith
John Smith
John Doe
Always suppress hyphen-parse behaviorAlwaysOriginal Name: John Doe-Smith
Generates: John Doe-Smith
Suppress hyphen parsing only when there is a hyphen in a multi-part last name.MultipleOnlyOriginal Name: John Doe-Smith Bailey

Generates: John Doe-Smith Bailey
John Doe-Smith
John Bailey

± John Doe-Smith would be parsed using default hyphen parsing

± Permutation options can be used in combination or interchangeably.

Quandis can also implement custom Parsing and Permutation functionality to match your specific needs.

Use Cases

Transpose:
NameTranspose OptionContacts Searched
John Bailey Doe SmithFalseJohn Bailey Doe
John Bailey Smith
TrueJohn Bailey Doe
John Bailey Smith
John Bailey Smith Doe
Merge Names:
NameMerge Names OptionContacts Searched
Sarah O ConnorFalseSarah Connor
Sarah O
TrueSarah OConnor
Sarah Connor
Sarah O
Force Hyphen:
NameForce Hyphen OptionContacts Searched
John Doe SmithFalseJohn Doe Smith
John Smith
John Doe
TrueJohn Doe Smith
John Smith
John Doe
John Doe-Smith
Search Original:
NameSearch Original OptionContacts Searched
John A Doe (DECEASED)FalseJohn A Doe
TrueJohn A Doe (DECEASED)
John A Doe
Toggle Middle Name:
NameToggle Middle Name OptionContacts Searched
John Bailey DoeFalseJohn Bailey Doe
TrueJohn Bailey Doe
John Doe
Middle Name Parse:
NameMiddle Name Parse OptionContacts Searched
John Bailey DoeNoneJohn Bailey Doe
John Bailey Doe AKA Joe Michael DoeShareJohn Bailey Doe
John Michael Doe
Joe Michael Doe
Joe Bailey Doe
John B Doe AKA Joe Bailey DoeShare MergeJohn B Doe
John Bailey Doe
Joe Bailey Doe

± Joe B Doe is not created.
John Bailey Doe AKA Joe Michael DoeAppend Middle InitialJohn Bailey Doe
John B Doe
Joe Michael Doe
Joe M Doe
John Bailey Doe AKA Joe Michael DoeAppend Middle Initial AllJohn Bailey Doe
John B Doe
Joe Michael Doe
Joe M Doe
John Michael Doe
John M Doe
Joe Bailey Doe
Joe B Doe
Include Suffix:
NameInclude Suffix OptionContacts Searched
John Doe Sr.NoneJohn Doe
John Doe Sr. AKA John SmithOriginal OnlyJohn Doe
John Doe Sr.
John Smith
John Doe Sr. AKA John SmithAllJohn Doe Sr.
John Doe
John Smith Sr.
John Smith
Mixed Case Parse:
NameMixed Case Parse OptionContacts Searched
John DeCostaFalseJohn DeCosta
TrueJohn De Costa
John Costa
John De

Combining Features

Permutation features can be used in combination, allowing for the highest level of due diligence in your searches.

NameActive OptionsContacts Searched
John A Doe AKA Joe Doe AKA John Andrew DoeMiddle Name Parse: Share
Toggle Middle Name: True
John A Doe
John Doe
Joe Doe
Joe A Doe
Joe Andrew Doe
John Andrew Doe
John Doe SmithTranspose: True
Force Hyphen: True
Search Original: True
John Doe Smith
John Smith Doe
John Smith
John Doe
John Doe-Smith
John Smith-Doe