US 7,599,921 B2
System and method for improved name matching using regularized name forms
David Edward Biesenbach, Alexandria, Va. (US); Richard Theodore Gillam, Chantilly, Va. (US); Frankie Elizabeth Patman Maguire, Washington, D.C. (US); Leonard Arthur Shaefer, Jr., Leesburg, Va. (US); and Charles Kinston Williams, Fairfax, Va. (US)
Assigned to International Business Machines Corporation, Armonk, N.Y. (US)
Filed on Mar. 02, 2007, as Appl. No. 11/681,333.
Prior Publication US 2008/0215562 A1, Sep. 04, 2008
Int. Cl. G06F 17/30 (2006.01)
U.S. Cl. 707—3  [704/7; 704/8] 17 Claims
 
1. A computer-implemented method comprising:
retrieving, by a processor, a candidate name from memory;
identifying, by the processor, a cultural classification that corresponds to the candidate name;
retrieving, by the processor, one or more culture-specific regularization rules from the memory corresponding to the cultural classification;
applying, by the processor, one or more of the culture-specific regularization rules to the candidate name, resulting in a regularized candidate name, wherein the applying further comprises:
determining that a first regularization rule included in the one or more culture-specific regularization rules applies to the candidate name;
generating a first iteration regularized candidate name by applying the first regularization rule to the candidate name;
determining that a second regularization rule included in the one or more culture-specific regularization rules applies to the candidate name; and
generating the regularized candidate name by applying the second regularization rule to the first iteration regularized candidate name;
storing the regularized candidate name in the memory;
comparing, by the processor, the regularized candidate name with a regularized query name;
determining, by the processor, that the comparison meets a regularization matching threshold, which indicates a potential match between the regularized candidate name and the regularized query name; and
in response to determining that the comparison meets the regularization matching threshold, providing the candidate name to the user.
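
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The culture label, the rule table, the toy classifier, the use of `difflib.SequenceMatcher` as the comparison measure, and the 0.85 threshold are all assumptions introduced here for illustration; the claim does not specify any of them.

```python
import re
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Rule:
    """A hypothetical regularization rule: a regex and its replacement."""
    pattern: str
    replacement: str

    def applies_to(self, name: str) -> bool:
        return re.search(self.pattern, name) is not None

    def apply(self, name: str) -> str:
        return re.sub(self.pattern, self.replacement, name)


# Invented rule table keyed by cultural classification (illustrative only).
RULES = {
    "hispanic": [Rule(r"\bde la\b", "dela"), Rule(r"z\b", "s")],
    "generic": [],
}


def classify(name: str) -> str:
    """Toy stand-in for the cultural classification step."""
    return "hispanic" if re.search(r"\bde la\b|ez\b", name.lower()) else "generic"


def regularize(name: str, culture: str) -> str:
    """Apply the culture-specific rules one after another.

    As in the claim, applicability is checked against the candidate name,
    while each rule is applied to the result of the previous iteration.
    """
    original = name.lower().strip()
    current = original
    for rule in RULES.get(culture, []):
        if rule.applies_to(original):
            current = rule.apply(current)
    return current


def find_matches(query: str, candidates: list[str], threshold: float = 0.85) -> list[str]:
    """Return the original candidate names whose regularized form matches the query."""
    reg_query = regularize(query, classify(query))
    store = {}    # stands in for "storing the regularized candidate name in the memory"
    matches = []
    for candidate in candidates:
        reg = regularize(candidate, classify(candidate))
        store[candidate] = reg
        score = SequenceMatcher(None, reg, reg_query).ratio()
        if score >= threshold:
            matches.append(candidate)  # the candidate name itself is provided
    return matches


if __name__ == "__main__":
    print(find_matches("Maria Dela Crus", ["Maria de la Cruz", "John Smith"]))
```

Note that the comparison is made between regularized forms, but the name handed back is the unmodified candidate name, which mirrors the final step of the claim.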