Vol. 54 No. 4 (2006)
Research Article

A Case Study in Name Matching

Published 2006-12-01

Abstract

We examined a well-documented habitational surname of English origin that has over fifty known variants, and evaluated how well several common name search algorithms identify those variants as equivalent to the original surname. As part of this analysis, we developed a new algorithm that incorporates habitational information. The new algorithm showed improved performance on the data set, achieving 92% success in name matching when the best two matches were used. It is easily automated and holds promise as a search technique for much larger data sets.
