Talk 36:00

And Yet Akin: Name Disambiguation in Elixir

Synonymity and homonymity make name disambiguation difficult. To ease this difficulty, I combined two unmaintained Elixir string comparison libraries and added preprocessing and a Double Metaphone algorithm. The result is a comprehensive map of comparison scores that can be used for pattern identification and machine learning. This talk will address the preprocessing, algorithms, and scoring, as well as the library's strengths and limitations. A live demonstration of scoring will let us identify patterns in the results. We will end with a discussion of how to gain further benefits from the scores.

OBJECTIVES:

To introduce the problem of name disambiguation and string comparison by examining two existing string comparison libraries, and then to describe the process of combining them into a single repository. I hope attendees will leave with an understanding of the problem, of the new library's strengths, limitations, and possibilities, and of how it can be used to address the challenges of name disambiguation.

AUDIENCE:

Beginner to intermediate programmers.