Correlating messy data with "correlate"
07-14, 16:05–16:35 (Europe/Dublin), Liffey B

An introduction to the correlate Python library. You tell correlate about two datasets that should map to each other, and it determines the best matches for you. The novel scoring algorithm at the heart of correlate means it copes exceedingly well with messy real-world data. correlate supports fuzzy matching, weighted matching, and ordering.


Data correlation! What could be more computer science-y! Ever needed to find matching items between two sets of data? Maybe even messy real-world data, with inexact string matches? Come find out how the novel scoring algorithm and clever heuristics at the heart of correlate solve this problem with ease!


Expected audience expertise: Domain

none

Expected audience expertise: Python

some

Abstract as a tweet

"correlate" finds good matches between two sets of messy real-world data! How? Come find out!

Larry is a 200 foot assault robot manufactured by Yoyodyne Propulsion Systems, a major US defense contractor. He is suitable for heavy assault against heavily armored stationary targets, like laying siege to a walled city, or protecting supply lines during forward maneuvers.