Anton Masalovich

Machine learning scientist and engineering manager with 15+ years of experience mostly focused on Natural Language Processing, Machine Translation and Document Recognition.
Ex-ABBYY, ex-Microsoft, currently working in a machine learning R&D department of a large healthcare organization. Have a lot of experience of transforming state-of-the-art ML ideas to actual product features.


Sessions

07-15
11:55
30min
Machine Translation engines evaluation framework
Anton Masalovich, Sahil Manchanda

As an engineers in a ML R&D department of large healthcare enterprise company we were presented with the task to evaluate several Machine Translation engines and choose the one best suited for our corporate needs. To do that we created extendable Python-based framework that allowed us to easily plug-in different Machine Translation engines and compare them across large variety of test datasets with a unified set of quality metrics. Our goal from the start was to create universal MT evaluation framework, that will be useful not only for healthcare domain, but to a wider community as well.

At this talk we will present our evaluation framework an will do a walk-through of its capabilities. We also cover how it can be extended to new MT engines, new test datasets and new language pairs. We will also present our evaluation results for several state-of-the-art machine translation engines, both open-source and cloud-based.

All the source code of our framework is published to open source:
https://github.com/Optum/nmt

PyData: Deep Learning, NLP, CV
Liffey Hall 1