PyArrow and the future of data analytics
07-14, 10:30–11:15 (Europe/Dublin), Liffey Hall 1

In this talk we will introduce PyArrow and talk bout the transformation that the Arrow format is allowing in the Data Analytics world.

PyArrow provides an in-memory format, a disk format, a network exchange protocol, a dataframe library and a query engine all integrated in a single library. But the Arrow ecosystem doesn't stop there and allows you to work integrating multiple different technologies. It can be a swiss army knife for data engineers and it integrates zero cost with NumPy and Pandas in many cases.


Expected audience expertise: Domain

none

Expected audience expertise: Python

some

Abstract as a tweet

PyArrow can be a swiss army knife for data engineers, providing an in-memory format, a disk format, a network exchange protocol, a dataframe api and a query engine all integrated in a single library.

Relying on Python as his primary development language for more than 15 years, has always been interested in Python as a Development Platform.

He worked as CTO and team leader of Python teams for the past 10 years and is currently core developer of the TurboGears2 web framework and a contributor to the Apache Arrow project.

Alessandro is the author of Crafting Test-Driven Software with Python and Modern Python Standard Library Cookbook
and has authored many OpenSource Python projects like the DEPOT file storage framework and the DukPy JavaScript interpreter for Python.

Alessandro has been an active speaker to tens of European conferences since 2012