by Marco Nenciarini for EuroPython 2011
How Python was used to implement a tool for the daily backup of an over 50 terabyte distributed data warehouse based on Greenplum Database technology.
The talk covers our experiences, from the initial customer requirements to the first alpha release of the tool, describing the main issues we faced and the Python modules we adopted to address them, while using agile methods.
Though the application is not currently available as an open-source project, the talk gives valuable insights into the development of distributed and parallel applications using Python.