Intro To Pydra - A Distributed Computing Framework For Python

Proposal
Short form
osb2009-0206

Excerpt

Pydra provides an easy-to-use framework for writing, running, and managing distributed computing jobs in Python. Come hear an overview of the Pydra project and how you can use it to deploy your own distributed computing cluster.

Description

Pydra is a distributed and parallel computing framework for Python being built by the Oregon State University Open Source Lab. Pydra is used by the Protein Geometry Database, a biochemistry research tool, and will be used by other projects developed at the OSL. The Pydra team is working hard towards an official release later this year.

Pydra will provide:

  • A Web interface for cluster management, monitoring and job scheduling
  • Strong security within the cluster
  • An easy-to-use, flexible job framework, including MapReduce support
  • Fault tolerance and automatic failover
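For readers unfamiliar with the MapReduce pattern mentioned in the list above, here is a minimal sketch in plain Python. It is not Pydra's API: the names `map_chunk`, `merge_counts`, and `word_count` are hypothetical, and local threads stand in for cluster nodes purely for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def map_chunk(lines):
    """Map step: count the words in one independent chunk of input."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def merge_counts(left, right):
    """Reduce step: combine two partial results into one."""
    return left + right


def word_count(lines, chunk_size=2, workers=4):
    # Split the input into chunks that can be processed independently.
    chunks = [lines[i:i + chunk_size] for i in range(0, len(lines), chunk_size)]
    # Assumption: local threads play the role of worker nodes in this sketch.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(map_chunk, chunks))
    # Fold the partial counts into a single result.
    return reduce(merge_counts, partials, Counter())


if __name__ == "__main__":
    text = [
        "pydra runs python jobs",
        "python jobs run in parallel",
        "pydra schedules jobs",
    ]
    print(word_count(text))
```

The map step never shares state between chunks, which is what lets a framework hand each chunk to a different machine; only the reduce step needs to see more than one partial result.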

This session will provide an overview of the Pydra project’s architecture, concepts, and roadmap. You’ll learn how to deploy and use Pydra within your own applications.

Tags

python, Distributed Computing

Speaker

  • Peter Krenesky

    Open Source Lab

    Biography

    Peter is the Lead Software Engineer for the Open Source Lab. He has been professionally developing open source software for the OSL for over 5 years. His primary areas of expertise are object-oriented design, data modeling, and software architecture. His language of choice is Python, but he is also experienced in Java, C++, and numerous other languages.

    Peter’s current projects include the Protein Geometry Database, a tool aiding biochemistry research, and Pydra, a cluster computing solution for Python. Past projects have included software for One Laptop Per Child and Helix Media Player. In his spare time he hacks on Android applications.