Skip to content
goossaert edited this page Sep 14, 2010 · 21 revisions

What is Prince?

Prince is an extra-light API that enables Hadoop to use Python methods. As the goal is to keep the API as small and light as possible, the API only contains a few methods! I developed this API as I was trying to code some algorithms using the MapReduce paradigm and more complex than the usual word count example. Other solutions for Hadoop/Python already exist, but they are either not up to date with the latest Hadoop versions, or they are not flexible enough for complex map/reduce combinations. So I had no choice but to code my own API. The reason why I want it to remain as small and flexible as possible is to be able to switch all my algorithms to some new good and complete framework as soon as it will be made available. Examples coded with Prince include so far: word count, total count, merge sort and Dijkstra’s single source shortest path.

Prince is now version 0.1, and has been tested with Hadoop 0.20.1 and Python 2.6.4

The Little Prince