The Million Book Project, originating from the Carnegie Mellon University, aims to digitize a million public domain books by 2005. The plan is to scan the books, and index them using OCR technology. A pilot Thousand Book Project was performed to test the concept.