PDFBox - References

PDFBox References

This page lists projects that utilize PDFBox and articles that have been written about PDFBox. Send me an e-mail if your article or project is missing.

Projects


Project Name License Project Description
Centric CRM Free To Use But Restricted/Commercial The Most Advanced Open Source CRM Software.
Canoo Webtest BSD Like Free OpenSource tool for XP-style acceptance testing of Java-based Web applications.
Jomic GPL Jomic is a viewer for comic book archives.
LIUS GPL LIUS is an indexing Java framework based on the Jakarta Lucene project. The LIUS framework adds to Lucene many files format indexing fonctionalities as: Ms World, Ms Excel, Ms PowerPoint, RTF, PDF, XML, HTML, TXT, Open Office suite and JavaBeans.
LuceGene Artistic License LuceGene is an open-source document/object search and retrieval system specially tuned for bioinformatics text databases and documents.
Nutch ASL Nutch is open source web-search software. It builds on Lucene, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.
OpenCms Custom OpenCms is a professional level Open Source Website Content Management System.
PodReader GPL PodReader is an application that facilitates making electronic documents like eBooks readable on your iPod.
SearchBlox Commercial SearchBlox is a high-performance corporate search software designed for the Java 2 Enterprise Edition (J2EE) platform.
Terrier MPL Terrier is software for the rapid development of Web, intranet and desktop search engines.

Articles/Books


Article Name Article Abstract
Build an eDoc Reader for your iPod
Part 1 - User Interface
Part 2 - Document Reading Engine
Part 3 - *Integration with PDFBox*
A three part article that discusses the implementation of the PodReader application. PodReader is Cocoa application written in Objective-C and article discusses how to use the Cocoa-Java bridge to integrate with the Java version of PDFBox.
Lucene In Action A book that discusses integrating with the lucene search engine. One chapter discusses how to index various file formats and highlights PDFBox for indexing PDF documents.
Java Developers Journal - March 2005 An article written by the lead developer of PDFBox discussing text extraction and AcroForm integration using PDFBox functionality.