Easy metadata and document extraction with Apache Tika

If you need a simple tool to extract text and metadata from web or documents (most types) you should check out java based Tika from Apache.