TOPIC

#extraction

Open source repositories tagged with #extraction, ranked by health score.

apache/tika
Java
Health score: 90

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

3.8k