-
Notifications
You must be signed in to change notification settings - Fork 214
Expand file tree
/
Copy pathREADME
More file actions
executable file
·28 lines (22 loc) · 990 Bytes
/
README
File metadata and controls
executable file
·28 lines (22 loc) · 990 Bytes
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
==
__ ___ __
____/ /___ ______________ / (_) /_
/ __ / __ \/ ___/ ___/ __ \/ / / __/
/ /_/ / /_/ / /__(__ ) /_/ / / / /_
\____/\____/\___/____/ .___/_/_/\__/
/_/
Docsplit is a command-line utility and Ruby library for splitting apart
documents into their component parts: searchable UTF-8 plain text, page
images or thumbnails in any format, PDFs, single pages, and document
metadata (title, author, number of pages...)
Installation:
gem install docsplit
Added the options:
pdf_opts: which can be used to passed the pdftotext binary file options to docsplit gem
For Example:
Passing raw options to pdftotext,
Docsplit.extract_text(path, {:pdf_opts => '-raw'})
For documentation, usage, and examples, see:
http://documentcloud.github.com/docsplit/
To suggest a feature or report a bug:
http://github.com/documentcloud/docsplit/issues/