Skip to content
This repository has been archived by the owner on Apr 25, 2023. It is now read-only.
/ docsplit Public archive
forked from documentcloud/docsplit

Break Apart Documents into Images, Text, Pages and PDFs

License

Notifications You must be signed in to change notification settings

LegalSifter/docsplit

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

==
         __                      ___ __ 
    ____/ /___  ______________  / (_) /_
   / __  / __ \/ ___/ ___/ __ \/ / / __/
  / /_/ / /_/ / /__(__  ) /_/ / / / /_  
  \____/\____/\___/____/ .___/_/_/\__/  
                      /_/
                      
  Docsplit is a command-line utility and Ruby library for splitting apart
  documents into their component parts: searchable UTF-8 plain text, page 
  images or thumbnails in any format, PDFs, single pages, and document 
  metadata (title, author, number of pages...)
  
  Installation:
  gem install docsplit
  
  For documentation, usage, and examples, see:
  http://documentcloud.github.com/docsplit/
  
  To suggest a feature or report a bug: 
  http://github.com/documentcloud/docsplit/issues/

About

Break Apart Documents into Images, Text, Pages and PDFs

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Ruby 100.0%