Commit Graph

9 Commits

Author SHA1 Message Date
Daniel Hensby
e9e33605b4
FIX PDFTextExtractor no longer smushes words together than break across lines 2016-10-03 23:59:18 +01:00
Jake Bentvelzen
75ffe7b56a fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' is defined. Updated documentation to inform the user of this. 2016-05-13 15:07:33 +10:00
Damian Mooyman
f72ba3a978 API Whitelist bin paths for pdftotext 2016-02-25 16:40:25 +13:00
helpfulrobot
8e14595f1a Converted to PSR-2 2015-11-18 17:07:31 +13:00
Damian Mooyman
2977f85cb5 API Implement Tika support
API Implement support for detection via mime-type as well as file extension
API Implement FileContent property for safe usage in templates
API instead of returning the list of extensions / mime types supported, support is determined on a per-file bases
Marking dev-master as version 2.0 as this contains breaking changes
2015-02-20 15:12:20 +13:00
Ingo Schommer
733644d6bb Better shell execution feedback from PDF extractor 2012-08-27 11:31:53 +02:00
Ingo Schommer
f3fcf60c0f FileTextExtractor->isAvailable() 2012-08-22 18:25:55 +02:00
Ingo Schommer
977c4e49c9 API Using paths instead of File objects in extractors
Makes coupling to File objects optional, by choosing
to use the FileTextExtractable extension.
2012-08-22 18:25:12 +02:00
Ingo Schommer
ec0921c6d1 Initial commit 2012-08-22 17:52:08 +02:00