Commit Graph

187 Commits

Author SHA1 Message Date
Jake Dale Ovenden eb7a45865b Allow username and password in requests to Tika server (#35) 2017-11-23 10:24:32 +13:00
Robbie Averill 40ba6a245d
DOCS Fix build badges in readme 2017-11-23 09:52:40 +13:00
Robbie Averill 3d289b4e05 DOCS Add Windows note back into Configuration guide, bump license year 2017-11-23 09:49:05 +13:00
Robbie Averill f8c3015161 Merge pull request #19 from camfindlay/feature/make-supported 2017-11-23 09:39:34 +13:00
Damian Mooyman 23e255b5c6 Merge pull request #34 from jvdanker/disable-cache-fix
Don't try to save the object to the cache if it has been disabled
2017-02-22 15:30:59 +13:00
Juan van den Anker 0761311170 Don't try to save the object to the cache if it has been disabled 2017-02-22 15:17:32 +13:00
Damian Mooyman 1b89000fcd Merge pull request #33 from alwex/master
fixed the version comparison for tika server text extractor
2016-10-19 16:10:05 +13:00
Alexandre Guidet 196007314a fixed the version comparison using version_compare() instead of plain float 2016-10-19 15:46:30 +13:00
Damian Mooyman 545e711f16 Merge pull request #31 from dhensby/pulls/composer-alias
Bumping composer alias
2016-10-04 12:35:58 +13:00
Daniel Hensby 5d24770d79
Bumping composer alias 2016-10-04 00:17:39 +01:00
Damian Mooyman 5a5c648c1e Merge pull request #30 from dhensby/pulls/pdf-extraction
FIX PDFTextExtractor no longer smushes words together than break acro…
2016-10-04 12:13:09 +13:00
Daniel Hensby e9e33605b4
FIX PDFTextExtractor no longer smushes words together than break across lines 2016-10-03 23:59:18 +01:00
Damian Mooyman e0125ba745 Merge pull request #29 from dhensby/pulls/fix-text-extraction-config
FIX UnexpectedValueException thrown when trying to set SolrCellTextEx…
2016-10-04 10:02:58 +13:00
Daniel Hensby aaf9238384
FIX UnexpectedValueException thrown when trying to set SolrCellTextExtraction.base_url in config 2016-10-03 20:19:30 +01:00
Daniel Hensby 61750e33fc Merge pull request #28 from SilbinaryWolf/fix-windowscompat
fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' config is defined
2016-05-14 12:14:31 +01:00
Jake Bentvelzen 75ffe7b56a fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' is defined. Updated documentation to inform the user of this. 2016-05-13 15:07:33 +10:00
Hamish Friedlander bde4cf4536 Merge pull request #27 from tractorcow/pulls/pdfpaths
API Whitelist bin paths for pdftotext
2016-02-25 16:45:56 +13:00
Damian Mooyman f72ba3a978 API Whitelist bin paths for pdftotext 2016-02-25 16:40:25 +13:00
Damian Mooyman 9e44e834cf Merge pull request #26 from helpfulrobot/update-license-year
Updated license year
2016-01-05 11:14:29 +13:00
helpfulrobot 0420d56e4d Updated license year 2016-01-01 06:50:40 +13:00
Daniel Hensby 5a070eb47d Merge pull request #25 from helpfulrobot/add-standard-code-of-conduct
Added standard code of conduct
2015-11-21 12:29:45 +00:00
helpfulrobot 7c45684dbb Added standard code of conduct 2015-11-21 20:17:44 +13:00
Daniel Hensby cdea0f0798 Merge pull request #23 from helpfulrobot/add-standard-license
Added standard license
2015-11-19 12:55:38 +00:00
Daniel Hensby dcd527deb1 Merge pull request #24 from helpfulrobot/add-standard-git-attributes
Added standard git attributes
2015-11-19 10:39:37 +00:00
helpfulrobot 08cc7c37da Added standard git attributes 2015-11-19 19:14:04 +13:00
helpfulrobot df3af6722b Added standard license 2015-11-19 18:32:42 +13:00
Damian Mooyman 1d2a9bc296 Merge pull request #22 from helpfulrobot/add-standard-editor-config
Added standard editor config
2015-11-19 14:05:19 +13:00
helpfulrobot 80a4773cce Added standard editor config 2015-11-19 13:27:10 +13:00
Daniel Hensby ebfa07dc5f Merge pull request #21 from helpfulrobot/convert-to-psr-2
Converted to PSR-2
2015-11-18 23:30:07 +00:00
Daniel Hensby 9cb2a79f8d Merge pull request #20 from helpfulrobot/add-standard-scrutinizer-config
Added standard Scrutinizer config
2015-11-18 12:42:33 +00:00
helpfulrobot 8e14595f1a Converted to PSR-2 2015-11-18 17:07:31 +13:00
helpfulrobot 03de223162 Added standard Scrutinizer config 2015-11-18 15:38:01 +13:00
Daniel Hensby 80f61a21be Merge pull request #18 from assertchris/add-scrutinizer-support
Added Scrutinizer support
2015-11-08 23:57:42 +00:00
Cam Findlay 7b3fb280c6 Add supported module standard docs 2015-11-07 14:06:23 +13:00
Christopher Pitt 4c955bde13 Added Scrutinizer support 2015-11-07 11:22:33 +13:00
Damian Mooyman 1e8581d7f8 Merge pull request #17 from dhensby/patch-1
Move to new travis containerised infrastructure
2015-08-26 12:32:30 +12:00
Daniel Hensby e67fb97672 Move to new travis containerised infrastructure 2015-08-25 15:28:20 +01:00
Damian Mooyman 832437e4bf Merge pull request #15 from kinglozzer/patch-2
FIX: SolrCellTextExtractor always reporting itself as unavailable (fixes #14)
2015-07-02 12:00:48 +12:00
Loz Calver 9ea4b79543 FIX: SolrCellTextExtractor always reporting itself as unavailable (fixes #14) 2015-06-08 12:42:31 +01:00
Damian Mooyman 9b36af2791 Merge pull request #13 from assertchris/use-symfony-mime-type-detection
Using Symfony mime type detection
2015-05-14 11:33:04 +12:00
Christopher Pitt fbc31692e7 Using Symfony mime type detection 2015-05-13 21:36:05 +12:00
Damian Mooyman fd917f04a1 Merge pull request #12 from chillu/pulls/check-file-existence
Check file existence in for_file()
2015-05-12 16:45:35 +12:00
Ingo Schommer da6c554acb Check file existence in for_file()
finfo() will silently fail the whole request (at least on my PHP 5.4 install)
if invoked on a file that doesn't exist, so fail early here.
2015-05-12 16:45:03 +12:00
Ingo Schommer 15f9647bca Merge pull request #11 from tractorcow/pulls/invalidate
API Only invalidate cache when file is changed
2015-05-12 16:08:45 +12:00
Damian Mooyman c9d74f83db API Only invalidate cache when file is changed 2015-05-12 16:01:38 +12:00
Damian Mooyman 6cf09f26c8 Merge pull request #9 from chillu/pulls/tika-logging
Improved Tika error logging
2015-05-12 15:27:08 +12:00
Damian Mooyman 6c7ffa2c6f Merge pull request #10 from chillu/pulls/truncate-db-cache
Truncate FileContentCache by default to avoid SQL query errors
2015-05-12 15:25:59 +12:00
Damian Mooyman 1f4083dda4 BUG Fix incorrect cache key generation 2015-05-12 15:23:14 +12:00
Ingo Schommer 8aca06aef2 Truncate FileContentCache by default to avoid SQL query errors
MySQL has a packet limit of 1MB as a default
(http://dev.mysql.com/doc/refman/5.0/en/packet-too-large.html).
This interferes with the UPDATE queries required
to add file content caches. Since the query can't be terminated
correctly, the whole content will be discarded with a query error.

This change allows to truncate content prior to the UPDATE operation,
and defaults to 500 characters. This leaves some room for multibyte
characters as well as other parts of the SQL query.
2015-05-07 19:14:02 +12:00
Ingo Schommer 72ce8fc0bc Improved Tika error logging 2015-05-07 12:06:59 +12:00