Commit Graph

86 Commits

Author SHA1 Message Date
Jake Dale Ovenden
eb7a45865b Allow username and password in requests to Tika server (#35) 2017-11-23 10:24:32 +13:00
Robbie Averill
40ba6a245d
DOCS Fix build badges in readme 2017-11-23 09:52:40 +13:00
Robbie Averill
3d289b4e05 DOCS Add Windows note back into Configuration guide, bump license year 2017-11-23 09:49:05 +13:00
Robbie Averill
f8c3015161 Merge pull request #19 from camfindlay/feature/make-supported 2017-11-23 09:39:34 +13:00
Damian Mooyman
23e255b5c6 Merge pull request #34 from jvdanker/disable-cache-fix
Don't try to save the object to the cache if it has been disabled
2017-02-22 15:30:59 +13:00
Juan van den Anker
0761311170 Don't try to save the object to the cache if it has been disabled 2017-02-22 15:17:32 +13:00
Damian Mooyman
1b89000fcd Merge pull request #33 from alwex/master
fixed the version comparison for tika server text extractor
2016-10-19 16:10:05 +13:00
Alexandre Guidet
196007314a fixed the version comparison using version_compare() instead of plain float 2016-10-19 15:46:30 +13:00
Damian Mooyman
545e711f16 Merge pull request #31 from dhensby/pulls/composer-alias
Bumping composer alias
2016-10-04 12:35:58 +13:00
Daniel Hensby
5d24770d79
Bumping composer alias 2016-10-04 00:17:39 +01:00
Damian Mooyman
5a5c648c1e Merge pull request #30 from dhensby/pulls/pdf-extraction
FIX PDFTextExtractor no longer smushes words together than break acro…
2016-10-04 12:13:09 +13:00
Daniel Hensby
e9e33605b4
FIX PDFTextExtractor no longer smushes words together than break across lines 2016-10-03 23:59:18 +01:00
Damian Mooyman
e0125ba745 Merge pull request #29 from dhensby/pulls/fix-text-extraction-config
FIX UnexpectedValueException thrown when trying to set SolrCellTextEx…
2016-10-04 10:02:58 +13:00
Daniel Hensby
aaf9238384
FIX UnexpectedValueException thrown when trying to set SolrCellTextExtraction.base_url in config 2016-10-03 20:19:30 +01:00
Daniel Hensby
61750e33fc Merge pull request #28 from SilbinaryWolf/fix-windowscompat
fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' config is defined
2016-05-14 12:14:31 +01:00
Jake Bentvelzen
75ffe7b56a fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' is defined. Updated documentation to inform the user of this. 2016-05-13 15:07:33 +10:00
Hamish Friedlander
bde4cf4536 Merge pull request #27 from tractorcow/pulls/pdfpaths
API Whitelist bin paths for pdftotext
2016-02-25 16:45:56 +13:00
Damian Mooyman
f72ba3a978 API Whitelist bin paths for pdftotext 2016-02-25 16:40:25 +13:00
Damian Mooyman
9e44e834cf Merge pull request #26 from helpfulrobot/update-license-year
Updated license year
2016-01-05 11:14:29 +13:00
helpfulrobot
0420d56e4d Updated license year 2016-01-01 06:50:40 +13:00
Daniel Hensby
5a070eb47d Merge pull request #25 from helpfulrobot/add-standard-code-of-conduct
Added standard code of conduct
2015-11-21 12:29:45 +00:00
helpfulrobot
7c45684dbb Added standard code of conduct 2015-11-21 20:17:44 +13:00
Daniel Hensby
cdea0f0798 Merge pull request #23 from helpfulrobot/add-standard-license
Added standard license
2015-11-19 12:55:38 +00:00
Daniel Hensby
dcd527deb1 Merge pull request #24 from helpfulrobot/add-standard-git-attributes
Added standard git attributes
2015-11-19 10:39:37 +00:00
helpfulrobot
08cc7c37da Added standard git attributes 2015-11-19 19:14:04 +13:00
helpfulrobot
df3af6722b Added standard license 2015-11-19 18:32:42 +13:00
Damian Mooyman
1d2a9bc296 Merge pull request #22 from helpfulrobot/add-standard-editor-config
Added standard editor config
2015-11-19 14:05:19 +13:00
helpfulrobot
80a4773cce Added standard editor config 2015-11-19 13:27:10 +13:00
Daniel Hensby
ebfa07dc5f Merge pull request #21 from helpfulrobot/convert-to-psr-2
Converted to PSR-2
2015-11-18 23:30:07 +00:00
Daniel Hensby
9cb2a79f8d Merge pull request #20 from helpfulrobot/add-standard-scrutinizer-config
Added standard Scrutinizer config
2015-11-18 12:42:33 +00:00
helpfulrobot
8e14595f1a Converted to PSR-2 2015-11-18 17:07:31 +13:00
helpfulrobot
03de223162 Added standard Scrutinizer config 2015-11-18 15:38:01 +13:00
Daniel Hensby
80f61a21be Merge pull request #18 from assertchris/add-scrutinizer-support
Added Scrutinizer support
2015-11-08 23:57:42 +00:00
Cam Findlay
7b3fb280c6 Add supported module standard docs 2015-11-07 14:06:23 +13:00
Christopher Pitt
4c955bde13 Added Scrutinizer support 2015-11-07 11:22:33 +13:00
Damian Mooyman
1e8581d7f8 Merge pull request #17 from dhensby/patch-1
Move to new travis containerised infrastructure
2015-08-26 12:32:30 +12:00
Daniel Hensby
e67fb97672 Move to new travis containerised infrastructure 2015-08-25 15:28:20 +01:00
Damian Mooyman
832437e4bf Merge pull request #15 from kinglozzer/patch-2
FIX: SolrCellTextExtractor always reporting itself as unavailable (fixes #14)
2015-07-02 12:00:48 +12:00
Loz Calver
9ea4b79543 FIX: SolrCellTextExtractor always reporting itself as unavailable (fixes #14) 2015-06-08 12:42:31 +01:00
Damian Mooyman
9b36af2791 Merge pull request #13 from assertchris/use-symfony-mime-type-detection
Using Symfony mime type detection
2015-05-14 11:33:04 +12:00
Christopher Pitt
fbc31692e7 Using Symfony mime type detection 2015-05-13 21:36:05 +12:00
Damian Mooyman
fd917f04a1 Merge pull request #12 from chillu/pulls/check-file-existence
Check file existence in for_file()
2015-05-12 16:45:35 +12:00
Ingo Schommer
da6c554acb Check file existence in for_file()
finfo() will silently fail the whole request (at least on my PHP 5.4 install)
if invoked on a file that doesn't exist, so fail early here.
2015-05-12 16:45:03 +12:00
Ingo Schommer
15f9647bca Merge pull request #11 from tractorcow/pulls/invalidate
API Only invalidate cache when file is changed
2015-05-12 16:08:45 +12:00
Damian Mooyman
c9d74f83db API Only invalidate cache when file is changed 2015-05-12 16:01:38 +12:00
Damian Mooyman
6cf09f26c8 Merge pull request #9 from chillu/pulls/tika-logging
Improved Tika error logging
2015-05-12 15:27:08 +12:00
Damian Mooyman
6c7ffa2c6f Merge pull request #10 from chillu/pulls/truncate-db-cache
Truncate FileContentCache by default to avoid SQL query errors
2015-05-12 15:25:59 +12:00
Damian Mooyman
1f4083dda4 BUG Fix incorrect cache key generation 2015-05-12 15:23:14 +12:00
Ingo Schommer
8aca06aef2 Truncate FileContentCache by default to avoid SQL query errors
MySQL has a packet limit of 1MB as a default
(http://dev.mysql.com/doc/refman/5.0/en/packet-too-large.html).
This interferes with the UPDATE queries required
to add file content caches. Since the query can't be terminated
correctly, the whole content will be discarded with a query error.

This change allows to truncate content prior to the UPDATE operation,
and defaults to 500 characters. This leaves some room for multibyte
characters as well as other parts of the SQL query.
2015-05-07 19:14:02 +12:00
Ingo Schommer
72ce8fc0bc Improved Tika error logging 2015-05-07 12:06:59 +12:00