Commit Graph

74 Commits

Author SHA1 Message Date
Daniel Hensby
e9e33605b4
FIX PDFTextExtractor no longer smushes words together than break across lines 2016-10-03 23:59:18 +01:00
Damian Mooyman
e0125ba745 Merge pull request #29 from dhensby/pulls/fix-text-extraction-config
FIX UnexpectedValueException thrown when trying to set SolrCellTextEx…
2016-10-04 10:02:58 +13:00
Daniel Hensby
aaf9238384
FIX UnexpectedValueException thrown when trying to set SolrCellTextExtraction.base_url in config 2016-10-03 20:19:30 +01:00
Daniel Hensby
61750e33fc Merge pull request #28 from SilbinaryWolf/fix-windowscompat
fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' config is defined
2016-05-14 12:14:31 +01:00
Jake Bentvelzen
75ffe7b56a fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' is defined. Updated documentation to inform the user of this. 2016-05-13 15:07:33 +10:00
Hamish Friedlander
bde4cf4536 Merge pull request #27 from tractorcow/pulls/pdfpaths
API Whitelist bin paths for pdftotext
2016-02-25 16:45:56 +13:00
Damian Mooyman
f72ba3a978 API Whitelist bin paths for pdftotext 2016-02-25 16:40:25 +13:00
Damian Mooyman
9e44e834cf Merge pull request #26 from helpfulrobot/update-license-year
Updated license year
2016-01-05 11:14:29 +13:00
helpfulrobot
0420d56e4d Updated license year 2016-01-01 06:50:40 +13:00
Daniel Hensby
5a070eb47d Merge pull request #25 from helpfulrobot/add-standard-code-of-conduct
Added standard code of conduct
2015-11-21 12:29:45 +00:00
helpfulrobot
7c45684dbb Added standard code of conduct 2015-11-21 20:17:44 +13:00
Daniel Hensby
cdea0f0798 Merge pull request #23 from helpfulrobot/add-standard-license
Added standard license
2015-11-19 12:55:38 +00:00
Daniel Hensby
dcd527deb1 Merge pull request #24 from helpfulrobot/add-standard-git-attributes
Added standard git attributes
2015-11-19 10:39:37 +00:00
helpfulrobot
08cc7c37da Added standard git attributes 2015-11-19 19:14:04 +13:00
helpfulrobot
df3af6722b Added standard license 2015-11-19 18:32:42 +13:00
Damian Mooyman
1d2a9bc296 Merge pull request #22 from helpfulrobot/add-standard-editor-config
Added standard editor config
2015-11-19 14:05:19 +13:00
helpfulrobot
80a4773cce Added standard editor config 2015-11-19 13:27:10 +13:00
Daniel Hensby
ebfa07dc5f Merge pull request #21 from helpfulrobot/convert-to-psr-2
Converted to PSR-2
2015-11-18 23:30:07 +00:00
Daniel Hensby
9cb2a79f8d Merge pull request #20 from helpfulrobot/add-standard-scrutinizer-config
Added standard Scrutinizer config
2015-11-18 12:42:33 +00:00
helpfulrobot
8e14595f1a Converted to PSR-2 2015-11-18 17:07:31 +13:00
helpfulrobot
03de223162 Added standard Scrutinizer config 2015-11-18 15:38:01 +13:00
Daniel Hensby
80f61a21be Merge pull request #18 from assertchris/add-scrutinizer-support
Added Scrutinizer support
2015-11-08 23:57:42 +00:00
Christopher Pitt
4c955bde13 Added Scrutinizer support 2015-11-07 11:22:33 +13:00
Damian Mooyman
1e8581d7f8 Merge pull request #17 from dhensby/patch-1
Move to new travis containerised infrastructure
2015-08-26 12:32:30 +12:00
Daniel Hensby
e67fb97672 Move to new travis containerised infrastructure 2015-08-25 15:28:20 +01:00
Damian Mooyman
832437e4bf Merge pull request #15 from kinglozzer/patch-2
FIX: SolrCellTextExtractor always reporting itself as unavailable (fixes #14)
2015-07-02 12:00:48 +12:00
Loz Calver
9ea4b79543 FIX: SolrCellTextExtractor always reporting itself as unavailable (fixes #14) 2015-06-08 12:42:31 +01:00
Damian Mooyman
9b36af2791 Merge pull request #13 from assertchris/use-symfony-mime-type-detection
Using Symfony mime type detection
2015-05-14 11:33:04 +12:00
Christopher Pitt
fbc31692e7 Using Symfony mime type detection 2015-05-13 21:36:05 +12:00
Damian Mooyman
fd917f04a1 Merge pull request #12 from chillu/pulls/check-file-existence
Check file existence in for_file()
2015-05-12 16:45:35 +12:00
Ingo Schommer
da6c554acb Check file existence in for_file()
finfo() will silently fail the whole request (at least on my PHP 5.4 install)
if invoked on a file that doesn't exist, so fail early here.
2015-05-12 16:45:03 +12:00
Ingo Schommer
15f9647bca Merge pull request #11 from tractorcow/pulls/invalidate
API Only invalidate cache when file is changed
2015-05-12 16:08:45 +12:00
Damian Mooyman
c9d74f83db API Only invalidate cache when file is changed 2015-05-12 16:01:38 +12:00
Damian Mooyman
6cf09f26c8 Merge pull request #9 from chillu/pulls/tika-logging
Improved Tika error logging
2015-05-12 15:27:08 +12:00
Damian Mooyman
6c7ffa2c6f Merge pull request #10 from chillu/pulls/truncate-db-cache
Truncate FileContentCache by default to avoid SQL query errors
2015-05-12 15:25:59 +12:00
Damian Mooyman
1f4083dda4 BUG Fix incorrect cache key generation 2015-05-12 15:23:14 +12:00
Ingo Schommer
8aca06aef2 Truncate FileContentCache by default to avoid SQL query errors
MySQL has a packet limit of 1MB as a default
(http://dev.mysql.com/doc/refman/5.0/en/packet-too-large.html).
This interferes with the UPDATE queries required
to add file content caches. Since the query can't be terminated
correctly, the whole content will be discarded with a query error.

This change allows to truncate content prior to the UPDATE operation,
and defaults to 500 characters. This leaves some room for multibyte
characters as well as other parts of the SQL query.
2015-05-07 19:14:02 +12:00
Ingo Schommer
72ce8fc0bc Improved Tika error logging 2015-05-07 12:06:59 +12:00
Christopher Pitt
adb71a7823 Merge pull request #8 from tractorcow/pulls/lock-dispatcher
Specify known-working version of stable dependency for php 5.3.3
2015-05-06 13:52:51 +12:00
Damian Mooyman
3ffb303a0b Specify known-working version of stable dependency for php 5.3.3 2015-05-06 13:47:17 +12:00
Ingo Schommer
62637c6197 Merge pull request #7 from tractorcow/pulls/2.0/cache-options
Provide alternative backends for caching of extracted content
2015-05-05 18:21:08 +12:00
Damian Mooyman
98fd4228f9 Provide alternative backends for caching of extracted content
Implement Flushable for clearing the cache
2015-05-05 17:22:45 +12:00
Ingo Schommer
98a83a5bca Clarified Tika docs 2015-04-30 11:39:11 +12:00
Ingo Schommer
1224f0939d Improved Tika docs 2015-04-29 11:59:34 +12:00
Damian Mooyman
fb70c1dd50 Merge pull request #6 from assertchris/php-5-3-compat
Downgraded Guzzle version
2015-03-05 14:11:11 +13:00
Christopher Pitt
b7488577ad Downgraded Guzzle version 2015-03-05 13:57:31 +13:00
Ingo Schommer
4400443163 Small spelling fixes 2015-02-26 23:11:31 +13:00
Ingo Schommer
c813d234f0 Merge pull request #5 from tractorcow/pulls/tika-support
API Support tika server
2015-02-26 22:50:36 +13:00
Damian Mooyman
1ad9e46727 API Support tika server 2015-02-25 17:55:41 +13:00
Ingo Schommer
23d83b7d01 Merge pull request #4 from tractorcow/pulls/tika-support
API Implement Tika support
2015-02-23 10:52:34 +13:00