21 Commits

Author SHA1 Message Date
Steve Boyd
1a0cd6d6a6 API Stop using deprecated API 2022-11-28 19:20:01 +13:00
Steve Boyd
df8b17ab85 ENH PHP 8.1 compatibility 2022-04-13 13:51:04 +12:00
Guy Sartorelli
d03a9f06e2
DOCS Fix incorrect PHPDoc about what null lifetime means. 2022-03-09 16:03:03 +13:00
Russell Michell
42cc545414 FIX: Fixes #58 We always want $content (an array) passed to implode() 2019-12-16 10:06:55 +13:00
Charlie Bergthaler
242e5a307d FIX Change check for cleanup of temp files only if file is instance of File. 2019-06-26 15:18:31 +12:00
Charlie Bergthaler
a9270d73ad FIX Cleanup temporary file after extracting content in TikaServerTextExtractor and TikaTextExtractor 2019-06-26 15:18:31 +12:00
Martin Hipp
bff5eb2b79
Return version number as string instead of floats so '1.20' does not become 1.2 2019-04-05 09:56:45 +13:00
Robbie Averill
759d92ccb4 FIX Ensure Tika responses are casted as strings, fixes broken unit tests
They can be returned as a stream, but the TikaRestClient response is documented as a string
2019-02-13 11:42:51 +07:00
Ishan Jayamanne
21ed6e0f86 Update isAvailable check to work for identical versions
Tika server reports it's version as "Apache Tika 1.7". Unfortunately, `version_compare` in PHP says that version "1.7" is less than version "1.7.0", meaning that Tika server was incorrectly being ruled out unless you used Tika server version 1.8 (where "1.8" > "1.7.0").

Changing the comparison string to just "1.7" means they match exactly, and therefore `version_compare` will return `0` rather than `-1`.
2019-02-13 11:15:54 +07:00
Robbie Averill
e1e7cdbfa4 FIX Update SolrCellTextExtractor to use a Guzzle 6 API implementation 2018-07-06 16:11:59 +12:00
Robbie Averill
231a2091af FIX Update Guzzle implementations in Tika extractors 2018-07-06 16:11:59 +12:00
Robbie Averill
1b8ea2e451 Remove unused symfony dependencies and FileTextExtractor::get_mime 2018-07-04 16:23:22 +12:00
Robbie Averill
9e8ed243d0 Seperate Tika tests, group them for phpunit, further reduce log level, make Extractors injectable 2018-07-03 17:15:18 +12:00
Robbie Averill
397e7a5d40 API FileTextExtractor::getContent now supports a File and a filename path string 2018-07-03 17:03:47 +12:00
Robbie Averill
5e5a1f05da FIX Reduce log level to prevent it being caught in SilverStripe error handler
See https://github.com/silverstripe/silverstripe-framework/issues/8044 for context
2018-07-03 16:40:40 +12:00
Robbie Averill
6bf932e5f0 FIX unlink call checks that a file exists first, and tests pass a File object 2018-07-03 16:30:05 +12:00
Robbie Averill
edb02e9189 API FileTextExtractable::getContent now takes a File instance instead of a path 2018-07-03 15:55:02 +12:00
Robbie Averill
8bd019b2aa Update codebase to ensure relative PSR-2 compliance 2018-07-03 11:37:38 +12:00
Robbie Averill
66c9db8c0d API Update namespaces for FileTextCache and add upgrader mapping 2018-07-03 11:23:27 +12:00
Russell Michell
912c457c7d FIX: Updated namespace refs for GuzzleHttp (from Guzzle\Http) 2017-12-22 14:34:40 +13:00
Russell Michell
f341010d7a FIX: First-pass SS4 compatibility.
- Added namespaces, use statements
- Added missing docblocks etc
- Uses SS4's new Cache system
- Uses proper environment vars
- Cannot instantiate 'FileTextCache' (interface) as a service. This can be configured through YML, so default to FileTextCache_Cache
- Modded YML config to make it run.
- Fixes to allow TIKA to actually get file contents.
- Addresses issues raised by @robbieaverill
- Rebased against github.com/silverstripe/silverstripe-textextraction:master
- Replaced `SS_Log` with Monolog.
2017-12-21 10:41:06 +13:00