Commit Graph

26 Commits

Author SHA1 Message Date
Steve Boyd e8997870c5 MNT Standardise modules 2022-08-01 16:23:36 +12:00
Steve Boyd e5bf4f1322 MNT Use GitHub Actions CI 2022-07-05 19:08:18 +12:00
Steve Boyd 795abde8f1
Update build status badge 2021-01-21 16:43:07 +13:00
Robbie Averill 300941c9e8 Update readme badges and requirements for SilverStripe 4 2018-07-03 10:47:56 +12:00
Dylan Wagstaff d06569c8fd Add supported module badge to readme 2018-06-15 17:50:30 +12:00
Robbie Averill 40ba6a245d
DOCS Fix build badges in readme 2017-11-23 09:52:40 +13:00
Robbie Averill 3d289b4e05 DOCS Add Windows note back into Configuration guide, bump license year 2017-11-23 09:49:05 +13:00
Robbie Averill f8c3015161 Merge pull request #19 from camfindlay/feature/make-supported 2017-11-23 09:39:34 +13:00
Jake Bentvelzen 75ffe7b56a fix(PDFTextExtractor): Added support for Windows, but only if 'binary_location' is defined. Updated documentation to inform the user of this. 2016-05-13 15:07:33 +10:00
Cam Findlay 7b3fb280c6 Add supported module standard docs 2015-11-07 14:06:23 +13:00
Damian Mooyman 6cf09f26c8 Merge pull request #9 from chillu/pulls/tika-logging
Improved Tika error logging
2015-05-12 15:27:08 +12:00
Ingo Schommer 8aca06aef2 Truncate FileContentCache by default to avoid SQL query errors
MySQL has a packet limit of 1MB as a default
(http://dev.mysql.com/doc/refman/5.0/en/packet-too-large.html).
This interferes with the UPDATE queries required
to add file content caches. Since the query can't be terminated
correctly, the whole content will be discarded with a query error.

This change allows to truncate content prior to the UPDATE operation,
and defaults to 500 characters. This leaves some room for multibyte
characters as well as other parts of the SQL query.
2015-05-07 19:14:02 +12:00
Ingo Schommer 72ce8fc0bc Improved Tika error logging 2015-05-07 12:06:59 +12:00
Damian Mooyman 98fd4228f9 Provide alternative backends for caching of extracted content
Implement Flushable for clearing the cache
2015-05-05 17:22:45 +12:00
Ingo Schommer 98a83a5bca Clarified Tika docs 2015-04-30 11:39:11 +12:00
Ingo Schommer 1224f0939d Improved Tika docs 2015-04-29 11:59:34 +12:00
Ingo Schommer 4400443163 Small spelling fixes 2015-02-26 23:11:31 +13:00
Damian Mooyman 1ad9e46727 API Support tika server 2015-02-25 17:55:41 +13:00
Damian Mooyman 2977f85cb5 API Implement Tika support
API Implement support for detection via mime-type as well as file extension
API Implement FileContent property for safe usage in templates
API instead of returning the list of extensions / mime types supported, support is determined on a per-file bases
Marking dev-master as version 2.0 as this contains breaking changes
2015-02-20 15:12:20 +13:00
Sam Minnee e56bdf5e27 Made readme example less specific 2014-02-18 10:28:02 +13:00
Ingo Schommer 49316d99ff Travis support 2013-05-07 21:49:32 +02:00
Ingo Schommer 24a055a741 More docs on how to use extraction with Solr 2013-05-07 20:14:01 +02:00
Ingo Schommer b86483abc4 3.1 compat 2013-05-07 18:47:56 +02:00
Ingo Schommer 9af389f51b NEW SolrCellTextExtractor 2013-02-01 15:35:16 +01:00
Ingo Schommer 847a4e0694 Updated README 2012-08-22 23:22:46 +02:00
Ingo Schommer ec0921c6d1 Initial commit 2012-08-22 17:52:08 +02:00