Skip to content

Commit c589c80

Browse files
janhoyepugh
andauthored
SOLR-18037 Remove "local" tika extraction backend from branch_9x (#3980)
Co-authored-by: Eric Pugh <epugh@opensourceconnections.com>
1 parent a4b810e commit c589c80

File tree

211 files changed

+397
-10259
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

211 files changed

+397
-10259
lines changed

NOTICE.txt

Lines changed: 10 additions & 27 deletions
Original file line numberDiff line numberDiff line change
@@ -99,6 +99,10 @@ This project includes the Malihu Custom Scrollbar Plugin
9999
Copyright (c) Manos Malihutsakis, http://manos.malihu.gr/
100100
License: MIT https://github.com/malihu/malihu-custom-scrollbar-plugin/blob/master/LICENSE.txt
101101

102+
This project includes encryption software "bouncy castle".
103+
Copyright (c) 2000-2006 The Legion Of The Bouncy Castle
104+
(http://www.bouncycastle.org)
105+
102106
=========================================================================
103107
== Antlr2 Notice ==
104108
=========================================================================
@@ -403,39 +407,18 @@ OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
403407
WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
404408

405409
=========================================================================
406-
== Apache Tika Notices ==
410+
== Extraction Module Notices ==
407411
=========================================================================
408412

409413
The following notices apply to modules/extraction:
410414

411-
This product includes software developed by the following copyright owners:
412-
413-
Copyright (c) 2000-2006 The Legion Of The Bouncy Castle
414-
(http://www.bouncycastle.org)
415-
416-
Copyright (c) 2003-2005, www.pdfbox.org
417-
418-
Copyright (c) 2003-2005, www.fontbox.org
419-
420-
Copyright (c) 1995-2005 International Business Machines Corporation and others
421-
422-
Copyright 2001-2005 (C) MetaStuff, Ltd. All Rights Reserved.
423-
424-
Copyright 2004 Sun Microsystems, Inc. (Rome JAR)
425-
426-
Copyright 2002-2008 by John Cowan (TagSoup -- http://ccil.org/~cowan/XML/tagsoup/)
427-
428-
Copyright (C) 1994-2007 by the Xiph.org Foundation, http://www.xiph.org/ (OggVorbis)
429-
430-
Copyright 2012 Kohei Taketa juniversalchardet (http://code.google.com/p/juniversalchardet/)
431-
432-
Lasse Collin and others, XZ for Java (http://tukaani.org/xz/java.html)
415+
This product includes Apache Tika Core.
433416

434-
java-libpst is a pure java library for the reading of Outlook PST and OST files.
435-
https://github.com/rjohnsondev/java-libpst
417+
Apache Tika
418+
Copyright 2007-2024 The Apache Software Foundation
436419

437-
JMatIO is a JAVA library to read/write/manipulate with Matlab binary MAT-files.
438-
http://www.sourceforge.net/projects/jmatio
420+
Apache POI
421+
Copyright 2003-2025 The Apache Software Foundation
439422

440423
=========================================================================
441424
== Language Detection Notices ==
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
title: Removed LocalTikaExtractionBackend from the extraction module (SolrCell). Extraction
2+
using a remote Tika Server is now the only and default option. Tika-core is upgraded
3+
to v3.2.3 and still used for some SAX parsing
4+
type: removed
5+
authors:
6+
- name: Jan Høydahl
7+
links:
8+
- name: SOLR-18037
9+
url: https://issues.apache.org/jira/browse/SOLR-18037

solr/licenses/SparseBitSet-1.2.jar.sha1

Lines changed: 0 additions & 1 deletion
This file was deleted.
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
533eac055afe3d5f614ea95e333afd6c2bde8f26

solr/licenses/apache-mime4j-core-0.8.4.jar.sha1

Lines changed: 0 additions & 1 deletion
This file was deleted.

solr/licenses/apache-mime4j-core-LICENSE-ASL.txt

Lines changed: 0 additions & 201 deletions
This file was deleted.

solr/licenses/apache-mime4j-core-NOTICE.txt

Lines changed: 0 additions & 13 deletions
This file was deleted.

solr/licenses/apache-mime4j-dom-0.8.4.jar.sha1

Lines changed: 0 additions & 1 deletion
This file was deleted.

0 commit comments

Comments
 (0)