Skip to content
This repository was archived by the owner on Mar 13, 2023. It is now read-only.

Commit d03722f

Browse files
committed
Merge branch 'release-staging/1.6.0' into release
2 parents 43628ea + 68e4aaf commit d03722f

File tree

570 files changed

+24741
-17495
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

570 files changed

+24741
-17495
lines changed

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,3 +9,4 @@ target/
99
*.orig
1010
./distribution/src/main/resources/logs/biomedicus.log
1111
distribution/src/main/resources/logs/biomedicus.log
12+
~$*

Pipeline.pptx

46.9 KB
Binary file not shown.

README.md

Lines changed: 8 additions & 193 deletions
Original file line numberDiff line numberDiff line change
@@ -10,198 +10,18 @@ This is a collaborative project that aims to serve biomedical and clinical resea
1010
with different texts.
1111

1212

13-
Install Guide
13+
Wiki
1414
========================================================================================================================
1515

16-
Along with the source code, we also release distribution of all the files required to immediately start running. You can
17-
download this file and appropriate model files at [http://athena.ahc.umn.edu/biomedicus-downloads/](http://athena.ahc.umn.edu/biomedicus-downloads/).
16+
[Our wiki on GitHub](https://github.com/nlpie/biomedicus/wiki) contains information about installation, configuration,
17+
use, and development of BioMedICUS.
1818

19-
Prerequisites
20-
------------------------------------------------------------------------------------------------------------------------
21-
22-
The following 3rd-party tools need to be installed to download, compile and run BioMedICUS:
23-
24-
1. Java SE Development Kit 8: [download](http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html)
25-
26-
2. Git: [download](http://git-scm.com) \(optional, only required for checking out the project source code\)
27-
28-
3. Maven: [download](http://maven.apache.org) \(optional, only required for building the project from source\)
29-
30-
Environment Variables
31-
------------------------------------------------------------------------------------------------------------------------
32-
33-
The following environment variables need to be set:
34-
35-
1. JAVA_HOME:
36-
37-
* On OS X, after installing Java SE Development Kit, you can find the path to the installation by typing this
38-
command:
39-
40-
/usr/libexec/java_home
41-
42-
* Open the .bash_profile file in your favorite text editor and add the following statement:
43-
44-
export JAVA_HOME="path-to-java-home-from-previous-step"
45-
46-
* Save the .bash_profile, close it and then run the following:
47-
48-
source .bash_profile
49-
50-
* Try 'echo $JAVA_HOME' to make sure the environment is pointing to the right Java version (currently 1.8.0)
51-
52-
53-
Getting BioMedICUS
54-
------------------------------------------------------------------------------------------------------------------------
55-
56-
If you wish to compile BioMedICUS from source, instead of using the prepackaged version above, the following
57-
instructions are provided.
58-
59-
At a command prompt, change directories to where you want to install BioMedICUS, and then enter:
60-
61-
git clone https://github.com/nlpie/biomedicus.git
62-
63-
Notes:
64-
65-
* FYI: This clone command will create a subdirectory called 'biomedicus' in the current working directory and create
66-
a git repository under that subdirectory.
67-
68-
69-
Downloading Large Model Files
70-
------------------------------------------------------------------------------------------------------------------------
71-
72-
After having checked out biomedicus from git using the steps above, you will need to download the models file for the
73-
version of biomedicus you are running. You can download the data zip at
74-
[http://athena.ahc.umn.edu/biomedicus-downloads/](http://athena.ahc.umn.edu/biomedicus-downloads/).
75-
There are two options, the first is an open data set which is freely licensed UMLS data, the second zip is a umls data
76-
set which uses SNOMED data and requires you to log in to a NIH UTS account to verify your UMLS Metathesaurus License.
77-
After extracting, set the BIOMEDICUS_JAVA_OPTS java parameter biomedicus.paths.data to the location of the
78-
extracted data. If you are using a release version of BioMedICUS, you may instead set the paths.data configuration
79-
property in config/biomedicusConfiguration.yml to the location of the BioMedICUS data.
80-
81-
export BIOMEDICUS_JAVA_OPTS=-Dbiomedicus.paths.data=[location of data]
82-
83-
84-
Building BioMedICUS
85-
------------------------------------------------------------------------------------------------------------------------
86-
87-
After cloning completes, run the following to build BioMedICUS
88-
89-
cd biomedicus
90-
mvn clean install
91-
92-
93-
Running BioMedICUS
94-
------------------------------------------------------------------------------------------------------------------------
95-
96-
Then maven install phase will create a "release" folder in the root directory of BioMedICUS. This root directory
97-
contains all resources necessary to run biomedicus.
98-
99-
* config: configuration files such as .properties
100-
101-
* **desc: UIMA descriptor files**
102-
103-
- ae - location for pipelines/aggregates of multiple annotators
104-
- ae/annotator - standard location for annotators
105-
- ae/writer - location for writers to output analyzed documents in a specific format
106-
- ae/util - utility analysis engines
107-
- cr - location for Collection Readers, to read in documents
108-
- tools - descriptors for analysis engines in the tools module.
109-
110-
* lib: Compiled and packaged modules of biomedicus and third-party libraries necessary to run biomedicus
111-
112-
* bin: scripts for running biomedicus tools or uima tools with the correct classpath for biomedicus
113-
114-
- AV.sh run the UIMA Annotation Viewer.
115-
[UIMA help](https://uima.apache.org/d/uimaj-2.6.0/tools.html#ugr.tools.annotation_viewer)
116-
- CPM.sh run the UIMA Collection Processing Engine Configurator.
117-
[UIMA help](https://uima.apache.org/d/uimaj-2.6.0/tools.html#ugr.tools.cpe)
118-
- CVD.sh run the UIMA CAS Visual Debugger.
119-
[UIMA help](https://uima.apache.org/d/uimaj-2.6.0/tools.html#ugr.tools.cvd)
120-
- runClass.sh run any executable java class within the biomedicus classpath
121-
- runCPE.sh run a cpe desciptor directly without opening the visual interface
122-
123-
After successful installation is completed, the simplest way to start is to use the default annotation pipeline with a
124-
UIMA provided graphical user interface - Collection Processing Manager (CPM). The simplest format for the input files is
125-
plain text; however, our system also currently has collection readers defined for certain XML formats as well as text
126-
stored in a database. Here, we will show how to run BioMedICUS on a collection of plain text files. To run the pipeline,
127-
first, you will need to create a local directory containing only the files to be processed
128-
(e.g., ./examples/plaintext/input) and move the files to be processed into the newly created directory. We would also
129-
recommend creating an output directory in a nearby location that will contain the output of the pipeline
130-
(e.g., ./examples/plaintext/output).
131-
132-
Next, navigate to the "bin" directory under the BioMedICUS installation directory:
133-
134-
cd ./release/bin
135-
136-
Next, run the CPM.sh shell script:
137-
138-
./CPM.sh
139-
140-
If you have a cpe descriptor file already created, you can load the CPM descriptor by browsing to the "File" menu in the
141-
CPM and selecting "Open CPE Descriptor." From there a file dialog will popup and you can browse to your CPE Descriptor
142-
file.
143-
144-
You can also run a CPE directly without launching the UI by using the runCPE.sh script:
145-
146-
./runCPE.sh /path/to/CpeDescriptor.xml
147-
148-
To make it easier to start for users new to BioMedICUS and UIMA, we have included an "examples/plaintext" folder with
149-
the BioMedICUS distribution. This folder contains an "input" and "output" subfolders. The ./examples/plaintext/input
150-
folder has a few plain ASCII text files. The ./examples/plaintext/ folder also contains a predefined
151-
PlainTextCPM_Example.launch descriptor file that can be used to start the CPM or be loaded after the CPM has been
152-
started. This launch configuration has relative paths defined - relative to the current working directory from which
153-
CPM.sh is started. So, the best way to use it is to first switch to the "scripts" directory. Try the following:
154-
155-
cd ./release/scripts
156-
./CPM.sh
157-
158-
Follow the instructions for opening a CPE Descriptor and select
159-
${BIOMEDICUS_HOME}/release/examples/plaintext/PlainTextCPM_Example.xml
160-
161-
After the process completes, you can view the results with the Annotation Viewer (AV.sh) like this:
162-
163-
./AV.sh
164-
165-
When AV starts, you can point it to the output directory ../examples/plaintext/output. You will also need to point AV to
166-
the type system that was used by the BioMedICUS pipeline, the type system is outputted to the same directory as the XMI
167-
files.
168-
169-
170-
171-
IntelliJ IDEA Setup (Recommended)
172-
------------------------------------------------------------------------------------------------------------------------
173-
174-
IntelliJ is Java IDE with a Community Edition licensed under the Apache 2.0 license developed by JetBrains S.R.O. You
175-
can find it [here](https://www.jetbrains.com/idea/)
176-
177-
1. Select Import Project
178-
2. Browse to the folder that you cloned the biomedicus code into
179-
3. Select 'Import from external model' and 'Maven'
180-
181-
182-
Eclipse Setup
183-
------------------------------------------------------------------------------------------------------------------------
184-
185-
Prerequisites: Install m2e. [link](http://eclipse.org/m2e/). Recent versions of Eclipse have maven included.
186-
187-
1. Open eclipse with a workspace folder that is not the BioMedICUS root directory.
188-
2. Right click in the package explorer and select 'Import'
189-
3. Expand the 'Maven' folder and select 'Existing Maven Projects'
190-
4. Select the root directory of the BioMedICUS project using the 'Browse' button
191-
5. Confirm that 'pom.xml' and a list of modules appear in the 'Projects' pane
192-
6. Select 'Finish'
193-
7. Under the 'Biomedicus' project in the 'developer' folder there are .launch files for the Annotation Viewer and the
194-
Collection Processing Engine Configurator, they should be automatically included in the Run and Debug favorites.
195-
196-
197-
Changing settings
198-
------------------------------------------------------------------------------------------------------------------------
199-
200-
UIMA defines an annotator's settings in an XML descriptor file. These files are located in the
201-
release/desc folder. To change the behavior of any of the annotators,
202-
the available settings can be changed in the files within that folder.
19+
Contact and Support
20+
========================================================================================================================
20321

204-
The config/biomedicus.properties contains some system-wide per-launch constants such as the location of data models.
22+
BioMedICUS has a [Google Group](https://groups.google.com/a/umn.edu/forum/#!forum/biomedicus) for contacting developers
23+
with questions, suggestions or feedback. Also, bug reports and feature suggestions can be submitted to the Issues tab on
24+
GitHub.
20525

20626
About Us
20727
========================================================================================================================
@@ -215,11 +35,6 @@ with assistance from the
21535
Other Resources
21636
========================================================================================================================
21737

218-
### BioMedICUS
219-
220-
* [Demo](http://athena.ahc.umn.edu/biomedicus/)
221-
* [Source Code](https://github.org/nlpie/biomedicus)
222-
22338
### NLP-TAB
22439

22540
* [Demo](http://athena.ahc.umn.edu/nlptab)

SPECIALIST NLP Tools Terms and Conditions.txt

Lines changed: 0 additions & 27 deletions
This file was deleted.

biomedicus-core/pom.xml

Lines changed: 34 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -19,27 +19,57 @@
1919
<parent>
2020
<artifactId>biomedicus</artifactId>
2121
<groupId>edu.umn.biomedicus</groupId>
22-
<version>1.5.0</version>
22+
<version>1.6.0</version>
2323
</parent>
2424
<modelVersion>4.0.0</modelVersion>
2525

2626
<artifactId>biomedicus-core</artifactId>
2727

28+
<properties>
29+
<jackson.version>2.8.3</jackson.version>
30+
</properties>
31+
2832
<dependencies>
2933
<dependency>
3034
<groupId>com.google.inject</groupId>
3135
<artifactId>guice</artifactId>
32-
<version>4.0</version>
36+
<version>4.1.0</version>
3337
</dependency>
3438
<dependency>
3539
<groupId>com.google.inject.extensions</groupId>
3640
<artifactId>guice-multibindings</artifactId>
37-
<version>4.0</version>
41+
<version>4.1.0</version>
3842
</dependency>
3943
<dependency>
4044
<groupId>org.yaml</groupId>
4145
<artifactId>snakeyaml</artifactId>
42-
<version>1.15</version>
46+
<version>1.17</version>
47+
</dependency>
48+
<dependency>
49+
<groupId>org.mapdb</groupId>
50+
<artifactId>mapdb</artifactId>
51+
<version>3.0.4</version>
52+
</dependency>
53+
<dependency>
54+
<groupId>args4j</groupId>
55+
<artifactId>args4j</artifactId>
56+
<version>2.33</version>
57+
</dependency>
58+
<dependency>
59+
<groupId>com.fasterxml.jackson.core</groupId>
60+
<artifactId>jackson-databind</artifactId>
61+
<version>${jackson.version}</version>
62+
</dependency>
63+
<dependency>
64+
<groupId>org.immutables</groupId>
65+
<artifactId>value</artifactId>
66+
<version>2.4.4</version>
67+
<scope>provided</scope>
68+
</dependency>
69+
<dependency>
70+
<groupId>org.apache.opennlp</groupId>
71+
<artifactId>opennlp-tools</artifactId>
72+
<version>1.7.2</version>
4373
</dependency>
4474
</dependencies>
4575

0 commit comments

Comments
 (0)