@@ -10,198 +10,18 @@ This is a collaborative project that aims to serve biomedical and clinical resea
1010with different texts.
1111
1212
13- Install Guide
13+ Wiki
1414========================================================================================================================
1515
16- Along with the source code, we also release distribution of all the files required to immediately start running. You can
17- download this file and appropriate model files at [ http://athena.ahc.umn.edu/biomedicus-downloads/ ] ( http://athena.ahc.umn.edu/biomedicus-downloads/ ) .
16+ [ Our wiki on GitHub ] ( https://github.com/nlpie/biomedicus/wiki ) contains information about installation, configuration,
17+ use, and development of BioMedICUS .
1818
19- Prerequisites
20- ------------------------------------------------------------------------------------------------------------------------
21-
22- The following 3rd-party tools need to be installed to download, compile and run BioMedICUS:
23-
24- 1 . Java SE Development Kit 8: [ download] ( http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html )
25-
26- 2 . Git: [ download] ( http://git-scm.com ) \( optional, only required for checking out the project source code\)
27-
28- 3 . Maven: [ download] ( http://maven.apache.org ) \( optional, only required for building the project from source\)
29-
30- Environment Variables
31- ------------------------------------------------------------------------------------------------------------------------
32-
33- The following environment variables need to be set:
34-
35- 1 . JAVA_HOME:
36-
37- * On OS X, after installing Java SE Development Kit, you can find the path to the installation by typing this
38- command:
39-
40- /usr/libexec/java_home
41-
42- * Open the .bash_profile file in your favorite text editor and add the following statement:
43-
44- export JAVA_HOME="path-to-java-home-from-previous-step"
45-
46- * Save the .bash_profile, close it and then run the following:
47-
48- source .bash_profile
49-
50- * Try 'echo $JAVA_HOME' to make sure the environment is pointing to the right Java version (currently 1.8.0)
51-
52-
53- Getting BioMedICUS
54- ------------------------------------------------------------------------------------------------------------------------
55-
56- If you wish to compile BioMedICUS from source, instead of using the prepackaged version above, the following
57- instructions are provided.
58-
59- At a command prompt, change directories to where you want to install BioMedICUS, and then enter:
60-
61- git clone https://github.com/nlpie/biomedicus.git
62-
63- Notes:
64-
65- * FYI: This clone command will create a subdirectory called 'biomedicus' in the current working directory and create
66- a git repository under that subdirectory.
67-
68-
69- Downloading Large Model Files
70- ------------------------------------------------------------------------------------------------------------------------
71-
72- After having checked out biomedicus from git using the steps above, you will need to download the models file for the
73- version of biomedicus you are running. You can download the data zip at
74- [ http://athena.ahc.umn.edu/biomedicus-downloads/ ] ( http://athena.ahc.umn.edu/biomedicus-downloads/ ) .
75- There are two options, the first is an open data set which is freely licensed UMLS data, the second zip is a umls data
76- set which uses SNOMED data and requires you to log in to a NIH UTS account to verify your UMLS Metathesaurus License.
77- After extracting, set the BIOMEDICUS_JAVA_OPTS java parameter biomedicus.paths.data to the location of the
78- extracted data. If you are using a release version of BioMedICUS, you may instead set the paths.data configuration
79- property in config/biomedicusConfiguration.yml to the location of the BioMedICUS data.
80-
81- export BIOMEDICUS_JAVA_OPTS=-Dbiomedicus.paths.data=[location of data]
82-
83-
84- Building BioMedICUS
85- ------------------------------------------------------------------------------------------------------------------------
86-
87- After cloning completes, run the following to build BioMedICUS
88-
89- cd biomedicus
90- mvn clean install
91-
92-
93- Running BioMedICUS
94- ------------------------------------------------------------------------------------------------------------------------
95-
96- Then maven install phase will create a "release" folder in the root directory of BioMedICUS. This root directory
97- contains all resources necessary to run biomedicus.
98-
99- * config: configuration files such as .properties
100-
101- * ** desc: UIMA descriptor files**
102-
103- - ae - location for pipelines/aggregates of multiple annotators
104- - ae/annotator - standard location for annotators
105- - ae/writer - location for writers to output analyzed documents in a specific format
106- - ae/util - utility analysis engines
107- - cr - location for Collection Readers, to read in documents
108- - tools - descriptors for analysis engines in the tools module.
109-
110- * lib: Compiled and packaged modules of biomedicus and third-party libraries necessary to run biomedicus
111-
112- * bin: scripts for running biomedicus tools or uima tools with the correct classpath for biomedicus
113-
114- - AV.sh run the UIMA Annotation Viewer.
115- [ UIMA help] ( https://uima.apache.org/d/uimaj-2.6.0/tools.html#ugr.tools.annotation_viewer )
116- - CPM.sh run the UIMA Collection Processing Engine Configurator.
117- [ UIMA help] ( https://uima.apache.org/d/uimaj-2.6.0/tools.html#ugr.tools.cpe )
118- - CVD.sh run the UIMA CAS Visual Debugger.
119- [ UIMA help] ( https://uima.apache.org/d/uimaj-2.6.0/tools.html#ugr.tools.cvd )
120- - runClass.sh run any executable java class within the biomedicus classpath
121- - runCPE.sh run a cpe desciptor directly without opening the visual interface
122-
123- After successful installation is completed, the simplest way to start is to use the default annotation pipeline with a
124- UIMA provided graphical user interface - Collection Processing Manager (CPM). The simplest format for the input files is
125- plain text; however, our system also currently has collection readers defined for certain XML formats as well as text
126- stored in a database. Here, we will show how to run BioMedICUS on a collection of plain text files. To run the pipeline,
127- first, you will need to create a local directory containing only the files to be processed
128- (e.g., ./examples/plaintext/input) and move the files to be processed into the newly created directory. We would also
129- recommend creating an output directory in a nearby location that will contain the output of the pipeline
130- (e.g., ./examples/plaintext/output).
131-
132- Next, navigate to the "bin" directory under the BioMedICUS installation directory:
133-
134- cd ./release/bin
135-
136- Next, run the CPM.sh shell script:
137-
138- ./CPM.sh
139-
140- If you have a cpe descriptor file already created, you can load the CPM descriptor by browsing to the "File" menu in the
141- CPM and selecting "Open CPE Descriptor." From there a file dialog will popup and you can browse to your CPE Descriptor
142- file.
143-
144- You can also run a CPE directly without launching the UI by using the runCPE.sh script:
145-
146- ./runCPE.sh /path/to/CpeDescriptor.xml
147-
148- To make it easier to start for users new to BioMedICUS and UIMA, we have included an "examples/plaintext" folder with
149- the BioMedICUS distribution. This folder contains an "input" and "output" subfolders. The ./examples/plaintext/input
150- folder has a few plain ASCII text files. The ./examples/plaintext/ folder also contains a predefined
151- PlainTextCPM_Example.launch descriptor file that can be used to start the CPM or be loaded after the CPM has been
152- started. This launch configuration has relative paths defined - relative to the current working directory from which
153- CPM.sh is started. So, the best way to use it is to first switch to the "scripts" directory. Try the following:
154-
155- cd ./release/scripts
156- ./CPM.sh
157-
158- Follow the instructions for opening a CPE Descriptor and select
159- ${BIOMEDICUS_HOME}/release/examples/plaintext/PlainTextCPM_Example.xml
160-
161- After the process completes, you can view the results with the Annotation Viewer (AV.sh) like this:
162-
163- ./AV.sh
164-
165- When AV starts, you can point it to the output directory ../examples/plaintext/output. You will also need to point AV to
166- the type system that was used by the BioMedICUS pipeline, the type system is outputted to the same directory as the XMI
167- files.
168-
169-
170-
171- IntelliJ IDEA Setup (Recommended)
172- ------------------------------------------------------------------------------------------------------------------------
173-
174- IntelliJ is Java IDE with a Community Edition licensed under the Apache 2.0 license developed by JetBrains S.R.O. You
175- can find it [ here] ( https://www.jetbrains.com/idea/ )
176-
177- 1 . Select Import Project
178- 2 . Browse to the folder that you cloned the biomedicus code into
179- 3 . Select 'Import from external model' and 'Maven'
180-
181-
182- Eclipse Setup
183- ------------------------------------------------------------------------------------------------------------------------
184-
185- Prerequisites: Install m2e. [ link] ( http://eclipse.org/m2e/ ) . Recent versions of Eclipse have maven included.
186-
187- 1 . Open eclipse with a workspace folder that is not the BioMedICUS root directory.
188- 2 . Right click in the package explorer and select 'Import'
189- 3 . Expand the 'Maven' folder and select 'Existing Maven Projects'
190- 4 . Select the root directory of the BioMedICUS project using the 'Browse' button
191- 5 . Confirm that 'pom.xml' and a list of modules appear in the 'Projects' pane
192- 6 . Select 'Finish'
193- 7 . Under the 'Biomedicus' project in the 'developer' folder there are .launch files for the Annotation Viewer and the
194- Collection Processing Engine Configurator, they should be automatically included in the Run and Debug favorites.
195-
196-
197- Changing settings
198- ------------------------------------------------------------------------------------------------------------------------
199-
200- UIMA defines an annotator's settings in an XML descriptor file. These files are located in the
201- release/desc folder. To change the behavior of any of the annotators,
202- the available settings can be changed in the files within that folder.
19+ Contact and Support
20+ ========================================================================================================================
20321
204- The config/biomedicus.properties contains some system-wide per-launch constants such as the location of data models.
22+ BioMedICUS has a [ Google Group] ( https://groups.google.com/a/umn.edu/forum/#!forum/biomedicus ) for contacting developers
23+ with questions, suggestions or feedback. Also, bug reports and feature suggestions can be submitted to the Issues tab on
24+ GitHub.
20525
20626About Us
20727========================================================================================================================
@@ -215,11 +35,6 @@ with assistance from the
21535Other Resources
21636========================================================================================================================
21737
218- ### BioMedICUS
219-
220- * [ Demo] ( http://athena.ahc.umn.edu/biomedicus/ )
221- * [ Source Code] ( https://github.org/nlpie/biomedicus )
222-
22338### NLP-TAB
22439
22540 * [ Demo] ( http://athena.ahc.umn.edu/nlptab )
0 commit comments