
Commit d375d6d

Merge branch 'master' of https://github.com/cfpb/hmda-platform
2 parents c369e2e + 1af4ca7 commit d375d6d

File tree

38 files changed

+870
-241
lines changed


Documents/panel.md

Lines changed: 92 additions & 11 deletions
Original file line number | Diff line number | Diff line change
@@ -1,30 +1,106 @@
11
# Panel CSV Loader
22

3-
## Overview
4-
The panel loader is designed to read a CSV file and load the data onto the HMDA-Platform. The CSV file should use the `|` (pipe) delimiter, and should include a header row as the first line.
3+
The panel loader is designed to read a CSV file of institution data and load them onto the HMDA-Platform. It can be used to load data either into a local Cassandra instance or a remote one (e.g. in a cluster).
54

6-
## Environment Variables
7-
There is only one environment variable used by the panel loader. It must be set correctly in order for the data to be sent to the admin API.
5+
## The Panel File
86

9-
For testing on an API running in SBT, no changes need to be made. The default for this variable will point to the correct local admin API.
7+
The CSV file should use the `|` (pipe) delimiter, and should include a header row as the first line.
8+
9+
A small example file (~200 institutions) is located at `panel/src/main/resources/inst_data_2017_dummy.csv`
10+
11+
The real panel file (~160,000 institutions) is located at `panel/src/main/resources/inst_data_2017.csv`
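As a quick sanity check, a pipe-delimited file can be inspected from the shell. The column names below are invented for illustration only; the real header has many more fields:

```shell
# Create a tiny stand-in panel file (column names are invented;
# the real header differs).
printf 'name|agency|year\nBank One|1|2017\nBank Two|2|2017\n' > /tmp/panel_sample.csv

# Show the header row
head -n 1 /tmp/panel_sample.csv

# Count institution records, excluding the header
tail -n +2 /tmp/panel_sample.csv | wc -l
```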
12+
13+
14+
## Loading Institutions Remotely
15+
16+
For loading panel data into a remote system or into a local Docker container, you don't need to have any services running on your local environment as dependencies. You will need to set the `HMDA_HTTP_ADMIN_URL` environment variable.
1017

11-
For loading panel data into a remote system or into a local Docker container, you'll need to set the following environment variable:
1218
```shell
13-
> export HMDA_HTTP_ADMIN_URL={base URL}
19+
> export HMDA_HTTP_ADMIN_URL={admin URL}
1420
```
1521

1622
**IMPORTANT NOTE:** The base URL should *include* `http://` or `https://`, but *exclude* any trailing slash `/`. For example:
1723

24+
To load panel data into the cluster, simply find the URL of the admin API (for the release branch: `https://hmda-ops-api.demo.cfpb.gov/admin`).
25+
26+
To load panel data into a Docker container running locally, the URL will depend on your Docker Machine's IP. If it uses the default IP, this will be the admin API URL:
1827
```shell
1928
> export HMDA_HTTP_ADMIN_URL=http://192.168.99.100:8081
2029
```
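The note above (scheme present, no trailing slash) can be checked with a small shell sketch; the URL here is just the default Docker Machine example:

```shell
HMDA_HTTP_ADMIN_URL=http://192.168.99.100:8081

# The URL must start with http:// or https:// ...
case "$HMDA_HTTP_ADMIN_URL" in
  http://*|https://*) echo "scheme: ok" ;;
  *)                  echo "scheme: missing" ;;
esac

# ... and must not end with a trailing slash
case "$HMDA_HTTP_ADMIN_URL" in
  */) echo "trailing slash: present" ;;
  *)  echo "trailing slash: none" ;;
esac
```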
2130

22-
## Running the parser
23-
A small example file is located at `panel/src/main/resources/inst_data_2017_dummy.csv`
31+
Once that variable is set, use the instructions in [Running the Loader](#running-the-loader) to load the data.
32+
2433

25-
The real panel file is located at `panel/src/main/resources/inst_data_2017.csv`
34+
## Loading Institutions Locally
2635

27-
In order for the panel data to be loaded locally, the API project must be up and running, along with Docker containers running Cassandra and Zookeper, or run the full `docker-compose` setup. To load panel data into the cluster, simply find the URL of the admin api (for the release branch: `https://hmda-ops-api.demo.cfpb.gov/admin`). No other running services are necessary.
36+
In order for the panel data to be loaded locally, the API project must be up and running, along with Docker containers running Cassandra, PostgreSQL, and Zookeeper. Once the dependencies are running, use the instructions in [Running the Loader](#running-the-loader) to load the data.
37+
38+
### Running the Dependencies
39+
40+
#### Cassandra
41+
42+
The easiest way to run a Cassandra server to support this application for testing is to do it through Docker:
43+
44+
```shell
45+
docker run --name cassandra -p 9042:9042 -p 7000:7000 -p 7199:7199 cassandra:3.10
46+
```
47+
48+
If you want to connect to this server, the following `docker` command will give you access to the Cassandra instance started in the previous step:
49+
50+
```shell
51+
docker run -it --link cassandra:cassandra --rm cassandra cqlsh cassandra
52+
```
53+
54+
#### Apache Zookeeper
55+
56+
The `HMDA Platform` is a distributed system that is meant to be run as a clustered application in production.
57+
As such, it needs a mechanism for storing configuration information for additional nodes joining the cluster.
58+
`Apache Zookeeper` is used to store this information. To run the project, ZooKeeper must be running and available in the local network.
59+
An easy way to satisfy this requirement is to launch a docker container with `ZooKeeper`, as follows:
60+
61+
```shell
62+
$ docker run --rm -p 2181:2181 -p 2888:2888 -p 3888:3888 jplock/zookeeper
63+
```
64+
65+
#### PostgreSQL
66+
67+
To run Postgres from a Docker container with the correct ports to connect to the HMDA Platform, use the following command:
68+
69+
```shell
70+
docker run -e POSTGRES_PASSWORD=postgres -e POSTGRES_USER=postgres -e POSTGRES_DB=hmda -p 54321:5432 postgres:9.6.1
71+
```
72+
73+
#### HMDA API
74+
75+
* Set the environment variables for Zookeeper. `ZOOKEEPER_HOST` uses your Docker Machine's IP address. In this example, we use the default Docker Machine IP:
76+
77+
```shell
78+
export ZOOKEEPER_HOST=192.168.99.100
79+
export ZOOKEEPER_PORT=2181
80+
```
81+
82+
* Set the environment variables for the local Cassandra instance. `CASSANDRA_CLUSTER_HOSTS` also uses the Docker Machine IP:
83+
84+
```shell
85+
export CASSANDRA_CLUSTER_HOSTS=192.168.99.100
86+
export CASSANDRA_CLUSTER_PORT=9042
87+
```
88+
89+
* Tell the platform to use Cassandra as its database instead of LevelDB:
90+
91+
```shell
92+
export HMDA_IS_DEMO=false
93+
```
94+
95+
* Start sbt using the command `sbt`, then use these commands at the sbt prompt:
96+
97+
```shell
98+
project api
99+
clean
100+
re-start
101+
```
102+
103+
## Running the Loader
28104

29105
In a terminal, execute the following commands:
30106

@@ -41,6 +117,7 @@ sbt> assembly
41117
```
42118
Then the panel loader can be run with `java -jar panel/target/scala-2.12/panel.jar path/to/institution_file.csv`
43119

120+
44121
## Error codes
45122
There are four ways the panel loader can fail. The exit code and error message should tell you what happened.
46123

@@ -49,7 +126,11 @@ There are four ways the panel loader can fail. The exit code and error message
49126
3. The call to `institutions/create` didn't return the correct response. This can indicate that you don't have the correct environment variables set, or that something is wrong with the hmda-platform.
50127
4. The loader didn't finish processing all the institutions. This is known to happen when running the real panel file, though the cause has not yet been determined.
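A wrapper script can branch on the loader's exit code. The stub below stands in for the real `java -jar` invocation, and the code-to-message mapping assumes the exit codes match the numbered list above:

```shell
# Stand-in for: java -jar panel/target/scala-2.12/panel.jar "$csv_file"
run_loader() {
  return 3   # simulate failure mode 3 for illustration
}

run_loader
case "$?" in
  0) echo "panel loaded successfully" ;;
  3) echo "institutions/create returned an unexpected response" ;;
  4) echo "loader stopped before processing every institution" ;;
  *) echo "loader failed; check the error message" ;;
esac
```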
51128

129+
52130
## Testing
131+
132+
Once you have run the Panel Loader with an institution file, you can check the HMDA API to see that the data loaded correctly.
133+
53134
Make sure your authorization header is updated with a few real `id_rssd` fields from the given file. This can be found in the API log output (first field argument in the `InstitutionQuery` object), or in the CSV file (seventh field).
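For example, the seventh field can be pulled out of a pipe-delimited file with `cut`; the two rows below are placeholders, not real panel data:

```shell
# Two fake rows with the id_rssd in the seventh field
printf 'a|b|c|d|e|f|1234|h\na|b|c|d|e|f|5678|h\n' > /tmp/panel_rows.csv

# Extract the seventh (id_rssd) field from each row
cut -d'|' -f7 /tmp/panel_rows.csv
```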
54135

55136
Try out the endpoint `localhost:8080/institutions`, and you should see a response with real panel data.

README.md

Lines changed: 66 additions & 95 deletions
@@ -14,6 +14,10 @@ For more information on HMDA, checkout the [About HMDA page](http://www.consumer
1414

1515
This repository contains the code for the entirety of the HMDA platform backend. This platform has been designed to accommodate the needs of the HMDA filing process by financial institutions, as well as the data management and publication needs of the HMDA data asset.
1616

17+
The HMDA Platform uses sbt's multi-project builds, each project representing a specific task. The platform is an Akka Cluster
18+
application that can be deployed on a single node or as a distributed application. For more information on how Akka Cluster
19+
is used, see the documentation [here](Documents/cluster.md).
20+
1721
The HMDA Platform is composed of the following modules:
1822

1923
### Parser (JS/JVM)
@@ -70,90 +74,101 @@ The HMDA Platform is written in [Scala](http://www.scala-lang.org/). To build it
7074

7175
In addition, you'll need Scala's interactive build tool [sbt](http://www.scala-sbt.org/0.13/tutorial/index.html). Please refer to sbt's [installation instructions](http://www.scala-sbt.org/0.13/tutorial/Setup.html) to get started.
7276

73-
## Building and Running
77+
### Docker
7478

75-
The HMDA Platform uses sbt's multi-project builds, each project representing a specific task. The platform is an Akka Cluster
76-
application that can be deployed on a single node or as a distributed application. For more information on how Akka Cluster
77-
is used, see the documentation [here](Documents/cluster.md)
79+
Though Docker is not a dependency of the Scala project, it is very useful for running and smoke testing locally.
80+
Use the following steps to prepare a local environment for running the Platform with docker:
7881

79-
### Interactive
82+
First, make sure that you have the [Docker Toolbox](https://www.docker.com/docker-toolbox) installed.
8083

81-
* The write side of this system is supported by either a local `leveldb` database or Cassandra. By default, the local `leveldb` is utilized, and some sample data is loaded automatically.
82-
If using `Cassandra` is desired, the following environment variable needs to be set:
84+
If you don't have a Docker machine created, you can create one with the default parameters using the command below.
85+
This will be sufficient for running most docker containers (e.g. the dev dependencies for the API), but not for running the entire platform.
8386

8487
```shell
85-
export HDMA_IS_DEMO=false
88+
docker-machine create --driver virtualbox dev
8689
```
8790

88-
The easiest way to run a Cassandra server to support this application for testing is to do it through Docker:
91+
If you wish to run the entire platform using Docker (currently the only way to run the entire platform),
92+
you'll need to dedicate more resources to the Docker machine.
93+
We've found that for the full stack to run efficiently, you need approximately:
8994

90-
```shell
91-
docker run --name cassandra -p 9042:9042 -p 7000:7000 -p 7199:7199 cassandra:3.10
92-
```
95+
* 4 CPUs
96+
* 6 GB RAM
97+
* 80 GB Disk space
9398

94-
If you want to connect to this server, the following `docker` command will give you access to the Cassandra instance started in the previous step:
99+
Assuming you are using Docker Machine to provision your Docker
100+
environment, you can check your current settings with the following
101+
(ignore the second `Memory`):
95102

96103
```shell
97-
docker run -it --link cassandra:cassandra --rm cassandra cqlsh cassandra
104+
$ docker-machine inspect | grep 'CPU\|Memory\|DiskSize'
105+
"CPU": 4,
106+
"Memory": 6144,
107+
"DiskSize": 81920,
108+
"Memory": 0,
98109
```
99110

100-
Once the `Cassandra` server is running, set the following environment variable to the appropriate Cassandra host (in this example, the default local docker host for a machine running MacOs X):
111+
If your settings are below these suggestions, you should create a new
112+
Docker VM. The following will create a VM named `hmda-platform` with
113+
the appropriate resources:
101114

102115
```shell
103-
export CASSANDRA_CLUSTER_HOSTS=192.168.99.100
116+
$ docker-machine create \
117+
--driver virtualbox \
118+
--virtualbox-disk-size 81920 \
119+
--virtualbox-cpu-count 4 \
120+
--virtualbox-memory 6144 \
121+
hmda-platform
104122
```
105123

106-
To load data into `Cassandra`, you can run the following (the Cassandra server needs to be running and correct environment variables configured as per the previous instructions):
107-
124+
After the machine is created, make sure that you connect your shell with the newly created machine:
108125
```shell
109-
$ sbt
110-
project panel
111-
run <full local path to sample file>
126+
$ eval "$(docker-machine env dev)"
112127
```
113-
A sample file is located in the following folder: `panel/src/main/resources/inst_data_2017_dummy.csv`
114128

115129

116-
* In order to support the read side, a local PostgreSQL and Cassandra server are needed. Assuming it runs on the default port, on the same machine as the API, the following environment variable needs to be set:
130+
## Building and Running
131+
132+
### Building the .jar
133+
134+
* To build JVM artifacts (the default, includes all projects), from the sbt prompt:
117135

118136
```shell
119-
export JDBC_URL='jdbc:postgresql://localhost/hmda?user=postgres&password=postgres'
137+
> clean assembly
120138
```
121139

122-
where `hmda` is the name of the `PostgreSQL` database, owned by the default user with default password (`postgres`)
123-
124-
For Cassandra, the following environment variables need to be set (assuming Cassandra is running on a docker container as described above):
140+
This task will create a `fat jar`, which can be executed directly on any JDK8 compliant JVM:
125141

126142
```shell
127-
export CASSANDRA_CLUSTER_HOSTS=192.168.99.100
128-
export CASSANDRA_CLUSTER_PORT=9042
143+
java -jar target/scala-2.11/hmda.jar
129144
```
130145

131-
**Note: if you are running the backend only through sbt, the database needs to be created manually in advance, see instructions [here](https://www.postgresql.org/docs/9.1/static/manage-ag-createdb.html)**
132146

133-
* The `HMDA Platform` is a distributed system that is meant to be run as a clustered application in production.
134-
As such, it needs a mechanism for storing configuration information for additional nodes joining the cluster.
135-
`Apache Zookeeper` is used to store this information. To run the project, zookeeper must be running and available in the local network.
136-
An easy way to satisfy this requirement is to launch a docker container with `ZooKeeper`, as follows:
147+
### Running Interactively
137148

138-
```shell
139-
$ docker run --rm -p 2181:2181 -p 2888:2888 -p 3888:3888 jplock/zookeeper
140-
```
149+
#### Running the Dependencies
141150

142-
* Set the environemnet variables for Zookeper
151+
Assuming you have Docker-Compose installed (according to the [Docker](#docker) instructions above),
152+
the easiest way to get all of the platform's dependencies up and running is with the provided docker-compose dev setup:
143153

144154
```shell
145-
export ZOOKEEPER_HOST=192.168.99.100
146-
export ZOOKEEPER_PORT=2181
155+
docker-compose -f docker-dev.yml up
147156
```
148157

149-
Alternatively, these dependencies (`Cassandra`, `Zookeeper` and `PostgreSQL`) can be started from `docker` providing default resources for the `HMDA Platform`:
158+
When finished, use `docker-compose down` to gracefully stop the running containers.
159+
150160

151-
`docker-compose -f docker-dev.yml up`
161+
#### Running the API
152162

153-
* If you want to use the sample files in this repo for testing the app, run the edits in demo mode. Otherwise, edit S025 will trigger for all files.
163+
Once the dependencies (above) are running, follow these steps in a separate terminal session to get the API running with sbt:
164+
165+
* For smoke testing locally, add the following two environment variables:
166+
* `EDITS_DEMO_MODE`: This will allow you to use the sample files in this repo for testing the app. Otherwise, edit S025 will trigger for all files.
167+
* `HMDA_IS_DEMO`: This uses configuration files that allow running the app locally, instead of in a cluster.
154168

155169
```shell
156170
export EDITS_DEMO_MODE=true
171+
export HMDA_IS_DEMO=true
157172
```
158173

159174
* Start `sbt`
@@ -173,38 +188,17 @@ $ sbt
173188

174189
Confirm that the platform is up and running by browsing to http://localhost:8080
175190

176-
* To build JVM artifacts (the default, includes all projects), from the sbt prompt:
177-
178-
```shell
179-
> clean assembly
180-
```
181-
182-
This task will create a `fat jar`, which can be executed directly on any JDK8 compliant JVM:
183-
184-
```shell
185-
java -jar target/scala-2.11/hmda.jar
186-
```
187-
191+
When finished, press enter to get the sbt prompt, then stop the project by entering `reStop`.
188192

189-
### Docker
190193

191-
First, make sure that you have the [Docker Toolbox](https://www.docker.com/docker-toolbox) installed.
194+
### Running the Project with Docker
192195

193-
If you don't have a Docker machine created, you can create one by issuing the following:
194-
```shell
195-
docker-machine create --driver virtualbox dev
196-
```
197-
198-
After the machine is created, make sure that you connect your shell with the newly created machine
199-
```shell
200-
$ eval "(docker-machine env dev)"
201-
```
196+
#### To run only the API
202197

203-
Ensure there's a compiled jar to create the Docker image with:
198+
First, ensure there's a compiled jar to create the Docker image with:
204199
```shell
205200
sbt clean assembly
206201
```
207-
#### To run only the API
208202

209203
Build the docker image
210204
```shell
@@ -219,35 +213,12 @@ docker run -d -p "8080:8080 -p 8082:8082" hmda-api
219213
The Filing API will run on `$(docker-machine ip):8080`
220214
The Public API will run on `$(docker-machine ip):8082`
221215

216+
By default, the `HMDA Platform` runs with a log level of `INFO`. This can be changed by setting a different log level in the `HMDA_LOGLEVEL` environment variable.
217+
For the different logging options, see the [reference.conf](https://github.com/akka/akka/blob/master/akka-actor/src/main/resources/reference.conf#L38) default configuration file for `Akka`.
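For example, a local run can be made more verbose like this (the commented `docker run` line is an illustrative sketch, not a tested invocation):

```shell
# Select a more verbose level; the platform falls back to INFO if unset
export HMDA_LOGLEVEL=DEBUG
echo "log level: ${HMDA_LOGLEVEL:-INFO}"

# For the Docker image, the variable can be passed through instead, e.g.:
#   docker run -d -e HMDA_LOGLEVEL=DEBUG -p 8080:8080 -p 8082:8082 hmda-api
```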
218+
222219
#### To run the entire platform
223220

224-
1. Dedicate appropriate resources to your Docker environment. We've found
225-
that for the full stack to run efficiently, you need approximately:
226-
227-
* 4 CPUs
228-
* 6 GB RAM
229-
* 80 GB Disk space
230-
231-
Assuming you are using Docker Machine to provision your Docker
232-
environment, you can check you current settings with the following
233-
(ignore the second `Memory`):
234-
235-
$ docker-machine inspect | grep 'CPU\|Memory\|DiskSize'
236-
"CPU": 4,
237-
"Memory": 6144,
238-
"DiskSize": 81920,
239-
"Memory": 0,
240-
241-
If your settings are below these suggestions, you should create a new
242-
Docker VM. The following will create a VM named `hmda-platform` with
243-
the appropriate resources:
244-
245-
$ docker-machine create \
246-
--driver virtualbox \
247-
--virtualbox-disk-size 81920 \
248-
--virtualbox-cpu-count 4 \
249-
--virtualbox-memory 6144 \
250-
hmda-platform
221+
1. Ensure you have a Docker Machine with sufficient resources, as described in the [Docker](#docker) section above.
251222

252223
1. Clone [hmda-platform-ui](https://github.com/cfpb/hmda-platform-ui) and
253224
[hmda-platform-auth](https://github.com/cfpb/hmda-platform-auth) into the same

api/src/main/resources/application-dev.conf

Lines changed: 1 addition & 0 deletions
@@ -1,6 +1,7 @@
11
akka {
22
loggers = ["akka.event.slf4j.Slf4jLogger"]
33
loglevel = "INFO"
4+
loglevel = ${?HMDA_LOGLEVEL}
45
logging-filter = "akka.event.slf4j.Slf4jLoggingFilter"
56
http.parsing.max-content-length = 1G
67
http.server.default-host-header = "cfpb.gov"
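The added `loglevel = ${?HMDA_LOGLEVEL}` line uses HOCON's optional substitution: when the environment variable is unset, the line is dropped and the earlier `loglevel = "INFO"` stands; when it is set, the later line overrides the default. A minimal sketch of the pattern:

```hocon
# Default, used when HMDA_LOGLEVEL is not set
loglevel = "INFO"
# Overrides the default only if the environment variable exists
loglevel = ${?HMDA_LOGLEVEL}
```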

api/src/main/resources/application.conf

Lines changed: 1 addition & 0 deletions
@@ -1,6 +1,7 @@
11
akka {
22
loggers = ["akka.event.slf4j.Slf4jLogger"]
33
loglevel = "INFO"
4+
loglevel = ${?HMDA_LOGLEVEL}
45
logging-filter = "akka.event.slf4j.Slf4jLoggingFilter"
56
http.parsing.max-content-length = 1G
67
http.server.default-host-header = "cfpb.gov"

api/src/main/resources/logback.xml

Lines changed: 1 addition & 0 deletions
@@ -18,5 +18,6 @@
1818
<logger name="com.zaxxer.hikari" level="INFO" />
1919
<logger name="com.datastax.driver" level="INFO" />
2020
<logger name="org.apache.zookeeper" level="WARN" />
21+
<logger name="de.heikoseeberger.constructr" level="INFO"/>
2122

2223
</configuration>
