ActivitySim
diff --git a/‎.github/workflows/python-package.yml‎
Lines changed: 37 additions & 0 deletions b/‎.github/workflows/python-package.yml‎
Lines changed: 37 additions & 0 deletions
diff --git a/‎.gitignore‎
Lines changed: 3 additions & 3 deletions b/‎.gitignore‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎.pre-commit-config.yaml‎
Lines changed: 19 additions & 0 deletions b/‎.pre-commit-config.yaml‎
Lines changed: 19 additions & 0 deletions
diff --git a/‎.python-version‎
Lines changed: 1 addition & 0 deletions b/‎.python-version‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎.vscode/launch.json‎
Lines changed: 16 additions & 0 deletions b/‎.vscode/launch.json‎
Lines changed: 16 additions & 0 deletions
diff --git a/‎.vscode/settings.json‎
Lines changed: 9 additions & 0 deletions b/‎.vscode/settings.json‎
Lines changed: 9 additions & 0 deletions
diff --git a/‎MANIFEST.in‎
Lines changed: 0 additions & 7 deletions b/‎MANIFEST.in‎
Lines changed: 0 additions & 7 deletions
diff --git a/‎README.md‎
Lines changed: 10 additions & 0 deletions b/‎README.md‎
Lines changed: 10 additions & 0 deletions
diff --git a/‎docs/application_configuration.rst‎
Lines changed: 24 additions & 24 deletions b/‎docs/application_configuration.rst‎
Lines changed: 24 additions & 24 deletions
@@ -0,0 +1,37 @@
+# This workflow will install Python dependencies, run tests and lint with a variety of Python versions
+# For more information see: https://docs.github.com/en/actions/automating-builds-and-tests/building-and-testing-python
+
+name: Python package
+
+on:
+  push:
+    branches: [ "master", "develop"]
+  pull_request:
+    branches: [ "master" ]
+
+jobs:
+  build:
+
+    runs-on: ubuntu-latest
+    strategy:
+      fail-fast: false
+      matrix:
+        python-version: ["3.9", "3.10", "3.11", "3.12"]
+
+    steps:
+    - uses: actions/checkout@v4
+
+    - name: Install uv and set the python version
+      uses: astral-sh/setup-uv@v5
+      with:
+        python-version: ${{ matrix.python-version }}
+        version: "0.6.14"
+
+    - name: Install the project
+      run: uv sync --all-extras --dev
+
+    - name: Lint with ruff
+      uses: astral-sh/ruff-action@v3
+
+    - name: Run tests
+      run: uv run pytest
@@ -1,10 +1,10 @@
-sandbox/
+scripts/calm_validation_results/
+
 regress/
-example_test_no_integerizing/
-example_mtc/
 .idea
 .ipynb_checkpoints
 
+.coverage.*
 
 # Byte-compiled / optimized / DLL files
 __pycache__/
 
@@ -0,0 +1,19 @@
+repos:
+  - repo: https://github.com/pre-commit/pre-commit-hooks
+    rev: v5.0.0  # Use latest stable version
+    hooks:
+      # - id: check-yaml
+      - id: end-of-file-fixer
+      - id: trailing-whitespace
+
+  - repo: https://github.com/psf/black
+    rev: 24.3.0  # Use latest Black version
+    hooks:
+      - id: black
+        language_version: python3  # Ensures compatibility with Python 3+
+
+  - repo: https://github.com/astral-sh/ruff-pre-commit
+    rev: v0.3.3  # Replace with latest Ruff release
+    hooks:
+      - id: ruff
+        args: [--fix]  # Optional: auto-fix simple issues
@@ -0,0 +1 @@
+3.12
@@ -0,0 +1,16 @@
+{
+    // Use IntelliSense to learn about possible attributes.
+    // Hover to view descriptions of existing attributes.
+    // For more information, visit: https://go.microsoft.com/fwlink/?linkid=830387
+    "version": "0.2.0",
+    "configurations": [
+        {
+            "name": "Python Debugger: Current File",
+            "type": "debugpy",
+            "request": "launch",
+            "program": "${file}",
+            "console": "integratedTerminal",
+            "justMyCode": true,
+        }
+    ]
+}
@@ -0,0 +1,9 @@
+{
+    "python.testing.pytestArgs": [
+        "populationsim",
+        "tests"
+    ],
+    "python.testing.unittestEnabled": false,
+    "python.testing.pytestEnabled": true,
+    "ruff.enable": true,
+}
@@ -10,6 +10,16 @@ easily adapted for statewide, regional, and urban transportation planning
 needs.  PopulationSim is implemented in the
 [ActivitySim](https://github.com/activitysim/activitysim) framework.
 
+## Command-Line Interface
+
+PopulationSim can be run directly from the command line:
+
+```bash
+populationsim -c /path/to/configs -d /path/to/data -o /path/to/output
+```
+
+See the [examples directory](examples/) for more information on using the command-line interface.
+
 ## Documentation
 
 https://activitysim.github.io/populationsim/
@@ -121,7 +121,7 @@ PopulationSim is configured using the settings.yaml file. PopulationSim can be c
 
 :regular mode:
 
-  The regular configuration runs PopulationSim from beginning to end and produces a new synthetic population.  This can run either single-process or multi-processed to save on runtime.  
+  The regular configuration runs PopulationSim from beginning to end and produces a new synthetic population.  This can run either single-process or multi-processed to save on runtime.
 
 :repop mode:
 
@@ -263,17 +263,17 @@ This sub-directory is populated at the end of the PopulationSim run. The table b
 Configuring Settings File
 ~~~~~~~~~~~~~~~~~~~~~~~~~
 
-PopulationSim is configured using the *configs/settings.yaml* file. The user has the flexibility to specify algorithm functionality, list geographies, invoke tracing, provide inputs specifications, select outputs, list the steps to run, and specify multiprocess settings. 
+PopulationSim is configured using the *configs/settings.yaml* file. The user has the flexibility to specify algorithm functionality, list geographies, invoke tracing, provide inputs specifications, select outputs, list the steps to run, and specify multiprocess settings.
 
 .. note::
-   When running PopulationSim, multiple settings files can be specified so long as the ``inherit_settings: True`` setting is included in 
+   When running PopulationSim, multiple settings files can be specified so long as the ``inherit_settings: True`` setting is included in
    subsequent files.  This feature is used for the multi-processing configuration described below.  To utilize this feature, once can run PopulationSim
-   with the following command: ``python run_populationsim.py -c configs_mp -c configs``.  This command specifies two config folders, each with 
+   with the following command: ``python run_populationsim.py -c configs_mp -c configs``.  This command specifies two config folders, each with
    a settings file, and the ``configs_mp`` settings inherit from the earlier ``configs`` settings.
 
 The settings shown below are from the PopulationSim application for the CALM region as an example of how a run can be configured. The meta geography for CALM region is named as *Region*, the seed geography is *PUMA* and the two sub-seed geographies are *TRACT* and *TAZ*. The settings below are for this four geography application, but the user can configure PopulationSim for any number of geographies and use different geography names.
 
-Some of the setting are configured differently for the *repop* mode. The settings specific to the *repop* mode are described in the :ref:`settings_repop` section.  The settings specific to the *multiprocessing* setup are described in the :ref:`settings_mp` section.  
+Some of the setting are configured differently for the *repop* mode. The settings specific to the *repop* mode are described in the :ref:`settings_repop` section.  The settings specific to the *multiprocessing* setup are described in the :ref:`settings_mp` section.
 
 **Algorithm/Software Configuration**:
 
@@ -395,11 +395,11 @@ Note that Seed-Households, Seed-Persons and Geographic CrossWalk are all require
 	- tablename: households
 		filename : seed_households.csv
 		index_col: hh_id
-		column_map:
+		rename_columns:
 		hhnum: hh_id
 	- tablename: persons
 		filename : seed_persons.csv
-		column_map:
+		rename_columns:
 		hhnum: hh_id
 		SPORDER: per_num
 		# drop mixed type fields that appear to have been incorrectly generated
@@ -414,7 +414,7 @@ Note that Seed-Households, Seed-Persons and Geographic CrossWalk are all require
 		- naicsp07
 	- tablename: geo_cross_walk
 		filename : geo_cross_walk.csv
-		column_map:
+		rename_columns:
 		TRACTCE: TRACT
 	- tablename: TAZ_control_data
 		filename : control_totals_taz.csv
@@ -454,7 +454,7 @@ Note that Seed-Households, Seed-Persons and Geographic CrossWalk are all require
 +--------------+---------------------------------------------------------------------------------------+
 | index_col    | Name of the unique ID field in the seed household data                                |
 +--------------+---------------------------------------------------------------------------------------+
-| column_map   | Column map of fields to be renamed. The format for the column map is as follows: |br| |
+| rename_columns   | Column map of fields to be renamed. The format for the column map is as follows: |br| |
 |              | ``Name in CSV: New Name``                                                             |
 +--------------+---------------------------------------------------------------------------------------+
 | drop_columns | List of columns to be dropped from the input data                                     |
@@ -627,17 +627,17 @@ For detailed information on software implementation refer to :ref:`core_componen
 Configuring Settings File for Multiprocessing
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 
-This sections describes the settings that are additionally configured for running PopulationSim with 
-multiprocessing to reduce runtime.  PopulationSim uses ActivitySim's multiprocessing capabilities, which 
+This sections describes the settings that are additionally configured for running PopulationSim with
+multiprocessing to reduce runtime.  PopulationSim uses ActivitySim's multiprocessing capabilities, which
 are described in more detail `here <https://activitysim.github.io/activitysim/howitworks.html#multiprocessing>`_.
 
-The example below can be found in the ``example_calm\configs_mp\settings.yaml`` file.  The group of model steps 
-identified as ``mp_seed_balancing`` and starting with ``input_pre_processor`` 
-are run single process until the next group of model steps identified as ``mp_sub_balancing_TAZ`` and starting with 
+The example below can be found in the ``example_calm\configs_mp\settings.yaml`` file.  The group of model steps
+identified as ``mp_seed_balancing`` and starting with ``input_pre_processor``
+are run single process until the next group of model steps identified as ``mp_sub_balancing_TAZ`` and starting with
 ``sub_balancing.geography=TAZ`` is reached, at which time PopulationSim runs these steps in parallel using two processors
-by slicing the problem into separate geographic batches based on the ``slice_geography: TRACT`` setting.  It then 
-returns to single process with the final group of model steps identified as ``mp_summarize`` and 
-beginning with ``expand_households``.  
+by slicing the problem into separate geographic batches based on the ``slice_geography: TRACT`` setting.  It then
+returns to single process with the final group of model steps identified as ``mp_summarize`` and
+beginning with ``expand_households``.
 
 ::
 
@@ -666,8 +666,8 @@ beginning with ``expand_households``.
           - trace_TAZ_weights
     - name: mp_summarize
       begin: expand_households
-    
-    
+
+
 +-------------------------------+--------------------------------------------------------------------------------------------------------------+
 | Attribute                     | Description                                                                                                  |
 +===============================+==============================================================================================================+
@@ -859,7 +859,7 @@ Some conventions for writing expressions:
   * Expressions must be vectorized expressions and can use most numpy and pandas expressions.
   * When editing the CSV files in Excel, use single quote ' or space at the start of a cell to get Excel to accept the expression
 
-.. _importance: 
+.. _importance:
 
 What are importance weights
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -882,18 +882,18 @@ Where, :math:`z_{i}` are relaxation factors and :math:`a_{in}` are incidence val
 
 Where, :math:`u_{i}` are the penalties termed as importance factors or importance weights in PopulationSim.
 
-:math:`x_{n}` and :math:`z_{i}`  are the parameters solved by the optimization while importance weights (:math:`u_{i}`) are the hyperparameters that are exposed to the user and impact the optimization externally. The objective of the relative entropy optimization is to find a set of weights that are uniform and satisfy marginal controls. The importance weights allow the user to trade-off between these objectives. High importance weights (e.g., 1E10) on all controls result in a hard constrained optimization which gives a high preference to matching marginal controls. Low importance weights (e.g., <50) results in an almost unconstrained problem. The user may also specify different importance weights for each marginal control. In this case, the controls with higher importance weights are given preference over the ones with low importance weights. Therefore, both absolute and relative value of the importance weights impacts the optimization problem and the solution. 
+:math:`x_{n}` and :math:`z_{i}`  are the parameters solved by the optimization while importance weights (:math:`u_{i}`) are the hyperparameters that are exposed to the user and impact the optimization externally. The objective of the relative entropy optimization is to find a set of weights that are uniform and satisfy marginal controls. The importance weights allow the user to trade-off between these objectives. High importance weights (e.g., 1E10) on all controls result in a hard constrained optimization which gives a high preference to matching marginal controls. Low importance weights (e.g., <50) results in an almost unconstrained problem. The user may also specify different importance weights for each marginal control. In this case, the controls with higher importance weights are given preference over the ones with low importance weights. Therefore, both absolute and relative value of the importance weights impacts the optimization problem and the solution.
 
-.. _setting-importance: 
+.. _setting-importance:
 
 Setting importance weights
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~
 
 Given the flexibility that importance weights offer to the user, they need to be tuned to get the desired optimality in the outputs for the given seed sample and marginal controls. The quality of the outputs is defined by a uniformity measure of the weights and goodness of fit across marginal controls. Here are general guidelines on setting importance weights:
 
    * Start with a reasonable importance factor value across all controls (e.g., 1000 has typically worked well for multiple regions). This excludes the control on the total number of households which should be set to very high importance to ensure that the right number of households is generated for each zone.
-   * After achieving reasonable goodness of fit across controls, the importance weights can be increased/decreased to favor one control over the other, or all importance weights can be reduced to improve the uniformity of the weights. Which controls to favor depends on the type of application and the quality of the marginal data. 
-   * The importance weights are generally updated in factors of 10. The user may need to run PopulationSim multiple times using various combinations of importance weights to reach the desired quality of outputs. 
+   * After achieving reasonable goodness of fit across controls, the importance weights can be increased/decreased to favor one control over the other, or all importance weights can be reduced to improve the uniformity of the weights. Which controls to favor depends on the type of application and the quality of the marginal data.
+   * The importance weights are generally updated in factors of 10. The user may need to run PopulationSim multiple times using various combinations of importance weights to reach the desired quality of outputs.