Skip to content

Commit e9e002f

Browse files
authored
Merge pull request #42 from fidelity/batch_processing
Batch processing
2 parents c6a6829 + 7887bf7 commit e9e002f

37 files changed

+13353
-1029
lines changed

.github/workflows/ci.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ jobs:
1313
runs-on: ${{ matrix.os }}
1414
strategy:
1515
matrix:
16-
python-version: ['3.6', '3.7', '3.8', '3.9', '3.10']
16+
python-version: ['3.7', '3.8', '3.9', '3.10']
1717
os: [ubuntu-latest, macos-latest, windows-latest]
1818
steps:
1919
- uses: actions/checkout@v3

CHANGELOG.txt

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,16 @@
22
CHANGELOG
33
=========
44

5+
-------------------------------------------------------------------------------
6+
Apr, 12, 2023 1.4.0
7+
-------------------------------------------------------------------------------
8+
9+
Major:
10+
- Enable running Seq2Pat on batches of sequences in parallel
11+
- Add a usage example notebook to show how batch processing is called
12+
- Enrich documentations and comments relevant to batch processing and a few tests
13+
- Explicitly require that input integer sequences only contain positive integers
14+
515
-------------------------------------------------------------------------------
616
July, 13, 2022 1.3.4
717
-------------------------------------------------------------------------------

README.md

Lines changed: 17 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,10 @@ Seq2Pat is developed as a joint collaboration between Fidelity Investments
2222
and the Tepper School of Business at CMU. Documentation is available at [fidelity.github.io/seq2pat](https://fidelity.github.io/seq2pat).
2323

2424
## Quick Start
25+
26+
We present examples for constraint-based sequential pattern mining and dichotomic pattern mining.
27+
Sequences can be represented as strings or positive integers.
28+
2529
### Constraint-based Sequential Pattern Mining
2630
```python
2731
# Example to show how to find frequent sequential patterns
@@ -95,7 +99,8 @@ The library offers various constraint types, including a number of non-monotone
9599
## Usage Examples
96100

97101
Examples on how to use the available constraints can be found
98-
in the [Usage Example Notebook](https://github.com/fidelity/seq2pat/blob/master/notebooks/usage_example.ipynb).
102+
in the [Usage Example Notebook](https://github.com/fidelity/seq2pat/blob/master/notebooks/sequential_pattern_mining.ipynb).
103+
You can also find out how to scale up the mining capability, by running Seq2Pat on batches of sequences in parallel in [Batch Processing Notebook](https://github.com/fidelity/seq2pat/blob/master/notebooks/batch_processing.ipynb).
99104

100105
Supported by Seq2Pat, we proposed **Dichotomic Pattern Mining** ([X. Wang and S. Kadioglu, 2022](https://arxiv.org/abs/2201.09178)) to analyze the correlations between
101106
mined patterns and different outcomes of sequences. DPM allows generating feature vectors based on mined patterns and plays an integrator role between Sequential
@@ -109,7 +114,7 @@ our [documentation](https://fidelity.github.io/seq2pat/installation.html).
109114

110115
### Requirements
111116

112-
The library requires ```Python 3.6+```, the ```Cython``` package, and a ```C++``` compiler.
117+
The library requires ```Python 3.7+```, the ```Cython``` package, and a ```C++``` compiler.
113118
See [requirements.txt](requirements.txt) for dependencies.
114119

115120
## Support
@@ -124,11 +129,11 @@ If you use Seq2Pat in a publication, please cite it as:
124129
@article{seq2pat2022,
125130
title={Seq2Pat: Sequence-to-Pattern Generation for Constraint-based Sequential Pattern Mining},
126131
author={Wang Xin, Hosseininasab Amin, Colunga Pablo, Kadioglu Serdar, van Hoeve Willem-Jan},
127-
url={https://github.com/fidelity/textwiser},
128132
journal={Proceedings of the AAAI Conference on Artificial Intelligence},
129-
volume={TBD},
130-
number={TBD},
131-
pages={TBD},
133+
url={https://ojs.aaai.org/index.php/AAAI/article/view/21542},
134+
volume={36},
135+
number={11},
136+
pages={12665-12671},
132137
year={2022}
133138
}
134139
```
@@ -139,10 +144,9 @@ To cite the Dichotomic Pattern Mining framework, please cite it as:
139144
@article{Frontiers2022,
140145
title={Dichotomic Pattern Mining Integrated with Constraint Reasoning for Digital Behaviour Analyses},
141146
author={Sohom Ghosh, Shefali Yadav, Xin Wang, Bibhash Chakrabarty, Serdar Kadioglu},
142-
journal={Frontiers Journal on Knowledge Discovery from Unstructured Data in Finance},
143-
volume={TBD},
144-
number={TBD},
145-
pages={TBD},
147+
journal={Frontiers in Artificial Intelligence},
148+
url={https://www.frontiersin.org/articles/10.3389/frai.2022.868085},
149+
volume={5},
146150
year={2022}
147151
}
148152
```
@@ -152,9 +156,9 @@ To cite the Dichotomic Pattern Mining framework, please cite it as:
152156
title={Dichotomic Pattern Mining with Applications to Intent Prediction from Semi-Structured Clickstream Datasets},
153157
author={Xin Wang and Serdar Kadioglu},
154158
booktitle={The AAAI-22 Workshop on Knowledge Discovery from Unstructured Data in Financial Services},
155-
year={2022},
156-
eprint={2201.09178},
157-
archivePrefix={arXiv}
159+
publisher={arXiv},
160+
url={https://arxiv.org/abs/2201.09178},
161+
year={2022}
158162
}
159163
```
160164

docs/.buildinfo

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
11
# Sphinx build info version 1
22
# This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done.
3-
config: dc64ee1df7963973e104a0b15e897a4e
3+
config: f4e53c38a24fe1d8ea17a0366eda0aae
44
tags: 645f666f9bcd5a90fca523b33c5a78b7

docs/_modules/index.html

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@
1313
<script data-url_root="../" id="documentation_options" src="../_static/documentation_options.js"></script>
1414
<script src="../_static/jquery.js"></script>
1515
<script src="../_static/underscore.js"></script>
16+
<script src="../_static/_sphinx_javascript_frameworks_compat.js"></script>
1617
<script src="../_static/doctools.js"></script>
1718
<script src="../_static/js/theme.js"></script>
1819
<link rel="index" title="Index" href="../genindex.html" />

0 commit comments

Comments
 (0)