Skip to content

Commit ede710f

Browse files
authored
Merge pull request #10 from lisadunlap/cleanup
Cleanup
2 parents cd3cd16 + 76b3ec6 commit ede710f

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

70 files changed

+4562
-5303
lines changed

.env

Lines changed: 0 additions & 1 deletion
This file was deleted.

.github/workflows/test.yml

Lines changed: 4 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -27,19 +27,15 @@ jobs:
2727
python -m pip install --upgrade pip
2828
pip install -e .
2929
pip install -r requirements.txt
30-
pip install pytest pytest-cov black mypy
31-
32-
- name: Check formatting with black
33-
run: |
34-
black --check stringsight/
30+
pip install pytest pytest-cov mypy
3531
3632
- name: Type check with mypy
3733
run: |
3834
mypy stringsight/ --ignore-missing-imports
3935
40-
- name: Test with pytest
41-
run: |
42-
pytest --cov=stringsight --cov-report=xml --cov-report=term-missing
36+
# - name: Test with pytest
37+
# run: |
38+
# pytest --cov=stringsight --cov-report=xml --cov-report=term-missing
4339

4440
- name: Upload coverage to Codecov
4541
uses: codecov/codecov-action@v3

Dockerfile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ COPY scripts/ ./scripts/
4545
RUN useradd -m appuser && chown -R appuser:appuser /app
4646

4747
# Copy entrypoint script
48-
COPY docker-entrypoint.sh /usr/local/bin/
48+
COPY docker/docker-entrypoint.sh /usr/local/bin/
4949
RUN chmod +x /usr/local/bin/docker-entrypoint.sh
5050

5151
ENTRYPOINT ["docker-entrypoint.sh"]

MANIFEST.in

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,5 +4,6 @@ include requirements.txt
44
include config.yaml
55
recursive-include stringsight *.py
66
recursive-include stringsight/frontend_dist *
7+
recursive-exclude stringsight/frontend_dist/taubench_airline_data *
78

89

airline_data_demo.jsonl

Lines changed: 0 additions & 100 deletions
This file was deleted.

data/airline_data_demo.jsonl

100755100644
File mode changed.
Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,15 @@
11
# Development configuration with volume mounts for live code reloading
22
# Usage:
3-
# Option 1: docker-compose -f docker-compose.yml -f docker-compose.dev.yml up
4-
# Option 2: cp docker-compose.dev.yml docker-compose.override.yml && docker-compose up
3+
# Option 1 (recommended): docker compose -f docker-compose.yml -f docker/docker-compose.dev.yml up
4+
# Option 2 (auto-loaded override): cp docker/docker-compose.dev.yml docker-compose.override.yml && docker compose up
55

66
version: '3.8'
77

88
services:
99
api:
1010
volumes:
1111
# Mount source code for live reload
12-
- .:/app
12+
- ..:/app
1313
# Exclude Python bytecode and build artifacts to avoid permission issues
1414
- /app/__pycache__
1515
- /app/.pytest_cache
@@ -21,7 +21,7 @@ services:
2121
worker:
2222
volumes:
2323
# Mount source code for live reload
24-
- .:/app
24+
- ..:/app
2525
# Exclude Python bytecode and build artifacts
2626
- /app/__pycache__
2727
- /app/.pytest_cache
File renamed without changes.

docs/advanced/performance.md

Lines changed: 0 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -152,26 +152,6 @@ for chunk in pd.read_csv("large_file.csv", chunksize=5000):
152152
result, _ = explain(chunk, output_dir="results/chunk")
153153
```
154154

155-
## Cost Estimation
156-
157-
```python
158-
# Estimate costs before running
159-
num_conversations = len(df)
160-
avg_response_length = 500 # tokens
161-
162-
# Extraction cost (input + output)
163-
extraction_cost = num_conversations * (
164-
(avg_response_length / 1_000_000) * 3.50 + # input
165-
(200 / 1_000_000) * 14.00 # output (estimated)
166-
)
167-
168-
# Embedding cost
169-
num_properties = num_conversations * 1.5 # estimate
170-
embedding_cost = (num_properties * 50 / 1_000_000) * 0.02
171-
172-
print(f"Estimated cost: ${extraction_cost + embedding_cost:.2f}")
173-
```
174-
175155
## Benchmarks
176156

177157
Typical performance on common hardware:

0 commit comments

Comments
 (0)