@@ -9,165 +9,145 @@ This page provides a detailed guide to the main functions, classes, and extensib
### `parse`

```python
-from hario_core import parse
+from hario_core.parse import parse
```

Parses a HAR file from a path, bytes, or file-like object and returns a validated `HarLog` model. Automatically selects the correct Pydantic model for each entry (including extensions).

Returns a function that generates a random UUID for each entry.

**Signature:**
```python
-def uuid() -> EntryIdFn
+def uuid() -> Callable[[dict], str]
```

-**Example:**
-```python
-from hario_core.utils import uuid
-id_fn = uuid()
-```
-
----
-
-## Transformers
-
-Transformers are functions that mutate or normalize HAR entry data for storage or analysis.
-
-
### `flatten`
-
Flattens nested structures in a HAR entry to a flat dict with keys joined by separator. If a list is encountered, array_handler is called (default: str). Useful for exporting to CSV, analytics, or custom DB schemas.
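
To make the described behaviour concrete, here is a minimal standalone sketch of the flattening idea. It is not hario_core's implementation: the function name `flatten_entry` and its exact parameters are assumptions chosen for illustration, and only the separator-joined keys and the `array_handler` default of `str` come from the description above.

```python
from typing import Any, Callable, Dict


def flatten_entry(
    entry: Dict[str, Any],
    separator: str = ".",
    array_handler: Callable[[list], Any] = str,
) -> Dict[str, Any]:
    """Hypothetical helper: flatten nested dicts, passing each list to array_handler."""
    flat: Dict[str, Any] = {}

    def walk(value: Any, prefix: str) -> None:
        if isinstance(value, dict):
            # Descend into nested dicts, extending the key path with the separator.
            for key, child in value.items():
                walk(child, f"{prefix}{separator}{key}" if prefix else str(key))
        elif isinstance(value, list):
            # Lists are not expanded; the handler decides how to store them.
            flat[prefix] = array_handler(value)
        else:
            flat[prefix] = value

    walk(entry, "")
    return flat


entry = {
    "request": {"method": "GET", "headers": [{"name": "Accept", "value": "*/*"}]},
    "timings": {"wait": 12.5},
}
flat = flatten_entry(entry)
```

With the default handler, `flat["request.headers"]` holds the string form of the header list, which keeps every value scalar for a CSV or database column. Passing a different `array_handler` (for example `len` or `json.dumps`) changes only how lists are stored.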

docs/changelog.md: 10 additions & 0 deletions
@@ -1,5 +1,15 @@
# Changelog

+### v0.4.0
+- BREAKING: Pipeline now requires a list of transformers and a PipelineConfig instance (no more id_fn/id_field in constructor).
+- BREAKING: Pipeline.process now expects a list of dicts (e.g., from HarLog.model_dump()["entries"]).
+- New: PipelineConfig class for configuring batch size, processing strategy (sequential/thread/process/async), and max_workers.
+- New: Parallel and batch processing strategies for large HAR files (process, thread, async).
+- New: Benchmarks and benchmarking scripts for pipeline performance (see `benchmarks/`).
+- New: All transformers (`flatten`, `normalize_sizes`, `normalize_timings`, `set_id`) are now implemented as picklable callable classes, fully compatible with multiprocessing.
+- New: `set_id` transformer for assigning IDs to entries using any function (e.g., by_field, uuid).
+- Internal: Test suite and samples updated for new API and real-world HAR compatibility.
+
### v0.3.1
- FIX real-world HAR compatibility: made nested fields like `postData.params` optional in models, so parsing DevTools and other real HAR files is more robust.
- All test samples are now based on real HAR data with valid `pages` and `pageref` links.
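
The v0.4.0 notes describe batching a list of entry dicts and running callable-class transformers over them under a chosen strategy. The sketch below illustrates that general idea with the standard library only. Every name in it (`run_pipeline`, `SetId`, `url_id`, and the keyword arguments) is hypothetical and does not reflect the actual `Pipeline` or `PipelineConfig` signatures, which this changelog does not show.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, Iterator, List

Entry = Dict[str, Any]
Transformer = Callable[[Entry], Entry]


class SetId:
    """Hypothetical callable-class transformer: stores id_fn(entry) under id_field."""

    def __init__(self, id_fn: Callable[[Entry], str], id_field: str = "id") -> None:
        self.id_fn = id_fn
        self.id_field = id_field

    def __call__(self, entry: Entry) -> Entry:
        entry[self.id_field] = self.id_fn(entry)
        return entry


def url_id(entry: Entry) -> str:
    """Hypothetical ID function: use the request URL as the ID."""
    return entry["request"]["url"]


def batches(entries: List[Entry], batch_size: int) -> Iterator[List[Entry]]:
    """Yield consecutive slices of at most batch_size entries."""
    for start in range(0, len(entries), batch_size):
        yield entries[start:start + batch_size]


def apply_all(batch: List[Entry], transformers: List[Transformer]) -> List[Entry]:
    """Run every transformer, in order, over each entry of one batch."""
    out = []
    for entry in batch:
        item = dict(entry)  # shallow copy: top-level keys of the input stay untouched
        for transform in transformers:
            item = transform(item)
        out.append(item)
    return out


def run_pipeline(
    entries: List[Entry],
    transformers: List[Transformer],
    batch_size: int = 2,
    strategy: str = "sequential",
    max_workers: int = 2,
) -> List[Entry]:
    chunks = list(batches(entries, batch_size))
    if strategy == "thread":
        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            # Executor.map returns results in input order, so batch order is kept.
            results = list(pool.map(lambda c: apply_all(c, transformers), chunks))
    elif strategy == "sequential":
        results = [apply_all(c, transformers) for c in chunks]
    else:
        raise ValueError(f"unsupported strategy in this sketch: {strategy}")
    return [entry for chunk in results for entry in chunk]


entries = [{"request": {"url": f"https://example.com/{i}"}} for i in range(5)]
sequential = run_pipeline(entries, [SetId(url_id)], strategy="sequential")
threaded = run_pipeline(entries, [SetId(url_id)], strategy="thread")
```

Writing transformers as classes with `__call__`, rather than closures or lambdas, matters for a process-based strategy: instances of importable module-level classes can be pickled and sent to worker processes, whereas locally defined functions generally cannot. The sketch stops at threads to stay short.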