You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Xet upload: backtrack when dedup info is received (#1708)
Fix#1703
cc @assafvayner for viz, cc @Kakulukian too
## Note
Only backtrack since the end of the last file, and only in the current
xorb.
It means that we maybe lose ~2MB on average at the end of a xorb - only
if we filled the first 60MB of the xorb with new data
## Improvement
Running `pnpm --filter hub bench`:
```console
=== BENCHMARK RESULTS ===
File Statistics:
================
📄 64-8bits.tflite:
Size: 119.36 MB
Deduplication: 99.90%
📄 64-fp16.tflite:
Size: 236.77 MB
Deduplication: 100.00%
=== SUMMARY ===
Total files: 2
Total size: 356.13 MB
Total xorbs: 1
Total shards: 1
Total xorb bytes: 119 926 bytes
Total shard bytes: 1 400 bytes
Average deduplication: 99.95%
```
we bump the second file from 83% to 100% dedup
0 commit comments