Commit e7261a4

Merge branch 'main' into fix_zenodraft
2 parents d88e956 + 8a89cf9 commit e7261a4

3 files changed: +1 -4 lines changed

.pre-commit-config.yaml

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 repos:
   - repo: https://github.com/rbubley/mirrors-prettier
-    rev: v3.5.3
+    rev: v3.6.2
     hooks:
       - id: prettier

language_guides/python.md

Lines changed: 0 additions & 1 deletion
@@ -249,7 +249,6 @@ If you use another editor, perhaps it is more convenient to pick another service
 - List of other available software can be found on the [Python wiki page on debugging tools](https://wiki.python.org/moin/PythonDebuggingTools).
 
 - If you are looking for some tutorials to get started:
-
 - https://pymotw.com/2/pdb
 - https://github.com/spiside/pdb-tutorial
 - https://www.jetbrains.com/help/pycharm/2016.3/debugging.html

technology/datasets.md

Lines changed: 0 additions & 2 deletions
@@ -25,7 +25,6 @@ SQLite is a transactional database, so if you have a dataset that is changing wi
 
 - DuckDB can also create views (virtual tables) from other sources like files, other databases, but with SQLite you always have to import the data before running any queries.
 - DuckDB is multi-threaded. This can be an advantage for large databases, where aggregation queries tend to be faster than sqlite.
-
 - However if you have a really large dataset, say 100Ms of rows, and want to perform a deeply nested query, it would require substantial amount of memory, making it unfeasible to run on personal laptops.
 - There are options to customize memory handling, and push what is possible on a single machine.
 
@@ -44,7 +43,6 @@ SQLite is a transactional database, so if you have a dataset that is changing wi
 Note, if your query is deeply nested, you should have sufficient disk space for DuckDB to use; e.g. for 4 nested levels of `INNER JOIN` combined with a `GROUP BY`, we observed a disk spill over of 30x the original dataset. However we found this was not always reliable.
 
 In this kind of borderline cases, it might be possible to address the limitation by splitting the workload into chunks, and aggregating later, or by considering one of the alternatives mentioned below.
-
 - You can also optimize the queries for DuckDB, but that requires a deeper dive into the documentation, and understanding how DuckDB query optimisation works.
 
 - Both databases support setting (unique) indexes. Indexes are useful and sometimes necessary
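
Note on the context lines in technology/datasets.md: the advice about "splitting the workload into chunks, and aggregating later" could be sketched roughly as below. This is a minimal illustration, not the guide's own method. It uses the standard-library `sqlite3` module as a stand-in so it runs without extra dependencies; the table `measurements`, its columns, and the helper `chunked_group_sum` are hypothetical names invented for the example. The same pattern would presumably carry over to DuckDB, but that has not been verified here.

```python
import sqlite3
from collections import defaultdict


def chunked_group_sum(conn, table, key_col, value_col, chunk_size):
    """Compute SUM(value_col) per key_col by scanning rowid ranges in chunks.

    Hypothetical helper: identifiers are interpolated into the SQL text,
    so only pass trusted table and column names.
    """
    lo, hi = conn.execute(f"SELECT MIN(rowid), MAX(rowid) FROM {table}").fetchone()
    totals = defaultdict(float)
    if lo is None:  # empty table, nothing to aggregate
        return dict(totals)
    for start in range(lo, hi + 1, chunk_size):
        # Each query only touches one slice of the table, keeping the
        # intermediate result small.
        rows = conn.execute(
            f"SELECT {key_col}, SUM({value_col}) FROM {table} "
            "WHERE rowid >= ? AND rowid < ? "
            f"GROUP BY {key_col}",
            (start, start + chunk_size),
        )
        # Merge the partial sums from this chunk into the running totals.
        for key, partial in rows:
            totals[key] += partial
    return dict(totals)


# Tiny in-memory dataset (made up for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE measurements (station TEXT, value REAL)")
conn.executemany(
    "INSERT INTO measurements VALUES (?, ?)",
    [("a", 1.0), ("b", 2.0), ("a", 3.0), ("b", 4.0), ("a", 5.0)],
)
result = chunked_group_sum(conn, "measurements", "station", "value", chunk_size=2)
print(result)
```

Merging partial results this way only works directly for aggregates that combine associatively, such as sums, counts, minima and maxima; an average would need the sum and the count carried separately per chunk.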
