Skip to content

Commit 19b5876

Browse files
authored
fixed punctation [ci skip]
1 parent 4c1d099 commit 19b5876

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

notebooks/02-dimension-reduction-and-discretization.ipynb

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -425,7 +425,7 @@
425425
"metadata": {},
426426
"source": [
427427
"### Streaming memory discretization\n",
428-
"For real world case examples it is often not possible to load entire datasets into main memory. We can perform the whole discretization step without the need of having the dataset fit into memory. Keep in mind that this is not as efficient as loading into memory, because certain things (eg. featurization) will have to be recomputed during iterations."
428+
"For real world case examples it is often not possible to load entire datasets into main memory. We can perform the whole discretization step without the need of having the dataset fit into memory. Keep in mind that this is not as efficient as loading into memory, because certain calculations (e.g. featurization), will have to be recomputed during iterations."
429429
]
430430
},
431431
{
@@ -447,7 +447,7 @@
447447
"cell_type": "markdown",
448448
"metadata": {},
449449
"source": [
450-
"We should mention that regular space clustering does not require to load the TICA output into memory, while $k$-means does. Use the minibatch version, if your TICA output does not fit memory. Since the minibatch version takes more time to converge, it is therefore desirable to to shrink the TICA output to fit into memory. We split the pipeline for cluster estimation, and re-use the reader to for the assignment of the full dataset."
450+
"We should mention that regular space clustering does not require to load the TICA output into memory, while $k$-means does. Use the minibatch version if your TICA output does not fit memory. Since the minibatch version takes more time to converge, it is therefore desirable to to shrink the TICA output to fit into memory. We split the pipeline for cluster estimation, and re-use the reader to for the assignment of the full dataset."
451451
]
452452
},
453453
{

0 commit comments

Comments
 (0)