
Commit 97f6671

Merge: 2 parents badc7d8 + ebaddd2

22 files changed: +2470 −1085 lines

docs/_static/donotdelete

Whitespace-only changes.

docs/api/storage.rst

Lines changed: 3 additions & 0 deletions

@@ -14,4 +14,7 @@ can be used as a Zarr array store.
 .. autoclass:: TempStore
 .. autoclass:: ZipStore

+    .. automethod:: close
+    .. automethod:: flush
+
 .. autofunction:: migrate_1to2
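The hunk above documents the new ``close`` and ``flush`` methods on ``ZipStore``. The reason a Zip-backed store needs them can be illustrated with the standard-library ``zipfile`` module (a stand-in here, not zarr's API): a Zip archive's central directory, the index of all entries, is only written when the archive is closed.

```python
import os
import tempfile
import zipfile

# The central directory of a Zip archive is written on close(), which is
# why stores backed by a Zip file need an explicit close() after writing.
path = os.path.join(tempfile.mkdtemp(), 'example.zip')

zf = zipfile.ZipFile(path, mode='w')
zf.writestr('foo/bar/0.0', b'\x00' * 16)   # stand-in for a chunk
zf.writestr('foo/bar/.zarray', b'{}')      # stand-in for array metadata
zf.close()  # flushes entries and writes the central directory

# After close(), the archive is complete and readable.
with zipfile.ZipFile(path) as zf:
    print(zf.namelist())  # ['foo/bar/0.0', 'foo/bar/.zarray']
```

Skipping the ``close()`` call leaves the archive without its central directory, so readers cannot locate the stored entries.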

docs/release.rst

Lines changed: 14 additions & 0 deletions

@@ -4,6 +4,20 @@ Release notes
 * Added :class:`zarr.storage.TempStore` class for convenience to provide
   storage via a temporary directory
   (`#59 <https://github.com/alimanfoo/zarr/issues/59>`_)
+* Fixed performance issues with ``ZipStore`` class
+  (`#66 <https://github.com/alimanfoo/zarr/issues/66>`_).
+* The Blosc extension has been modified to return bytes instead of array
+  objects from compress and decompress function calls. This should
+  improve compatibility and also provides a small performance increase for
+  compressing high compression ratio data
+  (`#55 <https://github.com/alimanfoo/zarr/issues/55>`_).
+* Added ``overwrite`` keyword argument to array and group creation methods
+  on the :class:`zarr.hierarchy.Group` class
+  (`#71 <https://github.com/alimanfoo/zarr/issues/71>`_).
+* Added ``cache_metadata`` keyword argument to array creation methods.
+* The functions :func:`zarr.creation.open_array` and
+  :func:`zarr.hierarchy.open_group` now accept any store as first argument
+  (`#56 <https://github.com/alimanfoo/zarr/issues/56>`_).

 .. _release_2.0.1:
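The release note about the Blosc extension returning bytes can be sketched with zlib as a stand-in compressor (zlib is not Blosc, but the compress/decompress contract is the same): both calls traffic in plain ``bytes``, so no intermediate array objects are created.

```python
import zlib

import numpy as np

# Sketch of a bytes-in/bytes-out compression round trip, using zlib as a
# stand-in for the Blosc extension described in the release notes.
data = np.arange(1000, dtype='i8')

compressed = zlib.compress(data.tobytes(), 5)
assert isinstance(compressed, bytes)  # bytes, not an array object

restored = np.frombuffer(zlib.decompress(compressed), dtype='i8')
assert np.array_equal(restored, data)

# Highly compressible data shrinks dramatically.
print(data.nbytes, '->', len(compressed))
```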

docs/spec/v2.rst

Lines changed: 1 addition & 0 deletions

@@ -442,6 +442,7 @@ Here is the same example using a Zip file as storage::
     >>> sub_grp = root_grp.create_group('foo')
     >>> a = sub_grp.create_dataset('bar', shape=(20, 20), chunks=(10, 10))
     >>> a[:] = 42
+    >>> store.close()

 What has been stored::
docs/tutorial.rst

Lines changed: 8 additions & 2 deletions

@@ -230,7 +230,7 @@ the delta filter::
     ...                chunks=(1000, 1000), compressor=compressor)
     >>> z
     Array((10000, 10000), int32, chunks=(1000, 1000), order=C)
-      nbytes: 381.5M; nbytes_stored: 248.9K; ratio: 1569.6; initialized: 100/100
+      nbytes: 381.5M; nbytes_stored: 248.9K; ratio: 1569.7; initialized: 100/100
       compressor: LZMA(format=1, check=-1, preset=None, filters=[{'dist': 4, 'id': 3}, {'preset': 1, 'id': 33}])
       store: dict

@@ -327,7 +327,7 @@ provided that all processes have access to a shared file system. E.g.::
     ...                synchronizer=synchronizer)
     >>> z
     Array((10000, 10000), int32, chunks=(1000, 1000), order=C)
-      nbytes: 381.5M; nbytes_stored: 326; ratio: 1226993.9; initialized: 0/100
+      nbytes: 381.5M; nbytes_stored: 323; ratio: 1238390.1; initialized: 0/100
       compressor: Blosc(cname='lz4', clevel=5, shuffle=1)
       store: DirectoryStore; synchronizer: ProcessSynchronizer

@@ -515,6 +515,7 @@ Here is an example storing an array directly into a Zip file::
       nbytes: 3.8M; nbytes_stored: 21.8K; ratio: 179.2; initialized: 100/100
       compressor: Blosc(cname='lz4', clevel=5, shuffle=1)
       store: ZipStore
+    >>> store.close()
     >>> import os
     >>> os.path.getsize('example.zip')
     30721

@@ -536,12 +537,17 @@ Re-open and check that data have been written::
            [42, 42, 42, ..., 42, 42, 42],
            [42, 42, 42, ..., 42, 42, 42],
            [42, 42, 42, ..., 42, 42, 42]], dtype=int32)
+    >>> store.close()

 Note that there are some restrictions on how Zip files can be used,
 because items within a Zip file cannot be updated in place. This means
 that data in the array should only be written once and write
 operations should be aligned with chunk boundaries.

+Note also that the ``close()`` method must be called after writing any data to
+the store, otherwise essential records will not be written to the underlying
+zip file.
+
 The Dask project has implementations of the ``MutableMapping``
 interface for distributed storage systems, see the `S3Map
 <http://s3fs.readthedocs.io/en/latest/api.html#s3fs.mapping.S3Map>`_
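The first tutorial hunk above shows a ~1570x compression ratio from combining a delta filter with LZMA. Why a delta filter helps so much can be sketched with numpy and zlib (stand-ins for zarr's filter pipeline and LZMA): differencing monotonically increasing data turns it into a stream of identical small values, which compresses far better than the raw values do.

```python
import zlib

import numpy as np

# Delta filtering: compress raw values vs. their successive differences.
data = np.arange(100000, dtype='i4')

raw = zlib.compress(data.tobytes(), 1)
delta = np.diff(data, prepend=data[:1])   # delta-encode: [0, 1, 1, 1, ...]
filtered = zlib.compress(delta.tobytes(), 1)

print('raw ratio:   %.1f' % (data.nbytes / len(raw)))
print('delta ratio: %.1f' % (data.nbytes / len(filtered)))

# Decoding reverses the filter with a cumulative sum.
assert np.array_equal(np.cumsum(delta), data)
```

On data like this the delta-encoded stream compresses orders of magnitude better, which is the effect behind the ratio shown in the diff.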
Lines changed: 200 additions & 0 deletions

@@ -0,0 +1,200 @@
+{
+ "cells": [
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'2.0.1'"
+      ]
+     },
+     "execution_count": 1,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "import numpy as np\n",
+    "import zarr\n",
+    "zarr.__version__"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "10 loops, best of 3: 110 ms per loop\n",
+      "1 loop, best of 3: 235 ms per loop\n",
+      "Array((100000000,), int64, chunks=(200000,), order=C)\n",
+      "  nbytes: 762.9M; nbytes_stored: 11.2M; ratio: 67.8; initialized: 500/500\n",
+      "  compressor: Blosc(cname='lz4', clevel=5, shuffle=1)\n",
+      "  store: dict\n"
+     ]
+    }
+   ],
+   "source": [
+    "z = zarr.empty(shape=100000000, chunks=200000, dtype='i8')\n",
+    "data = np.arange(100000000, dtype='i8')\n",
+    "%timeit z[:] = data\n",
+    "%timeit z[:]\n",
+    "print(z)\n",
+    "assert np.all(z[:] == data)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "1 loop, best of 3: 331 ms per loop\n",
+      "1 loop, best of 3: 246 ms per loop\n",
+      "Array((100000000,), float64, chunks=(200000,), order=C)\n",
+      "  nbytes: 762.9M; nbytes_stored: 724.8M; ratio: 1.1; initialized: 500/500\n",
+      "  compressor: Blosc(cname='lz4', clevel=5, shuffle=1)\n",
+      "  store: dict\n"
+     ]
+    }
+   ],
+   "source": [
+    "z = zarr.empty(shape=100000000, chunks=200000, dtype='f8')\n",
+    "data = np.random.normal(size=100000000)\n",
+    "%timeit z[:] = data\n",
+    "%timeit z[:]\n",
+    "print(z)\n",
+    "assert np.all(z[:] == data)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 1,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "'2.0.2.dev0+dirty'"
+      ]
+     },
+     "execution_count": 1,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "import numpy as np\n",
+    "import sys\n",
+    "sys.path.insert(0, '..')\n",
+    "import zarr\n",
+    "zarr.__version__"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 2,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "10 loops, best of 3: 92.7 ms per loop\n",
+      "1 loop, best of 3: 230 ms per loop\n",
+      "Array((100000000,), int64, chunks=(200000,), order=C)\n",
+      "  nbytes: 762.9M; nbytes_stored: 11.2M; ratio: 67.8; initialized: 500/500\n",
+      "  compressor: Blosc(cname='lz4', clevel=5, shuffle=1)\n",
+      "  store: dict\n"
+     ]
+    }
+   ],
+   "source": [
+    "z = zarr.empty(shape=100000000, chunks=200000, dtype='i8')\n",
+    "data = np.arange(100000000, dtype='i8')\n",
+    "%timeit z[:] = data\n",
+    "%timeit z[:]\n",
+    "print(z)\n",
+    "assert np.all(z[:] == data)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 3,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "1 loop, best of 3: 338 ms per loop\n",
+      "1 loop, best of 3: 253 ms per loop\n",
+      "Array((100000000,), float64, chunks=(200000,), order=C)\n",
+      "  nbytes: 762.9M; nbytes_stored: 724.8M; ratio: 1.1; initialized: 500/500\n",
+      "  compressor: Blosc(cname='lz4', clevel=5, shuffle=1)\n",
+      "  store: dict\n"
+     ]
+    }
+   ],
+   "source": [
+    "z = zarr.empty(shape=100000000, chunks=200000, dtype='f8')\n",
+    "data = np.random.normal(size=100000000)\n",
+    "%timeit z[:] = data\n",
+    "%timeit z[:]\n",
+    "print(z)\n",
+    "assert np.all(z[:] == data)"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": []
+  }
+ ],
+ "metadata": {
+  "kernelspec": {
+   "display_name": "Python 3",
+   "language": "python",
+   "name": "python3"
+  },
+  "language_info": {
+   "codemirror_mode": {
+    "name": "ipython",
+    "version": 3
+   },
+   "file_extension": ".py",
+   "mimetype": "text/x-python",
+   "name": "python",
+   "nbconvert_exporter": "python",
+   "pygments_lexer": "ipython3",
+   "version": "3.5.1"
+  }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 1
+}
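The notebook above benchmarks zarr array writes and reads with ``%timeit``. The same style of benchmark can be sketched with the standard library alone, using zlib-compressed chunks in a plain dict as a stand-in store (the notebook itself benchmarks zarr arrays; sizes here are scaled down):

```python
import timeit
import zlib

import numpy as np

# Stand-in chunked store: compress 100k-element slices into a dict,
# then reassemble them on read, timing both directions.
shape, chunk = 1000000, 100000
data = np.arange(shape, dtype='i8')

def write():
    store = {}
    for i in range(0, shape, chunk):
        store[i] = zlib.compress(data[i:i + chunk].tobytes(), 1)
    return store

store = write()

def read():
    parts = [np.frombuffer(zlib.decompress(store[i]), dtype='i8')
             for i in range(0, shape, chunk)]
    return np.concatenate(parts)

print('write: %.3f s' % min(timeit.repeat(write, number=1, repeat=3)))
print('read:  %.3f s' % min(timeit.repeat(read, number=1, repeat=3)))
assert np.array_equal(read(), data)
```

Taking the best of several repeats, as ``timeit.repeat`` does here, mirrors what ``%timeit`` reports in the notebook.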
