You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
> Each ZIM file has "main page" attribute which defines the landing page set for the ZIM archive.
168
-
> It is often different than the "main page" of upstream Wikipedia.
167
+
> Each ZIM file has "main page" attribute which defines the landing page set for the ZIM archive.
168
+
> It is often different than the "main page" of upstream Wikipedia.
169
169
> Kiwix Main page needs to be passed in the next step, so until there is an automated way to determine "main page" of ZIM, you need to open ZIM in Kiwix reader and eyeball the name of the landing page.
170
170
171
171
### Step 5: Convert the unpacked zim directory to a website with mirror info
@@ -242,7 +242,7 @@ Make sure at least two full reliable copies exist before updating DNSLink.
242
242
243
243
## mirrorzim.sh
244
244
245
-
It is possible to automate steps 3-6 via a wrapper script named `mirrorzim.sh`.
245
+
It is possible to automate steps 3-6 via a wrapper script named `mirrorzim.sh`.
246
246
It will download the latest snapshot of specified language (if needed), unpack it, and add it to IPFS.
247
247
248
248
To see how the script behaves try running it on one of the smallest wikis, such as `cu`:
A `Dockerfile` with all the software requirements is provided.
256
+
A `Dockerfile` with all the software requirements is provided.
257
257
For now it is only a handy container for running the process on non-Linux
258
-
systems or if you don't want to pollute your system with all the dependencies.
258
+
systems or if you don't want to pollute your system with all the dependencies.
259
259
In the future it will be end-to-end blackbox that takes ZIM and spits out CID
260
260
and repo.
261
261
@@ -340,3 +340,7 @@ We are working on improving deduplication between snapshots, but for now YMMV.
340
340
## Code
341
341
342
342
If you would like to contribute more to this effort, look at the [issues](https://github.com/ipfs/distributed-wikipedia-mirror/issues) in this github repo. Especially check for [issues marked with the "wishlist" label](https://github.com/ipfs/distributed-wikipedia-mirror/labels/wishlist) and issues marked ["help wanted"](https://github.com/ipfs/distributed-wikipedia-mirror/labels/help%20wanted).
343
+
344
+
## GitHub Actions Workflow
345
+
346
+
The GitHub Actions workflow that is available in this repository takes information about the wiki website that you want to mirror, downloads its' zim, unpacks it, converts it to a website and uploads it to S3 as a tar.gz package which is publicly accessible.
0 commit comments