Add 'chunk position' metadata to embedded content

I tried the framework with the 'web' data loader and it worked well enough. However, I'd like to see the chunk position captured in the metadata of each document. This would tell me where in the source of the page the chunk actually is - which is useful for downstream processes at retrieval time, but also for debugging chunking strategy (e.g. size and overlap).