Make data sources traceable by having content source url included in the parsed text. Add a flag to include a link to the source url in the generated dataset. cc: @murilopmachado @egecansen