Convert html to markdown #1
|
Many of the output files (e.g. ah_copy_file.md, ah_delete_file.md) have backslashes before newlines and quotes, which seems unnecessary (and less readable). GitHub's parser doesn't require them; does Tasker's? |
|
Yes, there are some issues. Pandoc seems to have issues with Although that works in HTML, it isn't good style and doesn't translate well to Markdown. It would be better to change the HTML to |
|
Well, we can either manually edit those Markdown files or edit the HTML files and then run the conversion again. |
|
I'd expect a sed script to handle stripping the superfluous backslashes. It's just a matter of creating a blacklist or whitelist and applying it; sed could help there too. Has a bug been filed for Pandoc? |
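To illustrate the whitelist idea, here is a minimal sketch in Python rather than sed. It assumes the only superfluous escapes are backslashes before quote characters and backslashes at the end of a line; the function name and the `UNESCAPE_CHARS` whitelist are hypothetical, not part of the repository.

```python
import re

# Hypothetical whitelist: characters whose preceding backslash is dropped.
# Assumption: only quote escapes are superfluous; extend as needed.
UNESCAPE_CHARS = "\"'"


def strip_superfluous_backslashes(text: str, chars: str = UNESCAPE_CHARS) -> str:
    """Remove backslashes before whitelisted characters and at line ends."""
    if chars:
        # Drop a backslash that directly precedes a whitelisted character.
        pattern = re.compile(r"\\([" + re.escape(chars) + r"])")
        text = pattern.sub(r"\1", text)
    # Drop a backslash that is the last character on a line.
    return re.sub(r"\\$", "", text, flags=re.MULTILINE)
```

Note that a trailing backslash is how Pandoc's Markdown output marks a hard line break, so removing it turns the break into an ordinary newline; whether that is acceptable depends on how Tasker renders the files. Escapes before other characters (e.g. `\*`) are left alone.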
|
No bug has been filed with Pandoc, since I am unsure whether it's a bug in Pandoc or bad code in the HTML. |
|
Backslashes seem to be in e.g. ah_copy_file.md due to |
|
Yes, Pent probably did have a tool, but for different reasons (en/index.html is hand-crafted, e.g. some list elements have closing tags and some don't). You can see the XML source for the actions and events and states. The XML's actions include 5.0's Take Screenshot and Set App Shortcuts, so it was definitely in use pre-João. I would expect that the tool was included with what Pent provided, but I don't see it here. The tool probably reads the source (res/values/*.xml), as the A-Z files and individual files have names that the XML doesn't (e.g. "Clear Key"), and the A-Z file is obviously alphabetically sorted (the XML seems to be randomized). Some of the entries (e.g. action_help_clear_encryption and action_help_airplane_radios) do include HTML (italic and bold respectively) so that's allowed. It's also possible that a tool could convert Markdown files into XML and we can come full circle. |
The script to convert html files to md files is https://github.com/emansih/TaskerDocumentation/blob/ec316935bd02e3412a0b6d7aba63ad304da3d7ae/converter.sh
All files under the en directory are OK. However, the userguide_summary.md file is borked.
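
For anyone who wants to re-run the conversion on a single directory after editing the HTML, a batch conversion might look like the sketch below. This does not reproduce the linked converter.sh; it assumes `pandoc` is on the PATH and uses only the standard `-f`, `-t` and `-o` options. The function names are hypothetical.

```python
import subprocess
from pathlib import Path


def pandoc_command(html_file: Path) -> list:
    """Build the pandoc invocation that converts one HTML file to Markdown."""
    # Output goes next to the input, with the suffix swapped to .md.
    md_file = html_file.with_suffix(".md")
    return ["pandoc", "-f", "html", "-t", "markdown",
            "-o", str(md_file), str(html_file)]


def convert_directory(directory: str) -> None:
    """Convert every *.html file directly inside `directory`."""
    for html_file in sorted(Path(directory).glob("*.html")):
        # check=True raises CalledProcessError if pandoc fails on a file.
        subprocess.run(pandoc_command(html_file), check=True)
```

Calling `convert_directory("en")` would regenerate the Markdown files for the en directory; a file that fails to convert stops the run instead of being skipped silently.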