C Javadoc Extractor

This tool extracts "documentation" comments from source files and "assembles" them into markdown files mirroring the structure of the source code but without the actual "code". github and other tools now commonly present such files in an enhanced manner to make them more readable than just plain text but they remain very readable without fancy formatting.

In some sense this tool is the dual of a compiler which completely ignores your comments when it compiles your source code since this tool in turn completely ignores the code.

Wait, what? Why?

Yes, we completely ignore the code. The ridiculously simple idea is that a developer simply wants their code and documentation in the same place to simplify the task of keeping them in sync and while some inconsistencies can be caught by parsing both the documentation and code together, an LLM based tool is going to do this task better in the long run so we want to be the stupid, easy, and predictable tool for documentation extraction.

Ordering

Ordering is based on tags and then what appears after tags (using simple ASCII based alphabetization for the early versions). While we are not doing anything clever to understand UTF-8, that is the expected format of all source files and more importantly the documentation comments.

"Unix" Usage

The simplest way to to use the tool is to install "tcc" into /usr/bin (and then no explicit compilation is required, just pretend it's a shell script or a precompiled binary):

  c-javadoc-extractor --output-dir=src-doc *.[ch]

Or if you have a "complex" directory structure:

  find . -name '*.[ch]' | xargs c-javadoc-extractor --output-dir=src-doc

If someone wants to make a package for debian or arch then I would be delighted. ¹

Usage Windows

TBD.

I'm seriously considering using Co

Output Structure/Format

The output structure is one or more files placed into the directory specified by --output-dir= (or the current directory if --output-dir if not specified, not recommended (unless you only have a couple of files in your repo and don't expect it to grow in which case you may want to rename "index.md" to "README.md" using the option --index-file-name=README.md").

A single root file called index.md is always created (unless --no-index=true or --index-file-name=xyz are used) which will contain links to all other generated files (in a nice alphabetical/hierarchical order (according to strcmp, so possibly much better in English and may depend on the C locale --- I admit we should improve on that).

All generated files are named the same as each input file except the extension (i.e., ".c" or ".h") is replace with ".md". If there are javadoc comments in foo.c and foo.h (in the same directory), then they are merged. So document things where they make the most sense to you.

My style would be to only document in a C file unless the entity only exists in a header file. This is where the person writing code is mostly likely to see the documention and therefore either abide by it or enhance it since it is so close to where they are editing.

If you rename your source files, then as long as your "src-doc" directory doesn't contain your own hand-generated artifacts (which is why the default was named that instead of "doc" or "documentation" by the way), simply erase it before generation if you are using a version control system like git which is both smart enough to realize when file contents haven't changed and when they have probably moved or been deleted.

Comparision with Doyxgen

Doxygen was designed to generate HTML output by default (as far as I can tell). I asked "bard" and it says the "-m" or OUTPUT_FORMAT = MARKDOWN options enable Doxygen to also produce markdown format instead of HTML (and this is actually a real thing, see https://www.doxygen.nl/manual/markdown.html). However, I think these options just tell Doxygen that that your documentation comments are markdown not that it should itself generate markdown. It looks like there are tools such as Moxygen that will convert an XML file that Doyxgen can produce to markdown. I have not tried that tool.

When Doxygen emits HTML, it certainly does much more than this tool does (for example, it clearly invokes "dot" to create graphical output of graphs of some sort) but I was actually a bit confused the first time I used Doxygen. Maybe I was confused by how the default HTML is styled badly and didn't fit on my laptop's screen all that well - I don't think it employs so called "responsive" HTML (though maybe someone else solved that via CSS you can just include into you Doyxgen somehow)). I was just looking at doxygen manual itself on a bigger screen, and while it is not typographically a work of art, it's OK - definitely not "repsonsive" - a column or two should disappear as the width of the window is decreased instead the text part just keeps shrinking).

Doxygen supports many more formats for documentation comments than just "javadoc" (which is only convenient is C type languages) though consistency matters and I'm pretty comfortable with the Javadoc comment syntax myself for C derived languages that accept "/" and "/" as comment delimiters ¹.

Incremental Usage

--no-index=true provides some capabilities for incremental update though the goal is to be very fast and I develop everything on a 1.3GHz processor Pixelbook so a faster machine should make very light work of large code bases.

Status

Well obviously this has actually been written yet. Comment extraction is complete, we should have our initial output soon.

Footnotes

I'm not a fan now of putting HTML style markdown in Javadoc although being "markdown" means this is legal and possibly desireable if you want to do something markdown can't handle. As an author and a reader, I just want comments to be pretty. ↩ ↩²

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
src-doc		src-doc
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
_clang-format		_clang-format
main.c		main.c
meson.build		meson.build

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

C Javadoc Extractor

Wait, what? Why?

Tags

Ordering

"Unix" Usage

Usage Windows

Output Structure/Format

Comparision with Doyxgen

Incremental Usage

Status

Footnotes

About

Uh oh!

Releases

Packages

Languages

License

jasonaaronwilson/c-javadoc-extractor

Folders and files

Latest commit

History

Repository files navigation

C Javadoc Extractor

Wait, what? Why?

Tags

Ordering

"Unix" Usage

Usage Windows

Output Structure/Format

Comparision with Doyxgen

Incremental Usage

Status

Footnotes

Footnotes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages