Create bib. sections using an automated search for referenced citations by Copilot · Pull Request #4700 · JacquesCarette/Drasil

Copilot · 2026-01-28T19:12:54Z

Blocked on #4731

Avoid using findAllCitations to gather bibliography entries.

Why? findAllCitations gets all instances of Citations from a ChunkDB which means we gather more than we need for the documents.

This PR uses Set whenever possible, but does not clean up code related to the existing 'gather all referenced symbols' code for the table of symbols creation. That should be done in a separate PR.

Co-authored-by: balacij <1627302+balacij@users.noreply.github.com>

Previously only extracted citations from Ref constructors in sentences. Now also extracts from DecRefs stored in DataDefinition, GenDefn, InstanceModel, and TheoryModel structures. This fixes missing citations like koothoor2013 in SWHSNoPCM which are referenced via dRefInfo in InstanceModels. Co-authored-by: balacij <1627302+balacij@users.noreply.github.com>

Copilot seems to forget to add dependencies...

Pull fixes from b76a720 that implement thorough sentence extraction: - Extract sentences from IOrgSec's Section parameter - Extract from PDSub subsections - Extract from CorrSolnPpties and FReqsSub LabelledContent - Extract from Defini and Graph comprehensively - Add helper functions getIntroSub, getPDSub, getC, getLC Revert GetChunks back to using findAllCitations - the plate-based approach was incomplete. The real issue was missing sentence extraction in ExtractDocDesc, which is now fixed. Remove ExtractNotebook.hs as it's no longer needed. Co-authored-by: balacij <1627302+balacij@users.noreply.github.com>

code/drasil-docLang/lib/Drasil/ExtractDocDesc.hs

code/drasil-docLang/lib/Drasil/ExtractNotebook.hs

code/drasil-docLang/lib/Drasil/GetChunks.hs

code/drasil-docLang/lib/Drasil/ExtractDocDesc.hs

…xtraction - Re-add getCitations, citeDB, and modelRefPlate functions to ExtractDocDesc - Restore ExtractNotebook.hs for notebook citation extraction - Update GetChunks to export lookupCitations helper (not findAllCitations) - Keep comprehensive sentence extraction improvements from upstream - Remove commented ciPlate code as requested This combines the plate-based citation extraction approach with the improved sentence traversal logic, ensuring all citations are found while maintaining the goal of only including references actually used in documents. Co-authored-by: balacij <1627302+balacij@users.noreply.github.com>

The issue was that some citations (like smithLai2005 in orgOfDocIntro) are only created when DocDesc is converted to Sections during mkSections. The previous extraction only scanned DocDesc, missing these citations. Solution: - Added getCitationsFromSections to extract from generated Section list - Added citeDBFromSections to look up citations from sections - Modified fillReferences to extract from both DocDesc AND generated sections - Updated mkSections to accept optional BibRef for second pass - In mkDoc, compute full bibliography after fillReferences and pass to mkSections This ensures all citations are found, including those in sentences generated during document construction like the organization section intro. Co-authored-by: balacij <1627302+balacij@users.noreply.github.com>

Co-authored-by: balacij <1627302+balacij@users.noreply.github.com>

balacij · 2026-02-04T19:29:56Z

@JacquesCarette @smiths This PR is ready for review now. Thanks!

@Xinlu-Y If you don't mind reviewing it as well, that would be appreciated!