This is a research repository on the direction of PDF documents and extracting various PDF metadata for information.
The research can be found here.
- Install NodeJS 12+.
- Install all the 3rd party libraries
npm install. - Write out the environment variables in a
.envfile (follow.sample.env). - Run
npm run downloadto download all the PDF files locally. - Run
npm run parseto parse the metadata and output the summary locally.