Skip to content

Commit 3a34da0

Browse files
committed
Merge branch 'main' into update-readme
2 parents 5f8f6a9 + 4b42fbe commit 3a34da0

15 files changed

+4475
-2250
lines changed

Pipfile

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,18 +8,19 @@ flickrapi = "*"
88
GitPython = "*"
99
google-api-python-client = "*"
1010
h11 = ">=0.16.0" # Ensure dependency is secure
11-
internetarchive = "*"
11+
internetarchive = ">=5.5.1"
1212
jupyterlab = ">=3.6.7"
1313
matplotlib = "*"
1414
numpy = "*"
1515
pandas = "*"
1616
plotly = "*"
17+
pillow = ">=11.3.0" # Ensure dependency is secure
1718
Pyarrow = "*"
1819
Pygments = "*"
1920
python-dotenv = "*"
2021
requests = ">=2.31.0"
2122
seaborn = "*"
22-
urllib3 = ">=1.26.18"
23+
urllib3 = ">=2.5.0"
2324
wordcloud = "*"
2425

2526
[dev-packages]

Pipfile.lock

Lines changed: 1563 additions & 1285 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

README.md

Lines changed: 35 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,23 @@ This project seeks to quantify the size and diversity of the commons--the
99
collection of works that are openly licensed or in the public domain.
1010

1111

12+
### Meaningful
13+
14+
The reports generated by this project (and the data fetched and processed to
15+
support it) seeks to be meaningful. We hope this project will provide data and
16+
analysis that helps inform discussions about the commons--the collection of
17+
works that are openly licensed or in the public domain.
18+
19+
The goal of this project is to help answer questions like:
20+
- How has the world's use of the commons changed over time?
21+
- How is the knowledge and culture of the commons distributed?
22+
- Who has access (and how much) to the commons?
23+
- What significant trends can be observed in the commons?
24+
- Which public domain dedication or licenses are the most popular?
25+
- What are the correlations between public domain dedication or licenses and
26+
region, language, domain/endeavor, etc.?
27+
28+
1229
## Code of conduct
1330

1431
[`CODE_OF_CONDUCT.md`][org-coc]:
@@ -106,10 +123,9 @@ modules:
106123
[homebrew]: https://brew.sh/
107124

108125

109-
### Running scripts that require client credentials
126+
### Managing client credentials
110127

111-
To successfully run scripts that require client credentials, you will need to
112-
follow these steps:
128+
Client credentials should be stored in an environment file:
113129
1. Copy the contents of the `env.example` file in the script's directory to
114130
`.env`:
115131
```shell
@@ -122,8 +138,22 @@ follow these steps:
122138
GCS_CX = your_pse_id
123139
```
124140
3. Save the changes to the `.env` file.
125-
4. You should now be able to run scripts that require client credentials
126-
without any issues.
141+
142+
You should now be able to run scripts that require client credentials without
143+
any issues. The `.env` file is ignored by git to help ensure sensitive data is
144+
not distributed.
145+
146+
147+
### Running the scripts
148+
149+
All of the scripts should be run from the root of the repository using pipenv. For example:
150+
```bash
151+
pipenv run ./scripts/1-fetch/github_fetch.py -h
152+
```
153+
154+
When run this way, the shared library (`scripts/shared.py`) provides easy access
155+
to all of the necessary paths and all of the modules managed by pipenv are
156+
available.
127157
128158
129159
### Static analysis

0 commit comments

Comments
 (0)