You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+43-3Lines changed: 43 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,9 +4,17 @@ Python utility package for scraping information on SINTA (Science and Technology
4
4
5
5
## A. Documentation
6
6
7
-
### A.1. Author Verification
7
+
### A.1. Installation
8
8
9
-
#### A.1.i. Authentication
9
+
You can install `sintautils` using PIP as follows:
10
+
11
+
```sh
12
+
pip install sintautils
13
+
```
14
+
15
+
### A.2. Author Verification
16
+
17
+
#### A.2.i. Authentication
10
18
11
19
Author verification menu is a restricted menu of SINTA. You must be registered as a university administrator and obtain an admin credential in order to use this function. An author verification (AV) admin's credential consists of an email-based username and a password.
After importing the modules and initializing the `AV` class, you can start dumping research information of a given author in SINTA using the `dump_author()` method. The following code dumps all research data pertaining to a SINTA author and saves the result to an Excel file named `sintautils_dump_author-1234.xlsx` under the current working directory. Each data category (IPR, book, Google Scholar publication, etc.) is represented by a separate Excel sheet.
39
+
40
+
```python
41
+
# Change "1234" to the respective author's SINTA ID.
42
+
scraper.dump_author('1234')
43
+
```
44
+
45
+
You can customize which data type to scrape by specifying the `fields` parameter:
Also, you can change the output format, save directory, and filename prefix as follows:
55
+
56
+
```python
57
+
# Possible values for the "out_format" parameter:
58
+
# csv, json, json-pretty, xlsx
59
+
scraper.dump_author('1234',
60
+
out_format='json-pretty',
61
+
out_folder='/path/to/save/directory',
62
+
out_prefix='filename_prefix-'
63
+
)
64
+
```
65
+
66
+
If multiple fields are specified when using `out_format=csv`, each data type will be saved as a separate CSV file under the same `out_folder` directory.
0 commit comments