You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/todo.rst
+11-1Lines changed: 11 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,4 +16,14 @@ Add PMCID and grants to pymed package.
16
16
17
17
Add expanded search to orcid package or look for more up to date package to use. Expanded search was added to ORCID's API with 3.0 release. orcid package appears to be 2.0 only.
18
18
19
-
Add capability to get the citations each paper cites.
19
+
Add capability to get the citations each paper cites.
20
+
21
+
Switch to a merge style from each source, so try to fill in information that wasn't found previously.
22
+
Keep the queries from each source, and do 2 passes with the new merge logic. This makes it so that if
23
+
a publication was on PubMed, but an author couldn't be matched, but an author was matched at another
24
+
source we can merge the information with the second pass. Would need to change the logic to first look
25
+
and see if the publication is in the list already and if it is then we don't need to make an author match
26
+
because an author match was made from another source. The big changes are to keep the queries, do a second
27
+
pass, merge information, and use existence in the list as verification in addition to an author match.
28
+
29
+
Save references out in "citation" format. Look at formats Google Scholar offers, example EndNote.
--prev-pub=<file-path> Filepath to json or csv with publication ids to ignore.
35
39
Enter "ignore" for the <file_path> to not look for previous publications.json files in tracker directories.
36
40
--prev_pub=<file-path> Deprecated. Use --prev-pub instead.
41
+
--save-all-queries Save all queried results from each source in "all_results.json".
42
+
--citation-match-ratio=<num> An integer from 0-100. The threshold to consider 2 citations the same using fuzzy matching. Lower is more forgiving. [default: 65]
37
43
38
44
Reference Type Options:
39
45
--PMID-reference Indicates that the reference_file is a PMID file and only PubMed info will be returned.
"""Query PubMed, ORCID, Google Scholar, and Crossref for publications for the authors.
91
98
92
99
Args:
@@ -96,61 +103,62 @@ def build_publication_dict(config_dict, prev_pubs, no_ORCID, no_GoogleScholar, n
96
103
no_GoogleScholar (bool): if True search Google Scholar else don't.
97
104
no_Crossref (bool): If True search Crossref else don't.
98
105
no_PubMed (bool): If True search PubMed else don't.
106
+
citation_match_ratio (int): if the fuzzy ratio between 2 citations is greater than or equal to this, then consider them to match.
99
107
100
108
Returns:
101
-
publication_dict (dict): The dictionary matching the publication JSON schema.
102
-
prev_pubs (dict): Same as input, but updated with the new publications found.
109
+
running_pubs (dict): The dictionary matching the publication JSON schema.
110
+
all_queries (dict): The pubs searched for each source and each author. {"PubMed":{"author1":[pub1, ...], ...}, "ORCID":{"author1":[pub1, ...], ...}, "Google Scholar":{"author1":[pub1, ...], ...}, "Crossref":{"author1":[pub1, ...], ...}}
103
111
"""
104
112
105
113
## Get publications from PubMed
106
114
helper_functions.vprint("Finding author's publications. This could take a while.")
0 commit comments