-
Notifications
You must be signed in to change notification settings - Fork 4
Uploading content using etd loader
Dan Kerchner edited this page Jun 10, 2018
·
16 revisions
-
Ensure that the
base_pathdirectory configured inconfig.pyis owned by thescholarspaceuser (including the contents ofbase_path) -
Import content with etd-loader
% sudo su - scholarspace % cd /opt/etd-loader % source ENV/bin/activate % python etd_loader.py --only retrieve
If you get an error similar to paramiko.ssh_exception.SSHException: No hostkey for host <HOST> found, you should be able to resolve this by successfully sftp-ing to the host once.
-
Run the rake task to ingest
% sudo su - scholarspace % cd /opt/etd-loader % source bin/ENV/activate % nohup python etd_loader.py --only import &>etdload.out &Monitor
etdload.outfor problems. If an ETD cannot be loaded successfully, remove its.zipfile from theetd_to_be_importeddirectory, and re-runnohup python etd_loader.py --only import &>etdload.out &which will continue loading ETDs from where it left off. -
Create MARC file
% sudo su - scholarspace
% cd /opt/etd-loader
% source bin/ENV/activate
% python etd_loader.py --only marc
- Run the rake task to reindex everything. This job can take a while; running it with
nohupwill allow it to continue running even if you close your shell session.
% cd /opt/scholarspace/scholarspace-hyrax
% rvmsudo rake gwss:reindex_everything RAILS_ENV=production &
- Some errors are recoverable. When an item upload results in:
EOFError: end of file reachedthen occasionally, this upload may succeed with another try. This error almost always crashes solr, requiring solr to be restarted on the Fedora/Solr server.
TBD